Optima Network Science Seminar

 

Faculty Advisor: My T. Thai
Coordinator: Thang N. Dinh
Time and place:  1:50pm Wed, E520A CSE Building

Note: If you are interested in participating in the activities of the seminar and/or wish to receive emails about
upcoming events please send an email to  tdinh@cise.ufl.edu.



by Subhankar on Mar 13, 2013

Defining and evaluating network communities based on ground-truth

Jaewon Yang, Jure Leskovec, Proceedings of the ACM SIGKDD Workshop on Mining Data Semantics, 2012 [PDF]

Abstract
Nodes in real-world networks, such as social, information or technological networks, organize into communities where edges appear with high concentration among the members of the community. Identifying communities in networks has proven to be a challenging task mainly due to a plethora of definitions of a community, intractability of algorithms, issues with evaluation and the lack of a reliable gold-standard ground-truth.
We study a set of 230 large social, collaboration and information networks where nodes explicitly define group memberships. We use these groups to define the notion of ground-truth communities. We then propose a methodology which allows us to compare and quantitatively evaluate different definitions of network communities on a large scale. We choose 13 commonly used definitions of network communities and examine their quality, sensitivity and robustness. We show that the 13 definitions naturally group into four classes. We find that two of these definitions, Conductance and Triad-participation-ratio, consistently give the best performance in identifying ground-truth communities.





 

New Post | Archive of Past Seminars