Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1242572.1242805acmconferencesArticle/Chapter ViewAbstractPublication PagesthewebconfConference Proceedingsconference-collections
Article

Finding community structure in mega-scale social networks: [extended abstract]

Published: 08 May 2007 Publication History

Abstract

Community analysis algorithm proposed by Clauset, Newman, and Moore (CNM algorithm) finds community structure in social networks. Unfortunately, CNM algorithm does not scale well and its use is practically limited to networks whose sizes are up to 500,000 nodes. We show that this inefficiency is caused from merging communities in unbalanced manner and that a simple heuristics that attempts to merge community structures in a balanced manner can dramatically improve community structure analysis. The proposed techniques are tested using data sets obtained from existing social networking service that hosts 5.5 million users. We have tested three three variations of the heuristics. The fastest method processes a SNS friendship network with 1 million users in 5 minutes (70 times faster than CNM) and another friendship network with 4 million users in 35 minutes, respectively. Another one processes a network with 500,000 nodes in 50 minutes (7 times faster than CNM), finds community structures that has improved modularity, and scales to a network with 5.5 million.

References

[1]
A. Clauset, M. E. J. Newman, and C. Moore. Finding community structure in very large networks. Physical Review E, 70:066111, 2004.
[2]
M. E. J. Newman and M. Girvan. Finding and evaluating community structure in networks. Physical Review E, 69:026113, 2004.
[3]
F. Radicchi, C. Castellano, F. Cecconi, V. Loreto, and D. Parisi. Defining and identifying communities in networks. Proc. Natl. Acad. Sci. USA, 101:2658, 2004.
[4]
Ken Wakita and Toshiyuki Tsurumi. Finding community structure in mega-scale social networks, February 2007, cs.CY/0702048. http://arxiv.org/abs/cs.CY/0702048v1.

Cited By

View all
  • (2024)Implicit Virtual Communities in Social NetworksBuilding Power, Safety, and Trust in Virtual Communities10.4018/979-8-3693-3868-1.ch007(145-172)Online publication date: 22-Nov-2024
  • (2024)Response Item Network (ResIN): A network-based approach to explore attitude systemsHumanities and Social Sciences Communications10.1057/s41599-024-03037-x11:1Online publication date: 8-May-2024
  • (2024)Uncovering migration systems through spatio-temporal tensor co-clusteringScientific Reports10.1038/s41598-024-78112-z14:1Online publication date: 6-Nov-2024
  • Show More Cited By

Index Terms

  1. Finding community structure in mega-scale social networks: [extended abstract]

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      WWW '07: Proceedings of the 16th international conference on World Wide Web
      May 2007
      1382 pages
      ISBN:9781595936547
      DOI:10.1145/1242572
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 08 May 2007

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. clustering
      2. community analysis
      3. social networking system

      Qualifiers

      • Article

      Conference

      WWW'07
      Sponsor:
      WWW'07: 16th International World Wide Web Conference
      May 8 - 12, 2007
      Alberta, Banff, Canada

      Acceptance Rates

      Overall Acceptance Rate 1,899 of 8,196 submissions, 23%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)53
      • Downloads (Last 6 weeks)1
      Reflects downloads up to 26 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Implicit Virtual Communities in Social NetworksBuilding Power, Safety, and Trust in Virtual Communities10.4018/979-8-3693-3868-1.ch007(145-172)Online publication date: 22-Nov-2024
      • (2024)Response Item Network (ResIN): A network-based approach to explore attitude systemsHumanities and Social Sciences Communications10.1057/s41599-024-03037-x11:1Online publication date: 8-May-2024
      • (2024)Uncovering migration systems through spatio-temporal tensor co-clusteringScientific Reports10.1038/s41598-024-78112-z14:1Online publication date: 6-Nov-2024
      • (2024)PSA-GNNNeural Networks10.1016/j.neunet.2024.106155173:COnline publication date: 2-Jul-2024
      • (2024)MST-GNN: graph neural network with multi-granularity in space and time for traffic predictionGeoInformatica10.1007/s10707-024-00532-wOnline publication date: 15-Nov-2024
      • (2024)Online Public Debate. How Can We Make It More Intelligent?Collective Intelligence in Open Policymaking10.1007/978-3-031-58191-5_4(225-299)Online publication date: 30-May-2024
      • (2023)Fuelling the climate and science ‘denial machine’ on social media: A case study of the Great Barrier Reef’s 2021 ‘in danger’ recommendation on Twitter, YouTube and FacebookPublic Understanding of Science10.1177/0963662523120211733:3(270-289)Online publication date: 26-Oct-2023
      • (2023)Structural Reconstruction of Signed Social NetworksIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.318601710:5(2599-2612)Online publication date: Oct-2023
      • (2023)Efficient Storage Management for Social Network Events Based on Clustering and Hot/Cold Data ClassificationIEEE Transactions on Computational Social Systems10.1109/TCSS.2022.314631010:1(120-130)Online publication date: Mar-2023
      • (2023)Siloed discourses: a year-long study of twitter engagement on the use of CRISPR in food and agricultureNew Genetics and Society10.1080/14636778.2023.224836342:1Online publication date: 22-Aug-2023
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media