Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3110025.3110141acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
short-paper

Unbiased Sampling of Social Media Networks for Well-connected Subgraphs

Published: 31 July 2017 Publication History

Abstract

Sampling social graphs is critical for studying things like information diffusion. However, it is often necessary to laboriously obtain unbiased and well-connected datasets because existing survey algorithms are unable to generate well-connected samples, and current random-walk based unbiased sampling algorithms adopt rejection sampling, which heavily undermines performance. This paper proposes a novel random-walk based algorithm which implements Unbiased Sampling using Dummy Edges (USDE). It injects dummy edges between nodes, on which the walkers would otherwise experience excessive rejections before moving out from such nodes. We propose a rejection probability estimation algorithm to facilitate the construction of dummy edges and the computation of moving probabilities. Finally, we apply USDE in two real-life social media: Twitter and Sina Weibo. The results demonstrate that USDE generates well-connected samples, and outperforms existing approaches in terms of sampling efficiency and quality of samples.

References

[1]
D. Wang, H. Park, G. Xie, S. Moon, M.-A. Kaafar, and K. Salamatian, "A genealogy of information spreading on microblogs: A galton-watson-based explicative model," in Proceedings IEEE INFOCOM, 2013.
[2]
D. Wang, Z. Li, G. Xie, M. A. Kaafar, and K. Salamatian, "Adwords management for third-parties in sem: An optimisation model and the potential of twitter," in Proceedings of IEEE INFOCOM, 2016.
[3]
M. Gjoka, M. Kurant, C. T. Butts, and A. Markopoulou, "Walking in facebook: A case study of unbiased sampling of osns," in Proceedings of the IEEE INFOCOM, 2010.
[4]
B. Ribeiro, P. Wang, F. Murai, and D. Towsley, "Sampling directed graphs with random walks," in Proceedings of IEEE INFOCOM, 2012.
[5]
C.-E. Sarndal, "Stratified sampling," in Model Assisted Survey Sampling. Springer, 2003.
[6]
M. R. Henzinger, A. Heydon, M. Mitzenmacher, and M. Najork, "On near-uniform url sampling," Computer Networks, vol. 33, no. 1, 2000.
[7]
D. Wang, Z. Li, and G. Xie, "Towards unbiased sampling of online social networks," in Proceedings of the IEEE ICC, 2011.
[8]
T. Wang, Y. Chen, Z. Zhang, P. Sun, B. Deng, and X. Li, "Unbiased sampling in directed social graph," ACM SIGCOMM Computer Communication Review, vol. 41, no. 4, 2011.
[9]
M. Cha, H. Haddadi, F. Benevenuto, and P. K. Gummadi, "Measuring user influence in twitter: The million follower fallacy." in Proceedings of ICWSM, 2010.
[10]
H. Kwak, C. Lee, H. Park, and S. Moon, "What is twitter, a social network or a news media?" in Proceedings of WWW, 2010.
[11]
B. F. Ribeiro and D. Towsley, "On the estimation accuracy of degree distributions from graph sampling," in Proceedings of IEEE Decision and Control (CDC), 2012.
[12]
M. Piraveenan, M. Prokopenko, and A. Zomaya, "Local assortativeness in scale-free networks," EPL (Europhysics Letters), vol. 89, no. 4, p. 49901, 2010.

Cited By

View all
  • (2023)A Fast Dynamic Adaptive Sampling Algorithm for Large-Scale Online Social NetworksComputer Science and Education10.1007/978-981-99-2443-1_30(337-349)Online publication date: 14-May-2023

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ASONAM '17: Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017
July 2017
698 pages
ISBN:9781450349932
DOI:10.1145/3110025
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 31 July 2017

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Short-paper
  • Research
  • Refereed limited

Funding Sources

Conference

ASONAM '17
Sponsor:

Acceptance Rates

Overall Acceptance Rate 116 of 549 submissions, 21%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 14 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)A Fast Dynamic Adaptive Sampling Algorithm for Large-Scale Online Social NetworksComputer Science and Education10.1007/978-981-99-2443-1_30(337-349)Online publication date: 14-May-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media