Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1060590.1060693acmconferencesArticle/Chapter ViewAbstractPublication PagesstocConference Proceedingsconference-collections
Article

On the bias of traceroute sampling: or, power-law degree distributions in regular graphs

Published: 22 May 2005 Publication History

Abstract

Understanding the structure of the Internet graph is a crucial step for building accurate network models and designing efficient algorithms for Internet applications. Yet, obtaining its graph structure is a surprisingly difficult task, as edges cannot be explicitly queried. Instead, empirical studies rely on traceroutes to build what are essentially single-source, all-destinations, shortest-path trees. These trees only sample a fraction of the network's edges, and a recent paper by Lakhina et al. found empirically that the resuting sample is intrinsically biased. For instance, the observed degree distribution under traceroute sampling exhibits a power law even when the underlying degree distribution is Poisson.In this paper, we study the bias of traceroute sampling systematically, and, for a very general class of underlying degree distributions, calculate the likely observed distributions explicitly. To do this, we use a continuous-time realization of the process of exposing the BFS tree of a random graph with a given degree distribution, calculate the expected degree distribution of the tree, and show that it is sharply concentrated. As example applications of our machinery, we show how traceroute sampling finds power-law degree distributions in both δ-regular and Poisson-distributed random graphs. Thus, our work puts the observations of Lakhina et al. on a rigorous footing, and extends them to nearly arbitrary degree distributions.

References

[1]
L. Amini, A. Shaikh, and H. Schulzrinne. Issues with inferring Internet topological attributes. In SPIE IT-Com, 2002.
[2]
P. Barford, A. Bestavros, J. Byers, and M. Crovella. On the marginal utility of network topology measurements. In SIGCOMM Internet Measurement Workshop, 2001.
[3]
B. Bollobás. Random graphs. Academic Press, 1985.
[4]
B. Bollobás and F. Chung. The diameter of a cycle plus a random matching. SIAM J. Disc. Math., 1:328--333, 1988.
[5]
Q. Chen, H. Chang, R. Govindan, S. Jamin, S. J. Shenker, and W. Willinger. The origin of power laws in Internet topologies revisited. In Proc. 21st ACM SIGCOMM Conference, 2002.
[6]
A. Clauset and C. Moore. Accuracy and scaling phenomena in Internet mapping. Physical Review Letters, 94:018701, 2005.
[7]
P. Erdős and A. Rényi. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci., 5:17--61, 1960.
[8]
A. Fabrikant, E. Koutsoupias, and C. Papadimitriou. Heuristically optimized trade-offs: A new paradigm for power laws in the Internet. In Proc. 29th Intl. Colloq. on Automata, Languages and Programming, pages 110--122, 2002.
[9]
M. Faloutsos, P. Faloutsos, and C. Faloutsos. On power-law relationships of the Internet topology. In Proc. ACM SIGCOMM Conference, pages 251--262, 1999.
[10]
R. Govindan and H. Tangmunarunkit. Heuristics for Internet map discovery. In Proc. 19th ACM SIGCOMM Conference, 2000.
[11]
W. Hoeffding. Probability inequalities for sums of bounded random variables. J. of the American Statistical Association, 58:13--30, 1963.
[12]
J. H. Kim. Poisson cloning model for random graphs. Preprint, 2004.
[13]
J. Kleinberg, S. Kumar, P. Raghavan, S. Rajagopalan, and A. Tomkins. The web as a graph: Measurements, models and methods. In Intl. Conf. on Combinatorics and Computing, 1999.
[14]
A. Lakhina, J. Byers, M. Crovella, and P. Xie. Sampling biases in IP topology measurements. In Proc. 22nd ACM SIGCOMM Conference, 2003.
[15]
J. Leguay, M. Latapy, T. Friedman, and K. Salamatian. Describing and simulating internet routes. Preprint cs.NI/0411051, 2004.
[16]
C. McDiarmid. Concentration. In M. Habib, C. McDiarmid, J. Ramirez-Alfonsin, and B. Reed, editors, Probabilistic Methods for Algorithmic Discrete Mathematics, pages 195--247. Springer, 1998.
[17]
M. Mihail, C. Papadimitriou, and A. Saberi. On certain connectivity properties of the Internet topology. In Proc. 35th ACM Symp. on Theory of Computing, 2003.
[18]
M. Molloy and B. Reed. A critical point for random graphs with a given degree sequence. Random Structures and Algorithms, 6:161--180, 1995.
[19]
M. Molloy and B. Reed. The size of the largest component of a random graph on a fixed degree sequence. Combinatorics, Probability and Computing, 7:295--306, 1998.
[20]
R. Motwani and P. Raghavan. Randomized Algorithms. Cambridge University Press, 1990.
[21]
J.-J. Pansiot and D. Grad. On routers and multicast trees in the Internet. ACM SIGCOMM Communication Review, 28:41--50, 1998.
[22]
T. Petermann and P. De Los Rios. Exploration of scale-free networks: do we measure the real exponents? Euro. Phys. Journal B, 38:201, 2004.
[23]
J. B. Seaborn. Hypergeometric Functions and Their Applications. Springer, 1991.
[24]
J. Spanier and K.B. Oldham. The exponential integral ei(x) and related functions. In An Atlas of Functions, chapter 37, pages 351--360. Hemisphere, 1987.
[25]
H.S. Wilf. generatingfunctionology. Academic Press, 1994.
[26]
N.C. Wormald. Models of random regular graphs. In J. Lamb and D. Preece, editors, London Math. Soc. Lecture Note Series, volume 276, pages 239--298. Cambridge University Press, 1999.

Cited By

View all
  • (2021)A Statistical Framework for Modern Network ScienceStatistical Science10.1214/19-STS75936:1Online publication date: 1-Feb-2021
  • (2019)Inferring Structural Characteristics of Social Network for Smart Grid Adoption and Diffusion2019 International Conference on Intelligent Computing, Automation and Systems (ICICAS)10.1109/ICICAS48597.2019.00015(29-34)Online publication date: Dec-2019
  • (2019)Fast crawling methods of exploring content distributed over large graphsKnowledge and Information Systems10.1007/s10115-018-1178-x59:1(67-92)Online publication date: 1-Apr-2019
  • Show More Cited By

Index Terms

  1. On the bias of traceroute sampling: or, power-law degree distributions in regular graphs

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      STOC '05: Proceedings of the thirty-seventh annual ACM symposium on Theory of computing
      May 2005
      778 pages
      ISBN:1581139608
      DOI:10.1145/1060590
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 22 May 2005

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. internet mapping
      2. power laws
      3. random graphs

      Qualifiers

      • Article

      Conference

      STOC05
      Sponsor:
      STOC05: Symposium on Theory of Computing
      May 22 - 24, 2005
      MD, Baltimore, USA

      Acceptance Rates

      Overall Acceptance Rate 1,469 of 4,586 submissions, 32%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 21 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2021)A Statistical Framework for Modern Network ScienceStatistical Science10.1214/19-STS75936:1Online publication date: 1-Feb-2021
      • (2019)Inferring Structural Characteristics of Social Network for Smart Grid Adoption and Diffusion2019 International Conference on Intelligent Computing, Automation and Systems (ICICAS)10.1109/ICICAS48597.2019.00015(29-34)Online publication date: Dec-2019
      • (2019)Fast crawling methods of exploring content distributed over large graphsKnowledge and Information Systems10.1007/s10115-018-1178-x59:1(67-92)Online publication date: 1-Apr-2019
      • (2018)Ego-Centered View to Overcome the Internet Measurement Challenges in West Africae-Infrastructure and e-Services for Developing Countries10.1007/978-3-319-98827-6_36(377-383)Online publication date: 21-Aug-2018
      • (2017)Revisiting Power-law Distributions in Spectra of Real World NetworksProceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining10.1145/3097983.3098128(817-826)Online publication date: 13-Aug-2017
      • (2017)Inferring Structural Characteristics of Networks With Strong and Weak Ties From Fixed-Choice SurveysIEEE Transactions on Signal and Information Processing over Networks10.1109/TSIPN.2017.27310533:3(513-525)Online publication date: Sep-2017
      • (2017)Neighbor-Neighbor Correlations Explain Measurement Bias in NetworksScientific Reports10.1038/s41598-017-06042-07:1Online publication date: 17-Jul-2017
      • (2017)Topics at the Frontier of Statistics and Network Analysis10.1017/9781108290159Online publication date: 6-Jun-2017
      • (2016)The "Majority Illusion" in Social NetworksPLOS ONE10.1371/journal.pone.014761711:2(e0147617)Online publication date: 17-Feb-2016
      • (2016)Inferring network properties from fixed-choice design with strong and weak ties2016 IEEE Statistical Signal Processing Workshop (SSP)10.1109/SSP.2016.7551713(1-5)Online publication date: Jun-2016
      • Show More Cited By

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media