Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article
Open access

Each at its Own Pace: Third-Party Dependency and Centralization Around the World

Published: 02 March 2023 Publication History

Abstract

We describe the results of a large-scale study of third-party dependencies around the world based on regional top-500 popular websites accessed from vantage points in 50 countries, together covering all inhabited continents. This broad perspective shows that dependencies on a third-party DNS, CDN or CA provider vary widely around the world, ranging from 19% to as much as 76% of websites, across all countries. The critical dependencies of websites -- where the site depends on a single third-party provider -- are equally spread ranging from 5% to 60% (CDN in Costa Rica and DNS in China, respectively). Interestingly, despite this high variability, our results suggest a highly concentrated market of third-party providers: three providers across all countries serve an average of 92% and Google, by itself, serves an average of 70% of the surveyed websites. Even more concerning, these differences persist a year later with increasing dependencies, particularly for DNS and CDNs. We briefly explore various factors that may help explain the differences and similarities in degrees of third-party dependency across countries, including economic conditions, Internet development, economic trading partners, categories, home countries, and traffic skewness of the country's top-500 sites.

References

[1]
2012. (2012). https://nordvpn.com
[2]
2016. Dyn Analysis Summary of Friday October 21 Attack. (October 2016). http://dyn.com/blog/dyn-analysis-summary-of-friday-october-21-attack/
[3]
2016. Globalsign certificate revocation issue. (October 2016). https://www.globalsign.com/en/status
[4]
2018. (2018). https://www.exportgenius.in/blog/10-major-regional-trading-blocs-of-the-world-236.php
[5]
2019. Azure global outage: Our DNS update mangled domain records, says Microsoft. https://www.zdnet.com/article/azure-global-outage-our-dns-update-mangled-\domain-records-says-microsoft/. (2019).
[6]
2019. How Verizon and a BGP Optimizer Knocked Large Parts of the Internet Offline Today. (June 2019). https://blog.cloudflare.com/how-verizon-and-a-bgp-\optimizer-knocked-large-parts-of-the-internet-offline-today/.
[7]
2020. Consolidation in the Internet Economy. (Feb 2020). https://future.internetsociety.org/2019/
[8]
2020. GoDaddy (all of it) went down this evening. (November 2020). https://domainnamewire.com/2020/11/17/godaddy-is-down/
[9]
2021. (Oct 2021). https://en.wikipedia.org/wiki/Knowledge_Economic_Index
[10]
2022. (May 2022). https://github.com/cisagov/findcdn
[11]
2022. (Mar 2022). https://en.wikipedia.org/wiki/ICT_Development_Index
[12]
Bernhard Ager, Wolfgang Mühlbauer, Georgios Smaragdakis, and Steve Uhlig. 2011. Web Content Cartography.
[13]
Akamai. 2022. Akamai CDN Deployment. (2022). https://seekingalpha.com/article/4379686-akamai-granddaddy-of-cdn-is-well-positioned-for-next-generation-applications
[14]
Mark Allman. 2018. Comments on DNS Robustness.
[15]
Amazon. 2022. Amazon CDN Deployment. (2022). https://aws.amazon.com/cloudfront/features/"whats-new-cloudfront.sort-by=item.additionalFields.postDateTime&whats-new-cloudfront.sort-order=desc
[16]
Jari Arkko. 2019. Centralised Architectures in Internet Infrastructure. IETF Internet Draft (2019).
[17]
Jari Arkko. 2020. The influence of Internet architecture on centralised versus distributed Internet services. Journal of Cyber Policy 5, 1 (2020), 30--45.
[18]
Subin B. 2021. 7 Best Alexa.com Alternatives for Website Ranking and Traffic Analysis. (Dec 2021). https://beebom.com/best-alexa-com-alternatives/
[19]
Samantha Bates, John Bowers, Shane Greenstein, Jordi Weinstock, Yunhan Xu, and Jonathan Zittrain. 2021. Evidence of Decreasing Internet Entropy: The Lack of Redundancy in DNS Resolution by Major Websites and Services. Journal of Quantitative Description: Digital Media 1 (2021). https://doi.org/10.51685/jqd.2021.011
[20]
Alexandros Biliris, Chuck Cranor, Fred Douglis, Michael Rabinovich, Sandeep Sibal, Oliver Spatscheck, and Walter Sturm. 2002. Computer Communications 25 (March 2002). Issue 4.
[21]
Michael Butkiewicz, Harsha V. Madhyastha, and Vyas Sekar. 2011. Understanding Website Complexity: Measurements, Metrics, and Implications.
[22]
Matt Calder, Xun Fan, Zi Hu, Ethan Katz-Bassett, John Heidemann, and Ramesh Govindan. 2013. Mapping the Expansion of Google's Serving Infrastructure.
[23]
Patricia Callejo, Rubén Cuevas, Narseo Vallina-Rodriguez, and Ángel Cuevas. 2019. Measuring the Global Recursive DNS Infrastructure: A View From the Edge. IEEE Access 7 (10 2019), 1--1. https://doi.org/10.1109/ACCESS.2019.2950325
[24]
Taejoong Chung, Jay Lok, Balakrishnan Chandrasekaran, David Choffnes, Dave Levin, Bruce M. Maggs, Alan Mislove, John Rula, Nick Sullivan, and Christo Wilson. 2018. Is the Web Ready for OCSP Must-Staple" Proceedings of the Internet Measurement Conference 2018 (Oct 2018). https://doi.org/10.1145/3278532.3278543
[25]
Cloudflare. 2022. Cloudflare. (2022). https://www.cloudflare.com/network/
[26]
David Coldewey. 2020. Cloudflare DNS goes down, taking a large piece of the Internet with it. TechCrunch Blog. (July 2020). http://tcrn.ch/3pbDJzL
[27]
Trinh Viet Doan, Roland van Rijswijk-Deij, Oliver Hohlfeld, and Vaibhav Bajpai. 2022. An Empirical View on Consolidation of the Web. ACM Transactions on Internet Technology 22, 3 (Aug 2022), 1--30. https://doi.org/10.1145/3503158
[28]
Rodérick Fanou, Gareth Tyson, Eder Leao Fernandes, Pierre Francois, Francisco Valera, and Arjuna Sathiaseelan. 2018. Exploring and Analysing the African Web Ecosystem. ACM Trans. Web (2018), 26. https://doi.org/10.1145/3213897
[29]
Petros Gigis, Matt Calder, Lefteris Manassakis, George Nomikos, Vasleois Kotronis, Xenofontas Dimitropoulos, Ethan Katz-Bassett, and Georgios Smaragdakis. 2021. Seven years in the life of Hypergiants' off-nets.
[30]
Google. 2022. Chrome User Experience Report | Chrome UX Report |Google Developers. (2022). https://developers.google.com/web/tools/chrome-user-experience-report
[31]
Google. 2022. understanding-google-cloud-network-edge-points. (2022). https://cloud.google.com/blog/products/networking/understanding-google-cloud-network-edge-points
[32]
Heritage. 2022. Economic Freedom. (2022). https://www.heritage.org/index/explore"view=by-region-country-year&u=637879938517599600
[33]
Nguyen Phong Hoang, Ivan Lin, Seyedhamed Ghavamnia, and Michalis Polychronakis. 2020. K-resolver: towards decentralizing encrypted DNS resolution. arXiv preprint arXiv:2001.08901 (2020).
[34]
Geoff Huston. 2019. DNS Resolver Centrality. APNIC Blog. (September 2019). https://labs.apnic.net/?p=1260
[35]
Geoff Huston. 2021. CDN and centrality. APNIC Blog. (July 2021). https://blog.apnic.net/2021/07/02/opinion-cdns-and-centrality/
[36]
IMD. 2021. World Digital Competitiveness Rankings - IMD. (2021). https://www.imd.org/centers/world-competitiveness-center/rankings/world-digital-competitiveness/
[37]
IP2Location. [n. d.]. Free IP Geolocation Database. ([n. d.]). https://lite.ip2location.com/
[38]
Aqsa Kashaf, Vyas Sekar, and Yuvraj Agarwal. 2020. Analyzing Third Party Service Dependencies in Modern Web Services: Have We Learned from the Mirai-Dyn Incident?
[39]
Raul Katz and Fernando Callorda. 2018. Accelerating the development of Latin American digital ecosystem and implications for broadband policy. Telecommunications Policy 42, 9 (Oct 2018), 661--681. https://doi.org/10.1016/j.telpol.2017.11.002
[40]
Mohammad Taha Khan, Joe DeBlasio, Geoffrey M. Voelker, Alex C. Snoeren, Chris Kanich, and Narseo Vallina-Rodriguez. 2018. An Empirical Analysis of the Commercial VPN Ecosystem (IMC '18). Association for Computing Machinery, New York, NY, USA, 443--456. https://doi.org/10.1145/3278532.3278570
[41]
Avery Koop. 2021. Mapped: GDP per Capita Worldwide. (July 2021). https://www.visualcapitalist.com/mapped-gdp-per-capita-worldwide/
[42]
Deepak Kumar, Zane Ma, Zakir Durumeric, Ariana Mirian, Joshua Mason, J. Alex Halderman, and Michael Bailey. 2017. Security Challenges in an Increasingly Tangled Web.
[43]
Zhenyu Li, Donghui Yang, Zhenhua Li, Chunjing Han, and Gaogang Xie. 2018. Mobile Content Hosting Infrastructure in China: A View from a Cellular ISP. Passive and Active Measurement Lecture Notes in Computer Science (2018), 100--113.
[44]
Mozilla Public Suffix List. [n. d.]. Public Suffix List. ([n. d.]). https://publicsuffix.org/
[45]
J. Livingood, M. Antonakakis, B. Sleigh, and A. Winfield. 2019. Centralized DNS over HTTPS (DoH) implementation issues and risks. (2019).
[46]
Srdjan Matic, Gareth Tyson, and Gianluca Stringhini. 2019. PYTHIA: a Framework for the Automated Analysis of Web Hosting Environments. The World Wide Web Conference (2019).
[47]
Maxmind. 2022. MaxMind Server IP Addresses. (2022). https://dev.maxmind.com/geoip/geolite2-free-geolocation-data
[48]
Mcafee. 2022. Mcafee sitelookup. (2022). https://sitelookup.mcafee.com/en/feedback/url
[49]
Foivos Michelinakis, Hossein Doroud, Abbas Razaghpanah, Andra Lutu, Narseo Vallina-Rodriguez, Phillipa Gill, and Joerg Widmer. 2018. The Cloud that Runs the Mobile Internet: A Measurement Study of Mobile Cloud Services. https://doi.org/10.1109/INFOCOM.2018.8485872
[50]
Chance Miller. 2021. PSA: Facebook, Instagram, Messenger, and WhatsApp went down for 6 hours; here's why [U]. (Oct 2021). https://9to5mac.com/2021/10/04/instagram-facebook-whatsapp-down/
[51]
Giovane Moura, Sebastian Castro, Wes Hardaker, Maarteb Wullink, and Cristian Hesselman. 2020. Clouding up the Internet: how centralized is DNS traffic becoming?
[52]
NRI. 2022. Network Readiness Index. (2022). https://networkreadinessindex.org/countries/#ranking-wrapper
[53]
Victore Le Pochat, Tom Van Goethem, Samaneh Tajalizadehkhoob, Maciej Korczynski, and Wouter Joosen. 2019. Tranco: A Research-Oriented Top Sites Ranking Hardened Against Manipulation.
[54]
Ingmar Poese, Steve Uhlig, Mohamed Ali Kaafar, Benoit Donnet, and Bamba Gueye. 2011. IP Geolocation Databases: Unreliable? 41, 2 (2011).
[55]
Roxana Radu and Michael Hausding. 2020. Consolidation in the DNS resolver market -- how much, how fast, how dangerous? Journal of Cyber Policy 5 (02 2020), 1--19. https://doi.org/10.1080/23738871.2020.1722191
[56]
Global Rankings. 2022. Connectivity Index. (2022). https://www.mobileconnectivityindex.com/#year=2021&globalRankings=overall&globalRankingsYear=2021
[57]
Quirin Scheitle, Oliver Hohlfeld, Julien Gamba, Jonas Jelten, Torsten Zimmermann, Stephen D. Strowes, and Narseo Vallina-Rodriguez. 2018. A Long Way to the Top. (2018).
[58]
Rachee Singh, Arun Dunna, and Phillipa Gill. 2018. Characterizing the Deployment and Performance of Multi-CDNs.
[59]
Ankit Singla, Balakrishnan Chandrasekaran, P. Brighten Godfrey, and Bruce Maggs. 2014. The Internet at the Speed of Light.
[60]
OCSP Stapling. 2017. The Problem with OCSP Stapling and Must Staple and why Certificate Revocation is still broken - Hanno's blog. (2017). https://blog.hboeck.de/archives/886-The-Problem-with-OCSP-Stapling-and-%5CMust-Staple-and-why-Certificate-Revocation-is-still-broken.html
[61]
Statistics. 2022. Internet Pentrations. (2022). https://www.statista.com/statistics/227082/countries-with-the-highest-internet-penetration-rate/
[62]
Jannick Sørensen and Sokol Kosta. 2019. Before and After GDPR: The Changes in Third Party Presence at Public and Private European Websites. The World Wide Web Conference on - WWW '19 (2019). https://doi.org/10.1145/3308558.3313524
[63]
Timlib. [n. d.]. WebXray Domain Owner List. ([n. d.]).
[64]
Tobias Urban, Martin Degeling, Thorsten Holz, and Norbert Pohlmann. [n. d.]. Beyond the Front Page: Measuring Third Party Dynamics in the Field. ([n. d.]). https://doi.org/10.1145/3366423.3380203
[65]
Whois. 2022. Whois. (2022). https://whois.icann.org/en
[66]
world population. 2022. internet-users-by-country. (2022). https://worldpopulationreview.com/country-rankings/internet-users-by-country
[67]
Jing'an Xue, David Choffnes, and Jilong Wang. 2017. CDNs Meet CN An Empirical Study of CDN Deployments in China. IEEE Access 5 (2017), 5292--5305. https://doi.org/10.1109/ACCESS.2017.2682190
[68]
Bahador Yeganeh, Ramakrishnan Durairajan, Reza Rejaie, and Walter Willinger. 2020. A First Comparative Characterization of Multi-cloud Connectivity in Today's Internet.
[69]
Hao Yin, Bo Qiao, Yan Luo, Li Ruyue, and Y. Yang. 2015. Demystifying commercial content delivery networks in China. Concurrency and Computation: Practice and Experience 27 (06 2015). https://doi.org/10.1002/cpe.3464
[70]
Luciano Zembruzki, Arthur Selle Jacobs, Gustavo Spier Landtreter, Lisandro Zambenedetti Granville, and Giovanne Moura. 2020. dnstracker: Measuring Centralization of DNS Infrastructure in the Wild. In Proc. of AINA.

Cited By

View all
  • (2023)The Role of Advanced Math in Teaching Performance ModelingACM SIGMETRICS Performance Evaluation Review10.1145/3626570.362659151:2(59-64)Online publication date: 2-Oct-2023
  • (2023)Quantifying Security Risks in Cloud Infrastructures: A Data-driven Approach2023 IEEE 9th International Conference on Network Softwarization (NetSoft)10.1109/NetSoft57336.2023.10175501(346-349)Online publication date: 19-Jun-2023

Index Terms

  1. Each at its Own Pace: Third-Party Dependency and Centralization Around the World

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image Proceedings of the ACM on Measurement and Analysis of Computing Systems
        Proceedings of the ACM on Measurement and Analysis of Computing Systems  Volume 7, Issue 1
        POMACS
        March 2023
        749 pages
        EISSN:2476-1249
        DOI:10.1145/3586099
        Issue’s Table of Contents
        This work is licensed under a Creative Commons Attribution International 4.0 License.

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 02 March 2023
        Published in POMACS Volume 7, Issue 1

        Check for updates

        Author Tags

        1. CA
        2. CDN
        3. DNS
        4. centralization
        5. third-party dependency

        Qualifiers

        • Research-article

        Funding Sources

        • Comcast Innovation Fund Award

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)346
        • Downloads (Last 6 weeks)49
        Reflects downloads up to 16 Nov 2024

        Other Metrics

        Citations

        Cited By

        View all
        • (2023)The Role of Advanced Math in Teaching Performance ModelingACM SIGMETRICS Performance Evaluation Review10.1145/3626570.362659151:2(59-64)Online publication date: 2-Oct-2023
        • (2023)Quantifying Security Risks in Cloud Infrastructures: A Data-driven Approach2023 IEEE 9th International Conference on Network Softwarization (NetSoft)10.1109/NetSoft57336.2023.10175501(346-349)Online publication date: 19-Jun-2023

        View Options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Login options

        Full Access

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media