DOI: 10.5555/1267279.1267282

The measured access characteristics of world-wide-web client proxy caches

Published: 08 December 1997

Abstract

The growing popularity of the World Wide Web is placing tremendous demands on the Internet. A key strategy for scaling the Internet to meet these increasing demands is to cache data near clients and thus improve access latency and reduce network and server load. Unfortunately, research in this area has been hampered by a poor understanding of the locality and sharing characteristics of Web-client accesses. The recent popularity of Web proxy servers provides a unique opportunity to improve this understanding, because a small number of proxy servers see accesses from thousands of clients.
This paper presents an analysis of access traces collected from seven proxy servers deployed in various locations throughout the Internet. The traces record a total of 47.4 million requests made by 23,700 clients over a twenty-one day period. We use a combination of static analysis and trace-driven cache simulation to characterize the locality and sharing properties of these accesses.
Our analysis shows that a 2- to 10-GB second-level cache yields hit rates between 24% and 45% with 85% of these hits due to sharing among different clients. Caches with more clients exhibit more sharing and thus higher hit rates. Between 2% and 7% of accesses are consistency misses to unmodified objects, using the Squid and CERN proxy cache coherence protocols. Sharing is bimodal. Requests for shared objects are divided evenly between objects that are narrowly shared and those that are shared by many clients; widely shared objects also tend to be shared by clients from unrelated traces.
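The trace-driven simulation mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' simulator. It assumes a byte-bounded LRU replacement policy, a hypothetical trace format of (client, url, size) tuples, and a simplified definition of a sharing hit: a hit on an object that the requesting client has not itself requested since the object entered the cache. Consistency misses are not modeled.

```python
from collections import OrderedDict


def simulate(trace, capacity_bytes):
    """Replay (client, url, size) requests through a byte-bounded LRU cache.

    Assumptions (not from the paper): LRU replacement, no expiry or
    consistency checks, and objects larger than the cache are never stored.
    """
    cache = OrderedDict()  # url -> (size, clients that requested it while cached)
    used = 0
    requests = hits = shared_hits = 0

    for client, url, size in trace:
        requests += 1
        if url in cache:
            hits += 1
            _, clients = cache[url]
            if client not in clients:
                # First request from this client: the hit exists only
                # because another client brought the object in.
                shared_hits += 1
                clients.add(client)
            cache.move_to_end(url)  # mark as most recently used
            continue

        if size > capacity_bytes:
            continue  # too large to cache at all

        # Evict least recently used objects until the new one fits.
        while used + size > capacity_bytes:
            _, (old_size, _) = cache.popitem(last=False)
            used -= old_size
        cache[url] = (size, {client})
        used += size

    return {
        "hit_rate": hits / requests if requests else 0.0,
        "shared_fraction": shared_hits / hits if hits else 0.0,
    }


if __name__ == "__main__":
    # Tiny made-up trace, for illustration only.
    demo = [
        ("a", "/x", 10), ("b", "/x", 10), ("a", "/x", 10),
        ("a", "/y", 10), ("b", "/z", 10), ("a", "/x", 10),
    ]
    print(simulate(demo, capacity_bytes=20))
```

Running the same trace at several values of `capacity_bytes` gives a hit-rate curve of the kind the abstract summarizes, and `shared_fraction` corresponds loosely to the share of hits attributed to sharing among clients.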


Published In

USITS'97: Proceedings of the USENIX Symposium on Internet Technologies and Systems
December 1997
266 pages

Publisher

USENIX Association

United States
