Nothing Special   »   [go: up one dir, main page]

skip to main content
article
Free access

Contrasting characteristics and cache performance of technical and multi-user commercial workloads

Published: 01 November 1994 Publication History

Abstract

Experience has shown that many widely used benchmarks are poor predictors of the performance of systems running commercial applications. Research into this anomaly has long been hampered by a lack of address traces from representative multi-user commercial workloads. This paper presents research, using traces of industry-standard commercial benchmarks, which examines the characteristic differences between technical and commercial workloads and illustrates how those differences affect cache performance.
Commercial and technical environments differ in their respective branch behavior, operating system activity, I/O, and dispatching characteristics. A wide range of uniprocessor instruction and data cache geometries were studied. The instruction cache results for commercial workloads demonstrate that instruction cache performance can no longer be neglected because these workloads have much larger code working sets than technical applications. For database workloads, a breakdown of kernel and user behavior reveals that the application component can exhibit behavior similar to the operating system and therefore, can experience miss rates equally high. This paper also indicates that “dispatching” or process switching characteristics must be considered when designing level-two caches. The data presented shows that increasing the associativity of second-level caches can reduce miss rates significantly. Overall, the results of this research should help system designers choose a cache configuration that will perform well in commercial markets.

References

[1]
Anant Agarwal, "Analysis of Cache Performance for Operating Systems and Multi-programming", ACM Transactions on Computer Systems, Vol. 6. No. 4, Nov. 1988.
[2]
James Bell and David Casasent, "An investigation of Alternative Cache Organizations", IEEE Transactions on Computers, Vol. C-23. No. 4, April 1974, p. 346-351.
[3]
Anita Borg, R.E. Kessler, Georgia Lazana, and David W. Wall, "Long Address Traces from RISC Machines: Generation and Analysis'', WRL Research Report 89/14, DEC Research Lab, Sept. 1989.
[4]
J. Bradley Chert and Brian N. Bershad, "The Impact of Operating System Structure on Memory System Performance", Operating Systems Review, Vol. 27., No. 5, Dec. 1993.
[5]
Douglas W. Clark, "Cache Performance in the VAX-11/ 780", ACM Transactions on Computer Systems, Vol. 1., No. 1, Feb. 1983.
[6]
Thomas M. Conte and Wen-mei W. Hwu, "Benchmark Characterization", Computer, January 1991, pp 48-56.
[7]
Zarka Cvetanovic and Dileep Bhandarkar, "Characterization of Alpha Performance using TP and SPEC Workloads", Proceedings of the 21th international Symposium on Computer Architecture, Chicago, Illinois, April 18-21, 1994.
[8]
M. Franklin, W. Alexander, R. Jauhari, A. M. G. Maynard, and B. R. Olszewski, "Commercial Workload Performance in the IBM POWER2 RISC System/6000 Processor," IBM Journal of Research and Development, Vol. 38, No. 4, July 1994.
[9]
Jeffrey D. Gee, Mark D. Hill, Dionisios N. Pnevmatikatos, Alan Jay Smith, "Cache Performance of the SPEC92 Benchmark Suite", IEEE Micro, Aug. 1993, p. 17-27.
[10]
John L. Hennessy and David A. Patterson, "Computer Architecture: A Quantitative Approach", Morgan Kaufmann Publishers, Inc., 1990.
[11]
Norma P. Jouppi, "Improving Direct-Mapped Cache Performance by the Addition of a Small Fully-Associative Cache and Prefetch Buffers", 17th Annual international Symposium on Computer Architecture, Seattle, Washington, May 1990.
[12]
Ann Marie Grizzaffi Maynard, Colette M. Donnelly, Bret R. Olszewski, "Contrasting Characteristics and Cache Performance of Technical and Multi-User Commercial Workloads", IBM Technical Report, 1994. (submitted for pubhcauon)
[13]
John J. McGrory II, Alexander Carlton, and Bradley J. Askins, "Transaction Processing Performance on PA-RISC Commercial Unix Systems", COMPCON, Spring 1992, 37th IEEE Computer Society International Conference, San Francisco, CA., Feb. 1992, p. 199-206.
[14]
M. Misra, ed., IBM RISC System/6000 Technology, IBM Corporation, 1990, IBM Publication SA 23-2619.
[15]
A. Poursepanj et aL, "The PowerPCTM 603 Microprocessor: Performance Analysis and Design Trade-offs," Proceedings of COMPCON 1994, February 1994.
[16]
Cache and Memory Hierarchy Design: A Performance Directed Approach, Morgan Kaufmann Publishers, Inc., 1990.
[17]
Alan Jay Smith, "Cache Memories", Computing Surveys, Vol. 14, No. 3, Sept. 1982.
[18]
Alan Jay Smith, "Cache Evaluataons and the Impact of Workload Chome", 12th Annual International Symposium on Computer Architecture Conference Proceedings, Boston, MA, June 17-19, 1985.
[19]
SPEC Newsletter, Vol. 3., No. 4, December 1991, pp 18-21.
[20]
Chriss Stephens, Bryce CogsweU, John Heinlem, Gregory Palmer, and John P. Shen, "Instruction Level Profiling and Evaluation of the IBM RISC System/60000", Proceedings of the 18th International Symposium on Computer Architecture, Toronto, Canada, May 1991.
[21]
Josep Torrellas, Anoop Gupta, and John Hennessy, "Characterizing the Caching and SynchronizaUon Performance of a Multiprocessor Operating System", Fifth International Conference on Architectural Support for Programming Languages and Operating Systems, Boston, MA, October 12-15, 1992.

Cited By

View all
  • (2012)Power-aware multi-core simulation for early design stage hardware/software co-optimizationProceedings of the 21st international conference on Parallel architectures and compilation techniques10.1145/2370816.2370820(3-12)Online publication date: 19-Sep-2012
  • (2007)RxACM Transactions on Computer Systems10.1145/1275517.127551925:3(7-es)Online publication date: 1-Aug-2007
  • (2007)Experience distributing objects in an SMMP OSACM Transactions on Computer Systems10.1145/1275517.127551825:3(6-es)Online publication date: 1-Aug-2007
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 November 1994
Published in SIGPLAN Volume 29, Issue 11

Check for updates

Author Tags

  1. cache performance
  2. commercial workloads
  3. memory subsystems
  4. operating system activity
  5. technical applications

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)185
  • Downloads (Last 6 weeks)36
Reflects downloads up to 02 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2012)Power-aware multi-core simulation for early design stage hardware/software co-optimizationProceedings of the 21st international conference on Parallel architectures and compilation techniques10.1145/2370816.2370820(3-12)Online publication date: 19-Sep-2012
  • (2007)RxACM Transactions on Computer Systems10.1145/1275517.127551925:3(7-es)Online publication date: 1-Aug-2007
  • (2007)Experience distributing objects in an SMMP OSACM Transactions on Computer Systems10.1145/1275517.127551825:3(6-es)Online publication date: 1-Aug-2007
  • (2007)Out-of-core coherent closed quasi-clique mining from large dense graph databasesACM Transactions on Database Systems10.1145/1242524.124253032:2(13-es)Online publication date: 1-Jun-2007
  • (2007)An empirical study of static program slice sizeACM Transactions on Software Engineering and Methodology10.1145/1217295.121729716:2(8-es)Online publication date: 1-Apr-2007
  • (2007)Named entity translation matching and learningACM Transactions on Information Systems10.1145/1198296.119829825:1(2-es)Online publication date: 1-Feb-2007
  • (2007)Precision recall with user modeling (PRUM)ACM Transactions on Information Systems10.1145/1198296.119829725:1(1-es)Online publication date: 1-Feb-2007
  • (2006)Spatial graph grammars for graphical user interfacesACM Transactions on Computer-Human Interaction10.1145/1165734.116573913:2(268-307)Online publication date: 1-Jun-2006
  • (2006)Loop scheduling with timing and switching-activity minimization for VLIW DSPACM Transactions on Design Automation of Electronic Systems10.1145/1124713.112472411:1(165-185)Online publication date: 1-Jan-2006
  • (2006)Compilers for leakage power reductionACM Transactions on Design Automation of Electronic Systems10.1145/1124713.112472311:1(147-164)Online publication date: 1-Jan-2006
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media