Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/800015.808178acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
Article
Free access

Experimental evaluation of on-chip microprocessor cache memories

Published: 01 January 1984 Publication History

Abstract

Advances in integrated circuit density are permitting the implementation on a single chip of functions and performance enhancements beyond those of a basic processors. One performance enhancement of proven value is a cache memory; placing a cache on the processor chip can reduce both mean memory access time and bus traffic. In this paper we use trace driven simulation to study design tradeoffs for small (on-chip) caches. Miss ratio and traffic ratio (bus traffic) are the metrics for cache performance. Particular attention is paid to sub-block caches (also known as sector caches), in which address tags are associated with blocks, each of which contains multiple sub-blocks; sub-blocks are the transfer unit. Using traces from two 16-bit architectures (Z8000, PDP-11) and two 32-bit architectures (VAX-11, System/370), we find that general purpose caches of 64 bytes (net size) are marginally useful in some cases, while 1024-byte caches perform fairly well; typical miss and traffic ratios for a 1024 byte (net size) cache, 4-way set associative with 8 byte blocks are: PDP-11: .039, .156, Z8000: .015, .060, VAX 11: .080, .160, Sys/370: .244, .489. (These figures are based on traces of user programs and the performance obtained in practice is likely to be less good.) The use of sub-blocks allows tradeoffs between miss ratio and traffic ratio for a given cache size. Load forward is quite useful. Extensive simulation results are presented.

References

[1]
A.J. Smith, "Cache Memories," Computing Surveys, vol. 14, no. 3, pp. 473-530, September, 1982.
[2]
J.S. Liptay, "Structural Aspects of the System/360 Model 85, Part II: The Cache," IBM Systems Journal, vol. 7, no. 1, pp. 15-21, 1968.
[3]
J. Bell, D. Casasent, and C.G. Bell, "An Investigation of Alternative Cache Organizations," IEEE Trans. on Computers, vol. C-23, no. 4, pp. 346-351, April 1974.
[4]
W.D. Strecker, "Cache Memories for PDP-11 Family Computers," Proc. Third International Symposium on Computer Architecture, pp. 155-158, January 1976.
[5]
J.R. Goodman, "Using Cache Memory to Reduce Processor Memory Traffic," Proc. Tenth International Symposium on Computer Architecture, pp. 124-131, Stockholm, Sweden, June 1983.
[6]
N. Tredennick, Personal Communication, 1983.
[7]
J.K.F. Lee and A.J. Smith, "Branch Prediction Strategies and Branch Target Buffer Design," Computer, vol. 17, no. 1, pp. 6 - 22, January, 1984.
[8]
VAX Hardware Handbook, Digital Equipment Corporation, Maynard, Massachusetts 01754, 1980.
[9]
D.P. Siewiorek, C.G. Bell, and A. Newell, Computer Structures: Principles and Examples, New York, 1982. McGraw Hill
[10]
B.T. Hailpern and B.L. Hitson, S-1 Architecture Manual, January 1979. Stanford Univ. CSL Report STAN-CS-79-715.
[11]
A.J. Smith, "Sequential Program Prefetching in Memory Hierarchies," Computer, vol. 11, no. 12, pp. 7-21, December 1978.
[12]
D.A. Patterson, P. Garrison, M.D. Hill, D. Lioupis, C. Nyberg, T.N. Sippel, and K.S. Van Dyke, "Architecture of a VLSI Instruction Cache for a RISC," Proc. Tenth International Symposium on Computer Architecture, pp. 108-116, Stockholm, Sweden, June 1983.
[13]
M.G.H. Katevenis, R.W. Sherburne, D.A. Patterson, and C.H. Séquin, "The RISC II Micro-Architecture," Proc. VLSI 88 Conference, Trondheim, Norway, August 1983.
[14]
D.A. Patterson and C.H. Séquin, "RISC I: A Reduced Instruction Set VLSI Computer," Proc. Eighth International Symposium on Computer Architecture, pp. 443-457, Minneapolis, Minnesota, May 1981.
[15]
A.J. Smith, "A Compararative Study of Set Associative Memory Mapping Algorithms and Their Use for Cache and Main Memory," IEEE Trans. on Software Engineering, vol. SE-4, no. 2, pp. 121-130, March 1978.
[16]
R.L. Mattson, J. Gecsei, D.R. Sluts, and I.L. Traiger, "Evaluation techniques for storage hierarchies," IBM Systems Journal, vol. 9, no. 2, pp. 78-117, 1970.
[17]
D. Bursky, "Innovative Chip Designs Lead to Dense, Superfast RAMs," Electronic Design, pp. 97-112, 18 August 1983.
[18]
M.D. Hill, Evaluation of On-Chip Cache Memories, December 1983. Master's Report, U.C. Berkeley.
[19]
D. Alpert, D. Carberry, M. Yamamura, Y. Chow, and P. Mak, "32-bit Processor Chip Integrates Major System Functions," Electronics, pp. 113-119, 14 July 1983.

Cited By

View all
  • (2023)Mosaic Pages: Big TLB Reach with Small PagesProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582021(433-448)Online publication date: 25-Mar-2023
  • (2017)Micro-Sector CacheACM Transactions on Architecture and Code Optimization10.1145/304668014:1(1-29)Online publication date: 21-Mar-2017
  • (2017)Near-Optimal Access Partitioning for Memory Hierarchies with Multiple Heterogeneous Bandwidth Sources2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2017.46(13-24)Online publication date: Feb-2017
  • Show More Cited By

Index Terms

  1. Experimental evaluation of on-chip microprocessor cache memories

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ISCA '84: Proceedings of the 11th annual international symposium on Computer architecture
    January 1984
    373 pages
    ISBN:0818605383
    DOI:10.1145/800015
    • cover image ACM SIGARCH Computer Architecture News
      ACM SIGARCH Computer Architecture News  Volume 12, Issue 3
      June 1984
      348 pages
      ISSN:0163-5964
      DOI:10.1145/773453
      Issue’s Table of Contents

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 January 1984

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Article

    Acceptance Rates

    Overall Acceptance Rate 543 of 3,203 submissions, 17%

    Upcoming Conference

    ISCA '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)88
    • Downloads (Last 6 weeks)18
    Reflects downloads up to 03 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)Mosaic Pages: Big TLB Reach with Small PagesProceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3582016.3582021(433-448)Online publication date: 25-Mar-2023
    • (2017)Micro-Sector CacheACM Transactions on Architecture and Code Optimization10.1145/304668014:1(1-29)Online publication date: 21-Mar-2017
    • (2017)Near-Optimal Access Partitioning for Memory Hierarchies with Multiple Heterogeneous Bandwidth Sources2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2017.46(13-24)Online publication date: Feb-2017
    • (2013)Skinflint DRAM systemProceedings of the 2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA.2013.6522304(25-34)Online publication date: 23-Feb-2013
    • (2011)Efficiently enabling conventional block sizes for very large die-stacked DRAM cachesProceedings of the 44th Annual IEEE/ACM International Symposium on Microarchitecture10.1145/2155620.2155673(454-464)Online publication date: 3-Dec-2011
    • (2009)MinervaProceedings of the 4th International Symposium on High Performance Computing10.1007/3-540-47847-7_8(64-77)Online publication date: 18-May-2009
    • (2008)Cache Noise PredictionIEEE Transactions on Computers10.1109/TC.2008.7557:10(1372-1386)Online publication date: 1-Oct-2008
    • (2007)Increasing cache capacity through word filteringProceedings of the 21st annual international conference on Supercomputing10.1145/1274971.1275002(222-231)Online publication date: 17-Jun-2007
    • (2006)Increasing the Cache Efficiency by Eliminating NoiseThe Twelfth International Symposium on High-Performance Computer Architecture, 2006.10.1109/HPCA.2006.1598121(147-156)Online publication date: 2006
    • (2005)Improving Multiprocessor Performance with Coarse-Grain Coherence TrackingACM SIGARCH Computer Architecture News10.1145/1080695.106999133:2(246-257)Online publication date: 1-May-2005
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media