
Improving cache performance with balanced tag and data paths

Published: 01 September 1996

Abstract

There are two concurrent paths in a typical cache access: one through the data array and the other through the tag array. The path through the data array drives the selected set out of the array. The path through the tag array determines cache hit/miss and, for set-associative caches, selects the appropriate line from within the selected set. In both direct-mapped and set-associative caches, the path through the tag array is significantly longer than that through the data array. In this paper, we propose a path balancing technique to help match the delays of the tag and data paths. The basic idea behind this technique is to employ a separate subset of the tag array to decouple the one-to-one relationship between address tags and cache lines so as to achieve a design that provides higher performance. Performance evaluation using both TPC-C and SPEC92 benchmarks shows that this path balancing technique offers impressive improvements in overall system performance over conventional cache designs. For TPC-C, improvements in the range of 6% to 28% are possible.
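The two paths of a conventional set-associative lookup can be illustrated with a minimal software sketch. It models only the baseline access described above, not the proposed path balancing technique, whose details the abstract does not give. The class and method names (`SetAssociativeCache`, `split`, `fill`, `lookup`) and the address split into tag, index, and offset are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Line:
    """One cache line: a valid bit, an address tag, and the cached bytes."""
    valid: bool = False
    tag: int = 0
    data: bytes = b""


class SetAssociativeCache:
    """Hypothetical model of a conventional set-associative cache."""

    def __init__(self, num_sets: int, ways: int, line_size: int) -> None:
        self.num_sets = num_sets
        self.ways = ways
        self.line_size = line_size
        self.sets: List[List[Line]] = [
            [Line() for _ in range(ways)] for _ in range(num_sets)
        ]

    def split(self, addr: int) -> Tuple[int, int, int]:
        """Split an address into (tag, set index, byte offset)."""
        offset = addr % self.line_size
        index = (addr // self.line_size) % self.num_sets
        tag = addr // (self.line_size * self.num_sets)
        return tag, index, offset

    def fill(self, addr: int, way: int, data: bytes) -> None:
        """Install a line for `addr` in the given way of its set."""
        tag, index, _ = self.split(addr)
        self.sets[index][way] = Line(True, tag, data)

    def lookup(self, addr: int) -> Optional[int]:
        """Return the byte at `addr` on a hit, or None on a miss."""
        tag, index, offset = self.split(addr)

        # Data path: drive every line of the selected set out of the array.
        candidates = [line.data for line in self.sets[index]]

        # Tag path: compare stored tags to decide hit/miss and pick the way.
        hit_way = None
        for way, line in enumerate(self.sets[index]):
            if line.valid and line.tag == tag:
                hit_way = way
                break

        if hit_way is None:
            return None  # miss: no way in the selected set matched
        # The tag result selects one line from the set the data path produced.
        return candidates[hit_way][offset]


cache = SetAssociativeCache(num_sets=4, ways=2, line_size=8)
cache.fill(0x48, way=1, data=bytes(range(8)))
hit = cache.lookup(0x4B)   # same line as 0x48, byte offset 3
miss = cache.lookup(0x08)  # same set, different tag
```

In this model the final selection cannot happen until the tag comparison finishes, which mirrors the abstract's observation that the tag path, not the data path, determines when the access completes.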

Published In

ACM SIGPLAN Notices, Volume 31, Issue 9
Sept. 1996
273 pages
ISSN: 0362-1340
EISSN: 1558-1160
DOI: 10.1145/248209
  • ASPLOS VII: Proceedings of the seventh international conference on Architectural support for programming languages and operating systems
    October 1996
    290 pages
    ISBN: 0897917677
    DOI: 10.1145/237090

Publisher

Association for Computing Machinery

New York, NY, United States

