Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/325164.325124acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
Article
Free access

Adaptive software cache management for distributed shared memory architectures

Published: 01 May 1990 Publication History

Abstract

An adaptive cache coherence mechanism exploits semantic information about the expected or observed access behavior of particular data objects. We contend that, in distributed shared memory systems, adaptive cache coherence mechanisms will outperform static cache coherence mechanisms. We have examined the sharing and synchronization behavior of a variety of shared memory parallel programs. We have found that the access patterns of a large percentage of shared data objects fall in a small number of categories for which efficient software coherence mechanisms exist. In addition, we have performed a simulation study that provides two examples of how an adaptive caching mechanism can take advantage of semantic information.

References

[1]
Anant Agarwal, Richard L. Sites, and Mark Horowitz. ATUM: A new technique for capturing address traces using microcode. In Proceedings of the 13th Annual International Symposium on Computer Architecture, pages 119-127, June 1986.
[2]
James Archibald. A cache coherence approach for large multiprocessor systems. In International Conference on Supercomputing, pages 337-345, November 1988.
[3]
James Archibald and Jean-Loup Baer. Cache coherence protocols: Evaluation using a multiprocessor simulation model. A CM fiansactions on Computer Systems, 4(4):273-298, November 1986.
[4]
John K. Bennett, John B. Carter, and Willy Zwaenepoel. Munin: Distributed shared memory based on type-specific memory coherence. In Proceedings of the 1990 Conference on Principles and Practice of Parallel Programming, March 1990.
[5]
Brian N. Bershad, Edward D. Lazowska, and Henry M. Levy. PRESTO: A system for objectoriented parallel programming. Software- Practice and Experience, 18(8):713-732, August 1988.
[6]
Jeffrey S. Chase, Franz G. Amador, Edward D. Lazowska, Henry M. Levy, and Richard J. Littlefield. The Amber system: Parallel programming on a network of multiprocessors. In Proceedings of the Twelfth ACMSymposium on Operating Systems Principles, pages 147-158, December 1989.
[7]
Michel Dubois, Christoph Scheurich, and FayC A. Briggs. Synchronization, coherence, and event ordering in multiprocessors. IEEE Computer, 21(2):9-21, February 1988.
[8]
Susan J. Eggers and Randy H. Katz. A characterization of sharing in parallel programs and its application to coherency protocol evaluation. In Proceedings of the 15th Annual International Symposium on Computer Architecture, pages 373-383, May 1988.
[9]
James R. Goodman, Mary K. Vernon, and Philip J. Woest. Efficient synchronization primitives for large-scale cache-coherent multiprocessor. In Proceedings of the 3rd International Conference on Architectural Support for Programming Languages and Systems, April 1989.
[10]
A. R. Karlin, M. S. Manasse, L. Rudolph, and D.D. Sleator. Competitive snoopy caching. In Proceedings of the 16th Annual IEEE Symposium on the Foundations of Computer Science, pages 244-254,1986.
[11]
Kai Li. Private communication. March 1990.
[12]
Kai Li and Paul Hudak. Memory coherence in shared virtual memory systems. ACM Tmnsactions on Computer Systems, 7(4):321-359, November 1989.
[13]
Tom Lovett and Shreekant Thakkar. The Symmetry multiprocessor system. In Proceedings of the 1988 International Conference on Parallel Processing, pages 303-310, August 1988.
[14]
Umakishore Ramachandran and M. Yousef A. Khalidi. An implementation of distributed shared memory. Distributed and Multiprocessor Systems Workshop, pages 21-38, 1989.
[15]
Richard L. Sites and Anant Agarwal. Multipro cessor cache analysis using ATUM. In Proceedings of the 15th Annual International Symposium on Computer Architecture, pages 186195, June 1988.
[16]
K. So, F. DaremacRogers, D. George, V.A. Norton, and G.F. Pfister. PSTMUL: A system for parallel simulation of the execution of parallel programs. Technical Report RC11674, IBM Research, 1986.
[17]
Bjarne Stroustrup. The C++ Programming Language. Addison-Wesley, 1.987.
[18]
Wolf-Dietrich Weber and Anoop Gupta. Analysis of cache invalidation patterns in multiprocessors. In Proceedings of the 3rd International Conference on Architectural Support for I'rogramming Languages and Systems, pages 2,43- 256, April 1989.

Cited By

View all
  • (2021)A software cache autotuning strategy for dataflow computing with UPC++ DepSpawnComputational and Mathematical Methods10.1002/cmm4.1148Online publication date: 22-Feb-2021
  • (2012)Asymmetric Cache CoherencyACM Transactions on Reconfigurable Technology and Systems10.1145/2362374.23623765:3(1-12)Online publication date: 1-Oct-2012
  • (2009)A communication characterisation of Splash-2 and ParsecProceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC)10.1109/IISWC.2009.5306792(86-97)Online publication date: 4-Oct-2009
  • Show More Cited By

Index Terms

  1. Adaptive software cache management for distributed shared memory architectures

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      ISCA '90: Proceedings of the 17th annual international symposium on Computer Architecture
      May 1990
      378 pages
      ISBN:0897913663
      DOI:10.1145/325164

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 May 1990

      Permissions

      Request permissions for this article.

      Check for updates

      Qualifiers

      • Article

      Conference

      ISCA90
      Sponsor:
      ISCA90: International Symposium on Computer Architecture
      May 28 - 31, 1990
      Washington, Seattle, USA

      Acceptance Rates

      Overall Acceptance Rate 543 of 3,203 submissions, 17%

      Upcoming Conference

      ISCA '25

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)86
      • Downloads (Last 6 weeks)17
      Reflects downloads up to 21 Sep 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2021)A software cache autotuning strategy for dataflow computing with UPC++ DepSpawnComputational and Mathematical Methods10.1002/cmm4.1148Online publication date: 22-Feb-2021
      • (2012)Asymmetric Cache CoherencyACM Transactions on Reconfigurable Technology and Systems10.1145/2362374.23623765:3(1-12)Online publication date: 1-Oct-2012
      • (2009)A communication characterisation of Splash-2 and ParsecProceedings of the 2009 IEEE International Symposium on Workload Characterization (IISWC)10.1109/IISWC.2009.5306792(86-97)Online publication date: 4-Oct-2009
      • (2007)Memory SystemsundefinedOnline publication date: 10-Sep-2007
      • (2005)The Java memory modelACM SIGPLAN Notices10.1145/1047659.104033640:1(378-391)Online publication date: 12-Jan-2005
      • (2005)The Java memory modelProceedings of the 32nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages10.1145/1040305.1040336(378-391)Online publication date: 12-Jan-2005
      • (2004)On the correctness of program execution when cache coherence is maintained locally at data-sharing boundaries in distributed shared memory multiprocessorsInternational Journal of Parallel Programming10.1023/B:IJPP.0000038070.79088.0b32:5(415-446)Online publication date: 1-Oct-2004
      • (2003)Compiler-decided dynamic memory allocation for scratch-pad based embedded systemsProceedings of the 2003 international conference on Compilers, architecture and synthesis for embedded systems10.1145/951710.951747(276-286)Online publication date: 30-Oct-2003
      • (2003)Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessorsACM SIGARCH Computer Architecture News10.1145/871656.85964231:2(206-217)Online publication date: 1-May-2003
      • (2003)Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessorsProceedings of the 30th annual international symposium on Computer architecture10.1145/859618.859642(206-217)Online publication date: 9-Jun-2003
      • Show More Cited By

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Get Access

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media