Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/139669.139706acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
Article
Free access

The DASH prototype: implementation and performance

Published: 01 April 1992 Publication History

Abstract

The fundamental premise behind the DASH project is that it is feasible to build large-scale shared-memory multiprocessors with hardware cache coherence. While paper studies and software simulators are useful for understanding many high-level design trade-offs, prototypes are essential to ensure that no critical details are overlooked. A prototype provides convincing evidence of the feasibility of the design allows one to accurately estimate both the hardware and the complexity cost of various features, and provides a platform for studying real workloads. A 16-processor prototype of the DASH multiprocessor has been operational for the last six months. In this paper, the hardware overhead of directory-based cache coherence in the prototype is examined. We also discuss the performance of the system, and the speedups obtained by parallel applications running on the prototype. Using a sophisticated hardware performance monitor, we characterize the effectiveness of coherent caches and the relationship between an application's reference behavior and its speedup.

References

[1]
Agarwal, A., B.-H. Lim, D. Kranz, and J. Kubiatowicz. LimitLESS Directories: A Scalable Cache Coherence Scheme. In Proc. Fourth Int. Conf. on Architectural Support Programming Languages and Operating Systems. pp. 224- 234, 1991.
[2]
Agarwal, A., R. Simoni, J. Hermessy, and M. Horowitz. An Evaluation of Directory Schemes for Cache Coherence. In Proc. 15th Int. Syrup. on Computer Architecture. pp. 280- 289, 1988.
[3]
Baskett, F., T. Jermoluk, and D. Solomon. The 4D-MP Graphics Superworkstation: Computing + Graphics = 40 MIPS + 40 MFLOPS and 100,000 Lighted Polygons per Second. In Proc. Compcon Spring 88. pp. 468-471, 1988.
[4]
Censier, L. and P. Feautrier, A New Solution to Coherence Problems in Multicache Systems. IEEE Trans. on Con~uters, C(27):1112-1118, 1978.
[5]
Flaig, C.M., VLSI Mesh Routing Systems. Technical Report 524 l:TR:87, California Institute of Technology, May 1987.
[6]
Gupta, A., W.-D. Weber, and T. Mowry. Reducing Memory and Traffic Requirements for Scalable Directory-Based Cache Coherence Schemes. In Proc. 1990 Int. Conf. on Parallel Processing. pp. 1:312-321, 1990.
[7]
Lenoski, D., J. Laudon, K. Gharachorloo, A. Gupta, and J. Hennessy. The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor. In Proc. 17th Int. Syrup. on Computer Architecture. pp. 148-159, 1990.
[8]
Lenoski, D., J. Laudon, K. Gharachorloo, W.-D. Weber, A. Gupta, J. Hennessy, M. Horowitz, and M. Larn, The Stanford DASH Multiprocessor. Computer, 25(3), 1992.
[9]
Lenoski, D.E., The Design and Analysis of DASH: A Scalable Directory-Based Multiprocessor. Ph.D. Thesis. Stanford University. 1991. Also available as Stanford University Technical Report CSL-TR-92-507
[10]
Lusk, E., R. Overbeek, J. Boyle, R. Butler, T. Disz, B. Glickfeld, J. Patterson, and R. Stevens, Portable Programs for Parallel Processors. Holt, Rinehard and Winston, Inc.1987.
[11]
O'Krafka, B.W. and A.R. Newton. An Empirical Evaluation of Two Memory-Efficient Directory Methods. In Proc. 17th Int. Syrup. on Computer Architecture. pp. 138-147, 1990.
[12]
Papamarcos, M.S. and J.H. Patel. A Low Overhead Coherence Solution for Multiprocessors with Private Cache Memories. In Proc. 11th Int. Syrup. on Computer Architecture. pp. 348-354, 1984.
[13]
Singh, J.P., C. Holt, T.Totsuka, A. Gupta, andJ.L. Hermessy, Balancing and Data Locality in Parallel N-body Techniques. Technical Report CSL-TR-92-505, Stanford University, 1991.
[14]
Singh, J.P., W.-D. Weber, and A. Gupta, SPLASH: Stanford Parallel Applications for Shared Memory. Technical Report CSL-TR-91-469, Stanford University, 1991.
[15]
Xilinx, The Programmable Gate Array Data Book. 1991.

Cited By

View all
  • (2018)Guidelines for Creating Senior-Friendly Product InstructionsACM Transactions on Accessible Computing10.1145/320988211:2(1-35)Online publication date: 8-Jun-2018
  • (2018)Design and Evaluation of a Collaborative Virtual Environment (CoMove) for Autism Spectrum Disorder InterventionACM Transactions on Accessible Computing10.1145/320968711:2(1-22)Online publication date: 8-Jun-2018
  • (2017)Multidimensional Intratile Parallelization for Memory-Starved Stencil ComputationsACM Transactions on Parallel Computing10.1145/31552904:3(1-32)Online publication date: 18-Dec-2017
  • Show More Cited By

Index Terms

  1. The DASH prototype: implementation and performance

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    ISCA '92: Proceedings of the 19th annual international symposium on Computer architecture
    May 1992
    439 pages
    ISBN:0897915097
    DOI:10.1145/139669
    • cover image ACM SIGARCH Computer Architecture News
      ACM SIGARCH Computer Architecture News  Volume 20, Issue 2
      Special Issue: Proceedings of the 19th annual international symposium on Computer architecture (ISCA '92)
      May 1992
      429 pages
      ISSN:0163-5964
      DOI:10.1145/146628
      Issue’s Table of Contents

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 01 April 1992

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Article

    Conference

    ISCA92
    Sponsor:
    ISCA92: International Conference on Computer Architecture
    May 19 - 21, 1992
    Queensland, Australia

    Acceptance Rates

    Overall Acceptance Rate 543 of 3,203 submissions, 17%

    Upcoming Conference

    ISCA '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)114
    • Downloads (Last 6 weeks)18
    Reflects downloads up to 01 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2018)Guidelines for Creating Senior-Friendly Product InstructionsACM Transactions on Accessible Computing10.1145/320988211:2(1-35)Online publication date: 8-Jun-2018
    • (2018)Design and Evaluation of a Collaborative Virtual Environment (CoMove) for Autism Spectrum Disorder InterventionACM Transactions on Accessible Computing10.1145/320968711:2(1-22)Online publication date: 8-Jun-2018
    • (2017)Multidimensional Intratile Parallelization for Memory-Starved Stencil ComputationsACM Transactions on Parallel Computing10.1145/31552904:3(1-32)Online publication date: 18-Dec-2017
    • (2016)Affinity-Based Thread and Data Mapping in Shared Memory SystemsACM Computing Surveys10.1145/300638549:4(1-38)Online publication date: 5-Dec-2016
    • (2014)Author retrospective for compiler-directed data prefetching in multiprocessors with memory hierarchiesACM International Conference on Supercomputing 25th Anniversary Volume10.1145/2591635.2591644(9-11)Online publication date: 10-Jun-2014
    • (2013)MEMSCALE: Re-architecting Memory Resources for ClustersHigh-Performance Computing Using FPGAs10.1007/978-1-4614-1791-0_19(569-604)Online publication date: 28-Feb-2013
    • (2010)Semi-supervised distance metric learning for collaborative image retrieval and clusteringACM Transactions on Multimedia Computing, Communications, and Applications10.1145/1823746.18237526:3(1-26)Online publication date: 27-Aug-2010
    • (2010)Audio-visual atoms for generic video concept classificationACM Transactions on Multimedia Computing, Communications, and Applications10.1145/1823746.18237486:3(1-19)Online publication date: 27-Aug-2010
    • (2010)Visual query suggestionACM Transactions on Multimedia Computing, Communications, and Applications10.1145/1823746.18237476:3(1-19)Online publication date: 27-Aug-2010
    • (2008)The case for simple, visible cache coherencyProceedings of the 2008 ACM SIGPLAN workshop on Memory systems performance and correctness: held in conjunction with the Thirteenth International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS '08)10.1145/1353522.1353532(31-35)Online publication date: 2-Mar-2008
    • Show More Cited By

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media