Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/264107.264163acmconferencesArticle/Chapter ViewAbstractPublication PagesiscaConference Proceedingsconference-collections
Article
Free access

VM-based shared memory on low-latency, remote-memory-access networks

Published: 01 May 1997 Publication History

Abstract

Recent technological advances have produced network interfaces that provide users with very low-latency access to the memory of remote machines. We examine the impact of such networks on the implementation and performance of software DSM. Specifically, we compare two DSM systems---Cashmere and TreadMarks---on a 32-processor DEC Alpha cluster connected by a Memory Channel network.Both Cashmere and TreadMarks use virtual memory to maintain coherence on pages, and both use lazy, multi-writer release consistency. The systems differ dramatically, however, in the mechanisms used to track sharing information and to collect and merge concurrent updates to a page, with the result that Cashmere communicates much more frequently, and at a much finer grain.Our principal conclusion is that low-latency networks make DSM based on fine-grain communication competitive with more coarse-grain approaches, but that further hardware improvements will be needed before such systems can provide consistently superior performance. In our experiments, Cashmere scales slightly better than TreadMarks for applications with false sharing. At the same time, it is severely constrained by limitations of the current Memory Channel hardware. In general, performance is better for TreadMarks.

References

[1]
C. Amza, A. L Cox, S. Dwarkadas, P. Keleher, H. Lu, R. Rajamony, W. Yu, and W. Zwaenepoel. TreadMarks: Shared Memory Computing on Networks of Workstations. Computer, 29(2):I 8-28, Feb. I996.]]
[2]
M. Blumrieh, K. Li, R. Alpert, C. Dubnieki, E. Felten, and J. Sandberg. Virtual Memory Mapped Network Interface for the SHR/MP Multieomputer. In Proc. ofthe 21st Intl. Syrup. on ComputerArchitecture, pp. 142-153, Apt. 1994,]]
[3]
N.J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. E. Seitz, J. N. Seizovie, and W.-K. Su. Myrinet: A Gigabit-per-Seeond Local Area Network. In IEEE Micro, pp. 29-36, Feb. 1995.]]
[4]
W.J. Bolosky, M. L. Scott, R. P. Fitzgerald, R. J. Fowler, and A. L. Cox. NUMA Policies and Their Relation to Memory Architecture. In Proc. of the 4th Intl. Col~ on Architectural Support for Programming Languages and Operating Systems, pp. 212-221, Apr. 1991.]]
[5]
G. Buzzard, D. Jacobson, M. Maekey, S. Marovieh, and J. Wilkes. An Implementation of the Hamlyn Sender- Managed Interface Architecture. In Proc, of the 2nd Syrup. on Operating Systems Design and implementation, Oct. 1996.]]
[6]
I.S. Chase, E G. Amador, E. D. Lazowska, H. M. Levy, and R. J. Litflefield. The Amber System: Parallel Programming on a Network of Multiproeessors. In Proe, of the TweIBh ACM Symp. on Operating Systems Principles, pp. 147-158,Dee. 1989.]]
[7]
D. Culler, A. Dusseau, S. Goldstein, A. Krishnamurthy, S, Lumetta, T. yon Eieken, and K. Yeliek. Parallel Programruing in Split-C. In Proc. Supercomputing '93, pp. 262- 273, Nov. 1993.]]
[8]
$. Dwarkadas, A. A. Sehaffer, R. W. Cottingham Jr., A, L, Cox, P. Keleher, and W. Zwaenepoel. Parallelization of General Linkage Analysis Problems. Human Heredity, 44:127-141,1994.]]
[9]
S. Dwarkadas, A. L. Cox, and W. Zwaenepoel. An Integrated Compile-Time/Run-Time Software Distributed Shared Memory System. In Proc. of the 7th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, Oct. 1996.]]
[10]
A. Erliehson, N. Nuekolls, G. Chesson, and J. Hennessy. SoftFLASH: Analyzing the Performance of Clustered Distributed Virtual Shared Memory. In Proc. of the 7th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, Oct. 1996.]]
[11]
M.J. Feeley, J. S. Chase, V. R. Narasayya, and H. M. Levy. Integrating Coherency and Recovery in Distributed Systems. In Proc. of the 1st Symp. on Operating Systems Design and Implementation, Nov. 1994.]]
[12]
R. Gillett. Memory Channel: An Optimized Cluster interconnect. IEEE Micro, 16(2), Feb. 1996.]]
[13]
D.B. Gustavson. The Sealable Coherent Interface and Related Standards Projects. IEEE Micro, 12(2):10-22, Feb. 1992.]]
[14]
L.Iftode, C.Dubnieki, E.W.Felten, andK.Li. Improving Release-Consistent Shared VLrmal Memory Using Automatie Update. In Proc. of the 2nd Intt Syrup. on High Performance Computer Architecture, Feb. 1996.]]
[15]
K. L. Johnson, M. E Kaashoek, and D. A. Wallach. CRL: High-Performance All-Software Distributed Shared Memory. In Proc. of the Fifteenth A CM Symp. on Operating Systems Principles, Dee. 1995.]]
[16]
P. Keleher, A. L. Cox, and W. Zwaenepoel. Lazy Release Consistency for Software Distributed Shared Memory. In Proc. of the lPth intL Syrup. on Computer Architecture, pp. 13-21, May 1992.]]
[17]
L.I. Kontothanassis and M. L. Scott. High Performance Software Coherence for Current and Future Architectures. Journal of Parallel and Distributed Computing, 29(2):179-195, Nov. 1995.]]
[18]
L.I. Kontothanassis and M. L. Scott. Using Memory- Mapped Network Interfaces to Improve the Performance of Distributed Shared Memory. In Proc. of the 2nd IntL Syrup. on High Performance ComputerArchitecture, Feb. 1996.]]
[19]
R.P. LaRowe Jr. and C. S. Ellis. Experimental Comparison of Memory Management Policies for NUMA Multiproeessors. ACM Transactions on Computer Systems, 9(4):319-363, Nov. 1991.]]
[20]
D. Lenoski, J. Laudon, K. Ghamehofloo, A. Gupta, and J. Hennessy. The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor. In Proc. of the 17th IntL Symp. on ComputerArchitecture, pp. 148-159, May 1990.]]
[21]
K. Li and P. Hudak. Memory Coherence in Shared V'lrtual Memory Systems. ACM Transactions on Computer Systems, 7(4):321-359, Nov. 1989.]]
[22]
O. Lysne, S. Gjessing, and K. Loehsen. Running the SCI Protocol over HIC Networks. In 2nd Intl. Workshop on SCl-based Low-cos~igh-performance Computing (SCIzzL-2), Mar. 1995.]]
[23]
M. Marchetti, L. Kontothanassis, R. Bianehini, and M. L. Scott. Using Simple Page Placement Policies to Reduce the Cost of Cache Fills in Coherent Shared-Memory Systems. In Proc. of the Ninth Intl. Parallel Processing Symp., Apr. 1995.]]
[24]
R.S. Nikhil. Cid: A Parallel, "Shared-memory" C for Distributed-Memory Machines. In Proc. of the 7th Annual Workshop on Languages and Compilers for Parallel Computing, Aug. 1994.]]
[25]
B. Nitzberg and V. Lo. Distributed Shared Memory: A Survey of Issues and Algorithms. Computer, 24(8):52-60, Aug. 1991.]]
[26]
K. Petersen and K. Li. Cache Coherence for Shared Memory Multiprocessors Based on Virtual Memory Support. In Proc. of the 7th Intl. Parallel Processing Symp., Apr. 1993.]]
[27]
H.S. Sandhu, B. Gamsa, and S. Zhou. The Shared Regions Approach to Software Cache Coherence on Multiprocessors. In Proc. of the 4th ACM Symp. on Principles and Practice of Parallel Programming, May 1993.]]
[28]
D.J. Scales, K. Gharachorloo, and C. A. Thekkath. Shasta: A Low Overhead, Software-Only Approach for Supporting Fine-Grain Shared Memory. In Proc. of the 7th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, Oct. 1996.]]
[29]
I. Schoinas, B. Falsafi, A. R. Lebeck, S. K. Reinhardt, J. R. Larus, and D. A. Wood. Fine-grain Access Control for Distributed Shared Memory. In Proc. of the 6th Intl. Conf. on Architectural Support for Programming Languages and Operating Systems, pp. 297-306, Oct. 1994.]]
[30]
J.P. Singh, W.-D. Weber, and A. Gupta. SPLASH: Stanford Parallel Applications for Shared-Memory. ACM SIGARCH Computer Architecture News, 20(1):5--44, Mar. 1992.]]
[31]
A.S. Tanenbaum, M.E Kaashoek, andH.E.Bal. Parallel Programming Using Shared Objects and Broadcasting. Computer, 25(8):10---19, Aug. 1992.]]
[32]
S.C. Woo, M. Ohara, E. Torrie, J. P. Singh, and A. Gupta. Methodological Considerations and Characterization of the SPLASH-2 Parallel Application Suite. In Proc. of the 22nd Intl. Symp. on Computer Architecture, June 1995.]]
[33]
D. Yeung, J. Kubiatowitcz, and A. Agarwal. MGS: A Multigrain Shared Memory System. In Proc. of the 23rd Intl. Syrup. on Computer Architecture, May, 1996.]]
[34]
M.J. Zekauskas,W. A. Sawdon, and B. N. Bershad. Software Write Detection for Distributed Shared Memory. In Proc. of the 1st Symp. on Operating Systems Design and Implementation, Nov. 1994.]]

Cited By

View all
  • (2018)Google Workloads for Consumer DevicesACM SIGPLAN Notices10.1145/3296957.317317753:2(316-331)Online publication date: 19-Mar-2018
  • (2017)Distributed shared persistent memoryProceedings of the 2017 Symposium on Cloud Computing10.1145/3127479.3128610(323-337)Online publication date: 24-Sep-2017
  • (2013)Design tradeoffs for simplicity and efficient verification in the Execution Migration Machine2013 IEEE 31st International Conference on Computer Design (ICCD)10.1109/ICCD.2013.6657037(145-153)Online publication date: Oct-2013
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ISCA '97: Proceedings of the 24th annual international symposium on Computer architecture
June 1997
350 pages
ISBN:0897919017
DOI:10.1145/264107
  • cover image ACM SIGARCH Computer Architecture News
    ACM SIGARCH Computer Architecture News  Volume 25, Issue 2
    Special Issue: Proceedings of the 24th annual international symposium on Computer architecture (ISCA '97)
    May 1997
    349 pages
    ISSN:0163-5964
    DOI:10.1145/384286
    Issue’s Table of Contents

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 May 1997

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

ISCA97
Sponsor:

Acceptance Rates

Overall Acceptance Rate 543 of 3,203 submissions, 17%

Upcoming Conference

ISCA '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)90
  • Downloads (Last 6 weeks)28
Reflects downloads up to 18 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2018)Google Workloads for Consumer DevicesACM SIGPLAN Notices10.1145/3296957.317317753:2(316-331)Online publication date: 19-Mar-2018
  • (2017)Distributed shared persistent memoryProceedings of the 2017 Symposium on Cloud Computing10.1145/3127479.3128610(323-337)Online publication date: 24-Sep-2017
  • (2013)Design tradeoffs for simplicity and efficient verification in the Execution Migration Machine2013 IEEE 31st International Conference on Computer Design (ICCD)10.1109/ICCD.2013.6657037(145-153)Online publication date: Oct-2013
  • (2011)Software-Based Cache Coherence with Hardware-Assisted Selective Self-Invalidations Using Bloom FiltersIEEE Transactions on Computers10.1109/TC.2010.15560:4(472-483)Online publication date: 1-Apr-2011
  • (2008)An OS-based alternative to full hardware coherence on tiled CMPs2008 IEEE 14th International Symposium on High Performance Computer Architecture10.1109/HPCA.2008.4658652(355-366)Online publication date: Feb-2008
  • (2007)Analysis of Performance Requirement for Large-Scale InfiniBand-based DVSM SystemThe KIPS Transactions:PartA10.3745/KIPSTA.2007.14-A.4.21514A:4(215-226)Online publication date: 31-Aug-2007
  • (2007)Management of a cooperative cache in grids with grid cache servicesConcurrency and Computation: Practice and Experience10.1002/cpe.121219:16(2141-2155)Online publication date: 10-Jul-2007
  • (2005)Safe futures for JavaACM SIGPLAN Notices10.1145/1103845.109484540:10(439-453)Online publication date: 12-Oct-2005
  • (2005)Lifting sequential graph algorithms for distributed-memory parallel computationACM SIGPLAN Notices10.1145/1103845.109484440:10(423-437)Online publication date: 12-Oct-2005
  • (2005)Relational queries over program tracesACM SIGPLAN Notices10.1145/1103845.109484140:10(385-402)Online publication date: 12-Oct-2005
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media