Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/1870926.1870953acmconferencesArticle/Chapter ViewAbstractPublication PagesdateConference Proceedingsconference-collections
research-article

Efficient OpenMP data mapping for multicore platforms with vertically stacked memory

Published: 08 March 2010 Publication History

Abstract

Emerging TSV-based 3D integration technologies have shown great promise to overcome scalability limitations in 2D designs by stacking multiple memory dies on top of a many-core die. Application software developers need programming models and tools to fully exploit the potential of vertically stacked memory. In this work, we focus on efficient data mapping for SPMD parallel applications on an explicitly managed 3D-stacked memory hierarchy, which requires placement of data across multiple vertical memory stacks to be carefully optimized. We propose a programming framework with compiler support that enables array partitioning. Partitions are mapped to the 3D-stacked memory on top of the processor that mostly accesses it to take advantage of the lower latencies of vertical interconnect and for minimizing high-latency traffic on the horizontal plane.

References

[1]
U. Kang and et al. 8gb 3d ddr3 dram using through-silicon-via technology. In ISSCC '09 2009.
[2]
M. Kawano et al. Three-dimensional packaging technology for stacked dram with 3-gb/s data transfer. In IEEE TED, 2008.
[3]
3d stacked ddr2 sdram www.tezzaron.com/memory/TSC_Leo_II.htm.
[4]
T. Kgil et al. Picoserver: Using 3d stacking technology to build energy efficient servers. In J. Emerg. Technol. Comput. Syst., 2008.
[5]
G. H. Loh. 3d-stacked memory architectures for multi-core processors. In SIGARCH Comput. Archit. News, 2008.
[6]
P. G. Emma and E. Kursun. Is 3d chip technology the next growth engine for performance improvement? In IBM Journal of Research and Development, 2008.
[7]
B. Black et al. Die stacking (3d) microarchitecture. In MICRO 39.
[8]
Li and et al. 3d integration: Opportunities and challenges. In ISCAS '08, 2008.
[9]
Made in ibm labs: Ibm cools 3-d chips with h2o www-03.ibm.com/press/us/en/pressrelease/24385.wss.
[10]
F. Li et al. Design and management of 3d chip multiprocessors using network-in-memory. In ISCA '06, 2006.
[11]
S. Gu et al. Stackable memory of 3d chip integration for mobile applications. In IEDM '08, 2008.
[12]
M. Facchini et al. System-level power/performance evaluation of 3d stacked drams for mobile applications. In DATE '09, 2009.
[13]
O. Ozturk et al. Optimal topology exploration for application-specific 3d architectures. In ASP-DAC '06, 2006.
[14]
J. H. Kelm et al. SChISM: scalable cache incoherent shared memory CRHC-08-06.
[15]
K. Fatahalian et al. Sequoia: programming the memory hierarchy. In Supercomputing 2006, 2006.
[16]
T. J. Knight et al. Compilation for explicitly managed memory hierarchies. In PPoPP '07, 2007.
[17]
W. W. Carlson et al. Introduction to UPC and language specification CCS-TR-99-157.
[18]
R. Chandra et al. Data distribution support on distributed shared memory multiprocessors In PLDI '97, 1997.
[19]
J. Bircsak et al. Extending OpenMP for NUMA machines. In Supercomputing 2000, 2000.
[20]
B. Chapman et al. Achieving performance under OpenMP on ccNUMA and software distributed shared memory systems. In Concurrency and Computation: Practice and Experience 14, 2002.
[21]
Y. Ojima et al. Performance of cluster-enabled OpenMP for the SCASH software distributed shared memory system. In CCGRID '03, 2003.
[22]
E. Flamand. Strategic directions towards multicore application specific computing. In DATE '09, 2009.

Cited By

View all
  • (2012)Asymmetric DRAM synthesis for heterogeneous chip multiprocessors in 3D-stacked architectureProceedings of the International Conference on Computer-Aided Design10.1145/2429384.2429399(73-80)Online publication date: 5-Nov-2012

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
DATE '10: Proceedings of the Conference on Design, Automation and Test in Europe
March 2010
1868 pages
ISBN:9783981080162

Sponsors

  • EDAA: European Design Automation Association
  • ECSI
  • EDAC: Electronic Design Automation Consortium
  • SIGDA: ACM Special Interest Group on Design Automation
  • The IEEE Computer Society TTTC
  • The IEEE Computer Society DATC
  • The Russian Academy of Sciences: The Russian Academy of Sciences

Publisher

European Design and Automation Association

Leuven, Belgium

Publication History

Published: 08 March 2010

Check for updates

Qualifiers

  • Research-article

Conference

DATE '10
Sponsor:
  • EDAA
  • EDAC
  • SIGDA
  • The Russian Academy of Sciences
DATE '10: Design, Automation and Test in Europe
March 8 - 12, 2010
Germany, Dresden

Acceptance Rates

Overall Acceptance Rate 518 of 1,794 submissions, 29%

Upcoming Conference

DATE '25
Design, Automation and Test in Europe
March 31 - April 2, 2025
Lyon , France

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)2
Reflects downloads up to 23 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2012)Asymmetric DRAM synthesis for heterogeneous chip multiprocessors in 3D-stacked architectureProceedings of the International Conference on Computer-Aided Design10.1145/2429384.2429399(73-80)Online publication date: 5-Nov-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media