Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3145617.3158214acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
short-paper

Evaluating GPGPU Memory Performance Through the C-AMAT Model

Published: 12 November 2017 Publication History

Abstract

General Purpose Graphics Processing Units (GPGPU) have become a popular platform to accelerate high performance applications. Although they provide exceptional computing power, GPGPU impose significant pressure on the off-chip memory system. Evaluating, understanding, and improving GPGPU data access delay has become an important research topic in high-performance computing. In this study, we utilize the newly proposed GPGPU/C-AMAT (Concurrent Average Memory Access Time) model to quantitatively evaluate GPGPU memory performance. Specifically, we extend the current C-AMAT model to include a GPGPU-specific modeling component and then provide its evaluation results.

References

[1]
X.-H. Sun and D. Wang, "Concurrent Average Memory Access Time," Computer, vol. 47, no. 5, pp. 74--80, 2014.
[2]
A. Jog, O. Kayiran, N.C. Nachiappan, and et al. "OWL: Cooperative Thread Array Aware Scheduling Techniques for Improving GPGPU Performance," SIGPLAN Vol.48, pp. 395--406, 2013.
[3]
M. Lee et al., "Improving GPGPU Resource Utilization through Alternative Thread Block Scheduling," IEEE 20th International Symposium on High Performance Computer Architecture (HPCA), Orlando, FL, pp. 260--271, 2014.
[4]
P. Xiang, Y. Yang, and H. Zhou, "Warp-level divergence in gpus: Characterization, impact, and mitigation," in Proceedings of 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA-20), 2014.
[5]
O. Kayiran, A. Jog, M. T. Kandemir, and C. R. Das, "Neither more nor less: Optimizing thread-level parallelism for gpgpus," in Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT'13), 2013.
[6]
M. Lee, S. Song, J. Moon, J. Kim, W. Seo, Y. Cho, and S. Ryu, "Improving gpgpu resource utilization through alternative thread block scheduling," in Proceedings of 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA-20), 2014.
[7]
S.-Y. Lee, A. Arunkumar, and C.-J. Wu, "Cawa: coordinated warp scheduling and cache prioritization for critical warp acceleration of gpgpu workloads," in Proceedings of the 42nd Annual International Symposium on Computer Architecture (ISCA-42), 2015.
[8]
J. Wang and S. Yalamanchili, "Characterization and analysis of dynamic parallelism in unstructured gpu applications," in Proceedings of 2014 IEEE International Symposium on Workload Characterization (IISWC'14), 2014.
[9]
J. Wang, N. Rubin, A. Sidelnik, and S. Yalamanchili, "Dynamic thread block launch: A lightweight execution mechanism to support irregular applications on gpus," in Proceedings of the 42nd Annual International Symposium on Computer Architecuture (ISCA-42), 2015.
[10]
G. Chen and X. Shen, "Free launch: Optimizing gpu dynamic kernel launches through thread reuse," in Proceedings of the 48th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-48), 2015.
[11]
Wulf, W. A. and Mckee, S. A. 1995. Hitting the Memory Wall: Implications of the Obvious. ACM SIGARCH computer architecture news 23, 1, 20--24.
[12]
X.-H. Sun, "Concurrent-AMAT: A Mathematical Model for Big Data access," HPC Magazine, 2014.
[13]
N. K. Govindaraju, S. Larsen, J. Gray, and D. Manocha. A memory model for scientific algorithms on graphics processors. In SC, 2006
[14]
S. B. Weiguo Liu, Muller-Wittig. Performance predictions for general-purpose computation on gpus. 2007.
[15]
J. Kessenich, D. Baldwin, and R. Rost. The OpenGL shading language. http://www.opengl.org/documentation.

Cited By

View all
  • (2023)The Memory-Bounded Speedup Model and Its Impacts in ComputingJournal of Computer Science and Technology10.1007/s11390-022-2911-138:1(64-79)Online publication date: 31-Jan-2023
  • (2021)A Study on Modeling and Optimization of Memory SystemsJournal of Computer Science and Technology10.1007/s11390-021-0771-836:1(71-89)Online publication date: 30-Jan-2021
  • (2020)Performance Modeling and Evaluation of a Production Disaggregated Memory SystemProceedings of the International Symposium on Memory Systems10.1145/3422575.3422795(223-232)Online publication date: 28-Sep-2020
  • Show More Cited By

Index Terms

  1. Evaluating GPGPU Memory Performance Through the C-AMAT Model

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    MCHPC'17: Proceedings of the Workshop on Memory Centric Programming for HPC
    November 2017
    43 pages
    ISBN:9781450351317
    DOI:10.1145/3145617
    © 2017 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the United States Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 12 November 2017

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. C-AMAT
    2. GPGPU
    3. Memory Performance Evaluation

    Qualifiers

    • Short-paper
    • Research
    • Refereed limited

    Conference

    SC '17
    Sponsor:

    Upcoming Conference

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)3
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 26 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)The Memory-Bounded Speedup Model and Its Impacts in ComputingJournal of Computer Science and Technology10.1007/s11390-022-2911-138:1(64-79)Online publication date: 31-Jan-2023
    • (2021)A Study on Modeling and Optimization of Memory SystemsJournal of Computer Science and Technology10.1007/s11390-021-0771-836:1(71-89)Online publication date: 30-Jan-2021
    • (2020)Performance Modeling and Evaluation of a Production Disaggregated Memory SystemProceedings of the International Symposium on Memory Systems10.1145/3422575.3422795(223-232)Online publication date: 28-Sep-2020
    • (2018)FC-AMATInnovations in Systems and Software Engineering10.1007/s11334-018-0313-x14:2(143-156)Online publication date: 1-Jun-2018

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media