DOI: 10.1109/IPDPS.2012.117
Article

PARDA: A Fast Parallel Reuse Distance Analysis Algorithm

Published: 21 May 2012

Abstract

Reuse distance is a well-established approach to characterizing data cache locality based on the stack histogram model. So far this analysis has been restricted to offline use because of its high cost, which is often several orders of magnitude larger than the execution time of the analyzed code. This paper presents the first parallel algorithm to compute accurate reuse distances by analysis of memory address traces. The algorithm uses a tunable parameter that enables faster analysis when the maximum needed reuse distance is limited by a cache size upper bound. Experimental evaluation using the SPEC CPU 2006 benchmark suite shows that, using 64 processors and a cache bound of 8 MB, it is possible to perform reuse distance analysis with full accuracy within a factor of 13 to 50 of the original execution times of the benchmarks.
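To make the term concrete: the reuse distance of an access is the number of distinct addresses touched since the previous access to the same address, and a first-time access has infinite distance. The following is a minimal sequential sketch of that definition, not the parallel algorithm the paper presents. The function names, the `bound` parameter (a stand-in for the cache size upper bound mentioned in the abstract), and the toy trace are all assumptions made for illustration.

```python
from collections import OrderedDict

INF = float("inf")

def reuse_distances(trace, bound=None):
    """Return the reuse distance of each access in `trace`.

    Illustrative sequential version (quadratic in the worst case).
    If `bound` is given, only the `bound` most recently used addresses
    are tracked, so any distance >= bound is reported as infinite.
    """
    stack = OrderedDict()  # LRU stack: most recently used address is last
    distances = []
    for addr in trace:
        if addr in stack:
            keys = list(stack)
            # Count the distinct addresses used more recently than addr.
            distances.append(len(keys) - 1 - keys.index(addr))
            del stack[addr]
        else:
            distances.append(INF)  # first use, or evicted by the bound
        stack[addr] = None  # move addr to the top of the stack
        if bound is not None and len(stack) > bound:
            stack.popitem(last=False)  # drop the least recently used entry
    return distances

def miss_ratio(distances, cache_lines):
    """Miss ratio of a fully associative LRU cache with `cache_lines` lines."""
    misses = sum(1 for d in distances if d >= cache_lines)
    return misses / len(distances)

if __name__ == "__main__":
    toy_trace = list("abcabbda")  # hypothetical address trace
    dists = reuse_distances(toy_trace)
    print(dists)
    print(miss_ratio(dists, 3))
```

An access hits in a fully associative LRU cache of C lines exactly when its reuse distance is below C, which is why a single pass over the trace yields miss ratios for every cache size up to the bound.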

Published In

IPDPS '12: Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium
May 2012
1402 pages
ISBN: 9780769546759

Publisher

IEEE Computer Society

United States

Author Tags

  1. Caching
  2. Data Locality
  3. LRU Stack Distance
  4. Performance Analysis
  5. Reuse Distance

Qualifiers

  • Article

Cited By

  • (2024) TTLs Matter: Efficient Cache Sizing with TTL-Aware Miss Ratio Curves and Working Set Sizes. Proceedings of the Nineteenth European Conference on Computer Systems, pp. 387-404. DOI: 10.1145/3627703.3650066. Online publication date: 22-Apr-2024
  • (2023) LLVM Static Analysis for Program Characterization and Memory Reuse Profile Estimation. Proceedings of the International Symposium on Memory Systems, pp. 1-6. DOI: 10.1145/3631882.3631885. Online publication date: 2-Oct-2023
  • (2023) Increment-and-Freeze: Every Cache, Everywhere, All of the Time. Proceedings of the 35th ACM Symposium on Parallelism in Algorithms and Architectures, pp. 129-139. DOI: 10.1145/3558481.3591085. Online publication date: 17-Jun-2023
  • (2022) MemSweeper: virtualizing cluster memory management for high memory utilization and isolation. Proceedings of the 2022 ACM SIGPLAN International Symposium on Memory Management, pp. 15-28. DOI: 10.1145/3520263.3534651. Online publication date: 14-Jun-2022
  • (2021) Hybrid, scalable, trace-driven performance modeling of GPGPUs. Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, pp. 1-15. DOI: 10.1145/3458817.3476221. Online publication date: 14-Nov-2021
  • (2021) DFShards. Proceedings of the 18th ACM International Conference on Computing Frontiers, pp. 63-72. DOI: 10.1145/3457388.3458810. Online publication date: 11-May-2021
  • (2021) Machine Learning–enabled Scalable Performance Prediction of Scientific Codes. ACM Transactions on Modeling and Computer Simulation 31:2, pp. 1-28. DOI: 10.1145/3450264. Online publication date: 23-Apr-2021
  • (2021) Penalty- and Locality-aware Memory Allocation in Redis Using Enhanced AET. ACM Transactions on Storage 17:2, pp. 1-45. DOI: 10.1145/3447573. Online publication date: 28-May-2021
  • (2020) PPT-SASMM: Scalable Analytical Shared Memory Model. Proceedings of the International Symposium on Memory Systems, pp. 341-351. DOI: 10.1145/3422575.3422806. Online publication date: 28-Sep-2020
  • (2020) A Locality Optimizer for Loop-dominated Applications Based on Reuse Distance Analysis. ACM Transactions on Design Automation of Electronic Systems 25:6, pp. 1-26. DOI: 10.1145/3398189. Online publication date: 2-Sep-2020
