ABSTRACT - Kaushik Mishra (Cache)

This paper proposes a novel, scalable cache miss-handling architecture (MHA) for processors with high memory-level parallelism (MLP). The design is hierarchical, with a small Miss Status Holding Register (MSHR) file per cache bank and a larger MSHR file shared by all banks, and it uses a Bloom filter to reduce searches in the shared file. The result is a high-performance, area-efficient design. It delivers geometric-mean speedups of 32%, 50%, and 95% over a state-of-the-art MHA for SPECint, SPECfp, and multiprogrammed workloads, respectively, and of 1-18% and 10-21% over two extrapolations of current MHA designs that consume the same area.

Copyright:

Attribution Non-Commercial (BY-NC)

Uploaded by

basab

Scalable Cache Miss Handling for High Memory-Level Parallelism

Recently proposed processor microarchitectures for high Memory-Level Parallelism (MLP) promise substantial performance gains. Unfortunately, current cache hierarchies have Miss-Handling Architectures (MHAs) that are too limited to support the required MLP: they need to be redesigned to support 1-2 orders of magnitude more outstanding misses. Yet designing scalable MHAs is challenging: designs must minimize cache lock-up time and deliver high bandwidth while keeping area consumption reasonable. This paper presents a novel scalable MHA design for high-MLP processors. Our design introduces two main innovations. First, it is hierarchical, with a small MSHR (Miss Status Holding Register) file per cache bank and a larger MSHR file shared by all banks. Second, it uses a Bloom filter to reduce searches in the larger MSHR file. The result is a high-performance, area-efficient design. Compared to a state-of-the-art MHA on a high-MLP processor, our design speeds up some SPECint, SPECfp, and multiprogrammed workloads by a geometric mean of 32%, 50%, and 95%, respectively. Moreover, compared to two extrapolations of current MHA designs, namely a large monolithic MSHR file and a large banked MSHR file, all consuming the same area, our design speeds up the workloads by a geometric mean of 1-18% and 10-21%, respectively. Finally, our design performs very close to an unlimited-size, ideal MHA.
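
The two ideas in the abstract, a small per-bank MSHR file backed by a larger shared file and a Bloom filter that avoids needless searches of the shared file, can be illustrated with a minimal Python sketch. This is not the paper's implementation. The class names, capacities, hash functions, overflow policy (new misses go to the shared file only when the bank's own file is full), and the use of a counting Bloom filter so that entries can be removed are all assumptions made for illustration.

```python
class CountingBloom:
    """Counting Bloom filter: never a false negative, supports removal.
    The choice of a counting variant is an assumption, not from the abstract."""

    def __init__(self, size=64, hashes=2):
        self.size = size
        self.hashes = hashes
        self.counts = [0] * size

    def _slots(self, line):
        # Hypothetical hash functions, chosen only for illustration.
        return [(line * (2 * i + 3) + i) % self.size for i in range(self.hashes)]

    def add(self, line):
        for s in self._slots(line):
            self.counts[s] += 1

    def remove(self, line):
        for s in self._slots(line):
            self.counts[s] -= 1

    def may_contain(self, line):
        return all(self.counts[s] > 0 for s in self._slots(line))


class HierarchicalMHA:
    """Small per-bank MSHR files plus one larger shared MSHR file."""

    def __init__(self, banks=4, per_bank=2, shared=8):
        self.banks = banks
        self.per_bank = per_bank
        self.shared_cap = shared
        # Each MSHR file maps a cache-line address to its waiting requests.
        self.dedicated = [dict() for _ in range(banks)]
        self.shared = {}
        self.bloom = CountingBloom()
        self.shared_searches = 0  # how often the shared file was searched

    def _search_shared(self, line):
        # Search the shared file only if the Bloom filter says "maybe".
        if not self.bloom.may_contain(line):
            return False
        self.shared_searches += 1
        return line in self.shared

    def miss(self, line, req):
        """Record a miss; return 'primary', 'secondary', or 'stall'."""
        ded = self.dedicated[line % self.banks]
        if line in ded:
            ded[line].append(req)
            return "secondary"
        if self._search_shared(line):
            self.shared[line].append(req)
            return "secondary"
        if len(ded) < self.per_bank:
            ded[line] = [req]
            return "primary"
        if len(self.shared) < self.shared_cap:
            self.shared[line] = [req]
            self.bloom.add(line)
            return "primary"
        return "stall"  # both levels full: the cache would lock up

    def fill(self, line):
        """The line returned from memory; release and return its waiters."""
        ded = self.dedicated[line % self.banks]
        if line in ded:
            return ded.pop(line)
        if self._search_shared(line):
            self.bloom.remove(line)
            return self.shared.pop(line)
        return []


mha = HierarchicalMHA(banks=2, per_bank=1, shared=2)
results = [mha.miss(0, "a"), mha.miss(0, "b"), mha.miss(2, "c"),
           mha.miss(2, "d"), mha.miss(4, "e"), mha.miss(6, "f")]
print(results, mha.shared_searches)
```

In this sketch the shared file is searched only when the filter reports a possible match, so a miss to a line with no entry in the shared file usually skips that search. A Bloom filter can give false positives but never false negatives, so skipping the search is always safe.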
