Proceedings of the 31st annual international symposium on Computer architecture

ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture

June 2004

2004 Proceeding

Publisher:

IEEE Computer Society
1730 Massachusetts Ave., NW Washington, DC
United States

Conference:

ISCA04: The 31st Annual International Symposium on Computer Architecture 2004 München Germany June 19 - 23, 2004

ISBN:

978-0-7695-2143-5

Published:

19 June 2004

Sponsors:

SIGARCH

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Next Conference

ISCA '25

Sponsor:
sigarch

The 52nd Annual International Symposium on Computer Architecture

June 21 - 25, 2025

Tokyo , Japan

ISCA '25 website

Reflects downloads up to 25 Nov 2024Bibliometrics

Citation Count

2,866

Downloads (6 weeks)

148

Downloads (12 months)

975

Downloads (cumulative)

33,718

Sections

ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture

2004

Previous Next

Abstract

No abstract available.

Select All

Export Citations Save to Binder

Article

General Co-Chair's Message

Page .09

- 0
- 291
Metrics
Total Citations0
Total Downloads291
Last 12 Months4
Last 6 weeks0

Get Access

Article

Program Chair's Message

Page .10

- 0
- 247
Metrics
Total Citations0
Total Downloads247
Last 12 Months3
Last 6 weeks0

Get Access

Article

Committees

Page .11

- 0
- 226
Metrics
Total Citations0
Total Downloads226
Last 12 Months2
Last 6 weeks0

Get Access

Article

Reviewers

Page .13

- 0
- 300
Metrics
Total Citations0
Total Downloads300
Last 12 Months2
Last 6 weeks0

Get Access

Article

Evaluation of the Raw Microprocessor: An Exposed-Wire-Delay Architecture for ILP and Streams

Michael Bedford Taylor,
Walter Lee,
Jason Miller,
David Wentzlaff,
Ian Bratt,
Ben Greenwald,
Henry Hoffmann,
Paul Johnson,
Jason Kim,
James Psota,
Arvind Saraf,
Nathan Shnidman,
Volker Strumpen,
Matt Frank,
Saman Amarasinghe,
Anant Agarwal

Page 2

This paper evaluates the Raw microprocessor. Raw addresses thechallenge of building a general-purpose architecture that performswell on a larger class of stream and embedded computing applicationsthan existing microprocessors, while still running ...

- 264
- 1,596
Metrics
Total Citations264
Total Downloads1,596
Last 12 Months66
Last 6 weeks8

Abstract
Get Access

Article

Evaluating the Imagine Stream Architecture

Jung Ho Ahn,
William J. Dally,
Brucek Khailany,
Ujval J. Kapasi,
Abhishek Das

Page 14

This paper describes an experimental evaluation of theprototype Imagine stream processor. Imagine [Imagine: Media processing with streams] is a stream processor that employs a two-level register hierarchy with9.7 Kbytes of local register file capacity ...

- 96
- 1,020
Metrics
Total Citations96
Total Downloads1,020
Last 12 Months13
Last 6 weeks2

Abstract
Get Access

Article

Field-testing IMPACT EPIC research results in Itanium 2

John W. Sias,
Sain-zee Ueng,
Geoff A. Kent,
Ian M. Steiner,
Erik M. Nystrom,
Wen-mei W. Hwu

Page 26

Explicitly-Parallel Instruction Computing (EPIC) providesarchitectural features, including predication and explicitcontrol speculation, intended to enhance the compiler'sability to expose instruction-level parallelism (ILP) incontrol-intensive programs. ...

- 14
- 602
Metrics
Total Citations14
Total Downloads602
Last 12 Months6
Last 6 weeks0

Abstract
Get Access

Article

Wire Delay is Not a Problem for SMT (In the Near Future)

T. N. Vijaykumar,
Zeshan Chishti

Page 40

Previous papers have shown that the slow scaling of wiredelays compared to logic delays will prevent superscalar performancefrom scaling with technology.In this paper we showthat the optimal pipeline for superscalar becomes shallowerwith technology, ...

- 16
- 513
Metrics
Total Citations16
Total Downloads513
Last 12 Months2
Last 6 weeks0

Abstract
Get Access

Article

The Vector-Thread Architecture

Ronny Krashinsky,
Christopher Batten,
Mark Hampton,
Steve Gerding,
Brian Pharris,
Jared Casper,
Krste Asanovic

Page 52

The vector-thread (VT) architectural paradigm unifies the vectorand multithreaded compute models. The VT abstraction providesthe programmer with a control processor and a vector of virtualprocessors (VPs). The control processor can use vector-fetch ...

- 75
- 1,145
Metrics
Total Citations75
Total Downloads1,145
Last 12 Months45
Last 6 weeks6

Abstract
Get Access

Article

Single-ISA Heterogeneous Multi-Core Architectures for Multithreaded Workload Performance

Rakesh Kumar,
Dean M. Tullsen,
Parthasarathy Ranganathan,
Norman P. Jouppi,
Keith I. Farkas

Page 64

A single-ISA heterogeneous multi-core architecture is achip multiprocessor composed of cores of varying size, performance,and complexity. This paper demonstrates that thisarchitecture can provide significantly higher performance inthe same area than a ...

- 269
- 3,149
Metrics
Total Citations269
Total Downloads3,149
Last 12 Months80
Last 6 weeks8

Abstract
Get Access

Article

Microarchitecture Optimizations for Exploiting Memory-Level Parallelism

Yuan Chou,
Brian Fahs,
Santosh Abraham

Page 76

The performance of memory-bound commercial applicationssuch as databases is limited by increasing memory latencies. Inthis paper, we show that exploiting memory-level parallelism(MLP) is an effective approach for improving the performance ofthese ...

- 178
- 1,883
Metrics
Total Citations178
Total Downloads1,883
Last 12 Months111
Last 6 weeks7

Abstract
Get Access

Article

Memory Ordering: A Value-Based Approach

Harold W. Cain,
Mikko H. Lipasti

Page 90

Conventional out-of-order processors employ a multi-ported,fully-associative load queue to guarantee correctmemory reference order both within a single thread of executionand across threads in a multiprocessor system. Asimprovements in process ...

- 68
- 807
Metrics
Total Citations68
Total Downloads807
Last 12 Months28
Last 6 weeks1

Abstract
Get Access

Article

Transactional Memory Coherence and Consistency

Lance Hammond,
Vicky Wong,
Mike Chen,
Brian D. Carlstrom,
John D. Davis,
Ben Hertzberg,
Manohar K. Prabhu,
Honggo Wijaya,
Christos Kozyrakis,
Kunle Olukotun

Page 102

In this paper, we propos a new shared memory model: Transactionalmemory Coherence and Consistency (TCC).TCC providesa model in which atomic transactions are always the basicunit of parallel work, communication, memory coherence, andmemory reference ...

- 382
- 2,768
Metrics
Total Citations382
Total Downloads2,768
Last 12 Months80
Last 6 weeks10

Abstract
Get Access

Article

TSOtool: A Program for Verifying Memory Systems Using the Memory Consistency Model

Sudheendra Hangal,
Durgam Vahia,
Chaiyasit Manovit,
Juin-Yeu Joseph Lu

Page 114

In this paper, we describe TSOtool, a program to check thebehavior of the memory subsystem in a shared memorymultiprocessor. TSOtool runs pseudo-randomly generatedprograms with data races on a system compliant with theTotal Store Order (TSO) memory ...

- 82
- 688
Metrics
Total Citations82
Total Downloads688
Last 12 Months15
Last 6 weeks2

Abstract
Get Access

Article

SMTp: An Architecture for Next-generation Scalable Multi-threading

Mainak Chaudhuri,
Mark Heinrich

Page 124

We introduce the SMTp architecture-an SMT processoraugmented with a coherence protocol thread context,that together with a standard integrated memory controllercan enable the design of (among other possibilities) scalablecache-coherent hardware ...

- 10
- 1,012
Metrics
Total Citations10
Total Downloads1,012
Last 12 Months7
Last 6 weeks3

Abstract
Get Access

Article

A Formal Approach to Frequent Energy Adaptations for Multimedia Applications

Christopher J. Hughes,
Sarita V. Adve

Page 138

Much research has recently been done on adapting architecturalresources of general-purpose processors to saveenergy at the cost of increased execution time. This workexamines adaptation control algorithms for such processorsrunning real-time multimedia ...

- 13
- 483
Metrics
Total Citations13
Total Downloads483
Last 12 Months2
Last 6 weeks0

Abstract
Get Access

Article

Synchroscalar: A Multiple Clock Domain, Power-Aware, Tile-Based Embedded Processor

John Oliver,
Ravishankar Rao,
Paul Sultana,
Jedidiah Crandall,
Erik Czernikowski,
Leslie W. Jones IV,
Diana Franklin,
Venkatesh Akella,
Frederic T. Chong

Page 150

We present Synchroscalar, a tile-based architecture forembedded processing that is designed to provide the flexibilityof DSPs while approaching the power efficiency ofASICs. We achieve this goal by providing high parallelismand voltage scaling while ...

- 32
- 704
Metrics
Total Citations32
Total Downloads704
Last 12 Months7
Last 6 weeks0

Abstract
Get Access

Article

Power Awareness through Selective Dynamically Optimized Traces

Roni Rosner,
Yoav Almog,
Micha Moffie,
Naftali Schwartz,
Avi Mendelson

Page 162

We present the PARROT concept that seeks to achievehigher performance with reduced energy consumptionthrough gradual optimization of frequently executed codetraces. The PARROT microarchitectural framework integratestrace caching, dynamic optimizations ...

- 33
- 715
Metrics
Total Citations33
Total Downloads715
Last 12 Months4
Last 6 weeks0

Abstract
Get Access

Article

X-RAY: A Non-Invasive Exclusive Caching Mechanism for RAIDs

Lakshmi N. Bairavasundaram,
Muthian Sivathanu,
Andrea C. Arpaci-Dusseau,
Remzi H. Arpaci-Dusseau

Page 176

RAID storage arrays often possess gigabytes of RAM forcaching disk blocks. Currently, most RAID systems use LRUor LRU-like policies to manage these caches. Since these arraycaches do not recognize the presence of file system buffer caches,they ...

- 41
- 483
Metrics
Total Citations41
Total Downloads483
Last 12 Months20
Last 6 weeks0

Abstract
Get Access

Article

Low-Latency Virtual-Channel Routers for On-Chip Networks

Robert Mullins,
Andrew West,
Simon Moore

Page 188

The on-chip communication requirements of manysystems are best served through the deployment of a regularchip-wide network. This paper presents the design of alow-latency on-chip network router for such applications.We remove control overheads (routing ...

- 227
- 1,838
Metrics
Total Citations227
Total Downloads1,838
Last 12 Months46
Last 6 weeks8

Abstract
Get Access

Article

Immunet: A Cheap and Robust Fault-Tolerant Packet Routing Mechanism

V. Puente,
J. A. Gregorio,
F. Vallejo,
R. Beivide

Page 198

A new and efficient mechanism to tolerate failures ininterconnection networks for parallel and distributedcomputers, denoted as Immunet, is presented in this work.In the presence of failures, Immunet automatically reactswith a hardware reconfiguration ...

- 67
- 634
Metrics
Total Citations67
Total Downloads634
Last 12 Months12
Last 6 weeks0

Abstract
Get Access

Article

Adaptive Cache Compression for High-Performance Processors

Alaa R. Alameldeen,
David A. Wood

Page 212

Modern processors use two or more levels ofcache memories to bridge the rising disparity betweenprocessor and memory speeds. Compression canimprove cache performance by increasing effectivecache capacity and eliminating misses. However,decompressing ...

- 137
- 1,622
Metrics
Total Citations137
Total Downloads1,622
Last 12 Months217
Last 6 weeks58

Abstract
Get Access

Article

iWatcher: Efficient Architectural Support for Software Debugging

Pin Zhou,
Feng Qin,
Wei Liu,
Yuanyuan Zhou,
Josep Torrellas

Page 224

Recent impressive performance improvements in computer architecturehave not led to significant gains in ease of debugging.Software debugging often relies on inserting run-time softwarechecks. In many cases, however, it is hard to find the root causeof a ...

- 134
- 731
Metrics
Total Citations134
Total Downloads731
Last 12 Months9
Last 6 weeks2

Abstract
Get Access

Article

From Sequences of Dependent Instructions to Functions: An Approach for Improving Performance without ILP or Speculation

Sami Yehia,
Olivier Temam

Page 238

In this article, we present an approach for improving the performance of sequences of dependent instructions. We observe that many sequences of instructionscan be interpreted as functions. Unlike sequences of instructions, functions can be translated ...

- 40
- 524
Metrics
Total Citations40
Total Downloads524
Last 12 Months3
Last 6 weeks0

Abstract
Get Access

Article

Prophet/Critic Hybrid Branch Prediction

Ayose Falcon,
Jared Stark,
Alex Ramirez,
Konrad Lai,
Mateo Valero

Page 250

This paper introduces the prophet/critic hybrid conditionalbranch predictor, which has two component predictorsthat play the role of either prophet or critic.Theprophet is a conventional predictor that uses branch historyto predict the direction of the ...

- 20
- 983
Metrics
Total Citations20
Total Downloads983
Last 12 Months14
Last 6 weeks1

Abstract
Get Access

Article

Techniques to Reduce the Soft Error Rate of a High-Performance Microprocessor

Christopher Weaver,
Joel Emer,
Shubhendu S. Mukherjee,
Steven K. Reinhardt

Page 264

Transient faults due to neutron and alpha particle strikes posea significant obstacle to increasing processor transistor counts infuture technologies. Although fault rates of individual transistorsmay not rise significantly, incorporating more ...

- 137
- 1,473
Metrics
Total Citations137
Total Downloads1,473
Last 12 Months28
Last 6 weeks6

Abstract
Get Access

Article

The Case for Lifetime Reliability-Aware Microprocessors

Jayanth Srinivasan,
Sarita V. Adve,
Pradip Bose,
Jude A. Rivers

Page 276

Ensuring long processor lifetimes by limiting failuresdue to wear-out related hard errors is a critical requirementfor all microprocessor manufacturers. We observethat continuous device scaling and increasing temperaturesare making lifetime reliability ...

- 167
- 1,607
Metrics
Total Citations167
Total Downloads1,607
Last 12 Months34
Last 6 weeks4

Abstract
Get Access

Article

Exploiting Resonant Behavior to Reduce Inductive Noise

Michael D. Powell,
T. N. Vijaykumar

Page 288

Inductive noise in high-performance microprocessors is a reliabilityissue caused by variations in processor current (di/dt)which are converted to supply-voltage glitches by impedances inthe power-supply network. Inductive noise has been addressed ...

- 28
- 409
Metrics
Total Citations28
Total Downloads409
Last 12 Months3
Last 6 weeks0

Abstract
Get Access

Article

Use-Based Register Caching with Decoupled Indexing

J. Adam Butts,
Gurindar S. Sohi

Page 302

Wide, deep pipelines need many physical registersto hold the results of in-flight instructions. Simultaneously,high clock frequencies prohibit using largeregister files and bypass networks without a significantperformance penalty. Previously proposed ...

- 22
- 478
Metrics
Total Citations22
Total Downloads478
Last 12 Months7
Last 6 weeks1

Abstract
Get Access

Article

A Content Aware Integer Register File Organization

Gonzalez Gonzalez,
Adrian Cristal,
Daniel Ortega,
Alexander Veidenbaum,
Mateo Valero

Page 314

A register file is a critical component of a modernsuperscalar processor.It has a large number of entriesand read/write ports in order to enable high levels ofinstruction parallelism.As a result, the register file'sarea, access time, and energy ...

- 32
- 570
Metrics
Total Citations32
Total Downloads570
Last 12 Months6
Last 6 weeks2

Abstract
Get Access

Save to Binder

Create a New Binder

Name

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

LICS '16: Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science
CSL-LICS '14: Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS)
LICS '20: Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science

Acceptance Rates

ISCA '04 Paper Acceptance Rate 31 of 217 submissions, 14%;

Overall Acceptance Rate 543 of 3,203 submissions, 17%

Year	Submitted	Accepted	Rate
ISCA '22	400	67	17%
ISCA '19	365	62	17%
ISCA '17	322	54	17%
ISCA '13	288	56	19%
ISCA '12	262	47	18%
ISCA '08	259	37	14%
ISCA '06	234	31	13%
ISCA '05	194	45	23%
ISCA '04	217	31	14%
ISCA '03	184	36	20%
ISCA '02	180	27	15%
ISCA '01	163	24	15%
ISCA '99	135	26	19%
Overall	3,203	543	17%

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Save to Binder

Recommendations

LICS '16: Proceedings of the 31st Annual ACM/IEEE Symposium on Logic in Computer Science

CSL-LICS '14: Proceedings of the Joint Meeting of the Twenty-Third EACSL Annual Conference on Computer Science Logic (CSL) and the Twenty-Ninth Annual ACM/IEEE Symposium on Logic in Computer Science (LICS)

LICS '20: Proceedings of the 35th Annual ACM/IEEE Symposium on Logic in Computer Science

Acceptance Rates