Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/998680acmconferencesBook PagePublication PagesiscaConference Proceedingsconference-collections
ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture
2004 Proceeding
Publisher:
  • IEEE Computer Society
  • 1730 Massachusetts Ave., NW Washington, DC
  • United States
Conference:
ISCA04: The 31st Annual International Symposium on Computer Architecture 2004 München Germany June 19 - 23, 2004
ISBN:
978-0-7695-2143-5
Published:
19 June 2004
Sponsors:
Next Conference
Reflects downloads up to 25 Nov 2024Bibliometrics
Abstract

No abstract available.

Article
General Co-Chair's Message
Page .09
Article
Program Chair's Message
Page .10
Article
Committees
Page .11
Article
Reviewers
Page .13
Article
Evaluation of the Raw Microprocessor: An Exposed-Wire-Delay Architecture for ILP and Streams
Page 2

This paper evaluates the Raw microprocessor. Raw addresses thechallenge of building a general-purpose architecture that performswell on a larger class of stream and embedded computing applicationsthan existing microprocessors, while still running ...

Article
Evaluating the Imagine Stream Architecture
Page 14

This paper describes an experimental evaluation of theprototype Imagine stream processor. Imagine [Imagine: Media processing with streams] is a stream processor that employs a two-level register hierarchy with9.7 Kbytes of local register file capacity ...

Article
Field-testing IMPACT EPIC research results in Itanium 2
Page 26

Explicitly-Parallel Instruction Computing (EPIC) providesarchitectural features, including predication and explicitcontrol speculation, intended to enhance the compiler'sability to expose instruction-level parallelism (ILP) incontrol-intensive programs. ...

Article
Wire Delay is Not a Problem for SMT (In the Near Future)
Page 40

Previous papers have shown that the slow scaling of wiredelays compared to logic delays will prevent superscalar performancefrom scaling with technology.In this paper we showthat the optimal pipeline for superscalar becomes shallowerwith technology, ...

Article
The Vector-Thread Architecture
Page 52

The vector-thread (VT) architectural paradigm unifies the vectorand multithreaded compute models. The VT abstraction providesthe programmer with a control processor and a vector of virtualprocessors (VPs). The control processor can use vector-fetch ...

Article
Single-ISA Heterogeneous Multi-Core Architectures for Multithreaded Workload Performance
Page 64

A single-ISA heterogeneous multi-core architecture is achip multiprocessor composed of cores of varying size, performance,and complexity. This paper demonstrates that thisarchitecture can provide significantly higher performance inthe same area than a ...

Article
Microarchitecture Optimizations for Exploiting Memory-Level Parallelism
Page 76

The performance of memory-bound commercial applicationssuch as databases is limited by increasing memory latencies. Inthis paper, we show that exploiting memory-level parallelism(MLP) is an effective approach for improving the performance ofthese ...

Article
Memory Ordering: A Value-Based Approach
Page 90

Conventional out-of-order processors employ a multi-ported,fully-associative load queue to guarantee correctmemory reference order both within a single thread of executionand across threads in a multiprocessor system. Asimprovements in process ...

Article
Transactional Memory Coherence and Consistency
Page 102

In this paper, we propos a new shared memory model: Transactionalmemory Coherence and Consistency (TCC).TCC providesa model in which atomic transactions are always the basicunit of parallel work, communication, memory coherence, andmemory reference ...

Article
TSOtool: A Program for Verifying Memory Systems Using the Memory Consistency Model
Page 114

In this paper, we describe TSOtool, a program to check thebehavior of the memory subsystem in a shared memorymultiprocessor. TSOtool runs pseudo-randomly generatedprograms with data races on a system compliant with theTotal Store Order (TSO) memory ...

Article
SMTp: An Architecture for Next-generation Scalable Multi-threading
Page 124

We introduce the SMTp architecture-an SMT processoraugmented with a coherence protocol thread context,that together with a standard integrated memory controllercan enable the design of (among other possibilities) scalablecache-coherent hardware ...

Article
A Formal Approach to Frequent Energy Adaptations for Multimedia Applications
Page 138

Much research has recently been done on adapting architecturalresources of general-purpose processors to saveenergy at the cost of increased execution time. This workexamines adaptation control algorithms for such processorsrunning real-time multimedia ...

Article
Synchroscalar: A Multiple Clock Domain, Power-Aware, Tile-Based Embedded Processor
Page 150

We present Synchroscalar, a tile-based architecture forembedded processing that is designed to provide the flexibilityof DSPs while approaching the power efficiency ofASICs. We achieve this goal by providing high parallelismand voltage scaling while ...

Article
Power Awareness through Selective Dynamically Optimized Traces
Page 162

We present the PARROT concept that seeks to achievehigher performance with reduced energy consumptionthrough gradual optimization of frequently executed codetraces. The PARROT microarchitectural framework integratestrace caching, dynamic optimizations ...

Article
X-RAY: A Non-Invasive Exclusive Caching Mechanism for RAIDs
Page 176

RAID storage arrays often possess gigabytes of RAM forcaching disk blocks. Currently, most RAID systems use LRUor LRU-like policies to manage these caches. Since these arraycaches do not recognize the presence of file system buffer caches,they ...

Article
Low-Latency Virtual-Channel Routers for On-Chip Networks
Page 188

The on-chip communication requirements of manysystems are best served through the deployment of a regularchip-wide network. This paper presents the design of alow-latency on-chip network router for such applications.We remove control overheads (routing ...

Article
Immunet: A Cheap and Robust Fault-Tolerant Packet Routing Mechanism
Page 198

A new and efficient mechanism to tolerate failures ininterconnection networks for parallel and distributedcomputers, denoted as Immunet, is presented in this work.In the presence of failures, Immunet automatically reactswith a hardware reconfiguration ...

Article
Adaptive Cache Compression for High-Performance Processors
Page 212

Modern processors use two or more levels ofcache memories to bridge the rising disparity betweenprocessor and memory speeds. Compression canimprove cache performance by increasing effectivecache capacity and eliminating misses. However,decompressing ...

Article
iWatcher: Efficient Architectural Support for Software Debugging
Page 224

Recent impressive performance improvements in computer architecturehave not led to significant gains in ease of debugging.Software debugging often relies on inserting run-time softwarechecks. In many cases, however, it is hard to find the root causeof a ...

Article
From Sequences of Dependent Instructions to Functions: An Approach for Improving Performance without ILP or Speculation
Page 238

In this article, we present an approach for improving the performance of sequences of dependent instructions. We observe that many sequences of instructionscan be interpreted as functions. Unlike sequences of instructions, functions can be translated ...

Article
Prophet/Critic Hybrid Branch Prediction
Page 250

This paper introduces the prophet/critic hybrid conditionalbranch predictor, which has two component predictorsthat play the role of either prophet or critic.Theprophet is a conventional predictor that uses branch historyto predict the direction of the ...

Article
Techniques to Reduce the Soft Error Rate of a High-Performance Microprocessor
Page 264

Transient faults due to neutron and alpha particle strikes posea significant obstacle to increasing processor transistor counts infuture technologies. Although fault rates of individual transistorsmay not rise significantly, incorporating more ...

Article
The Case for Lifetime Reliability-Aware Microprocessors
Page 276

Ensuring long processor lifetimes by limiting failuresdue to wear-out related hard errors is a critical requirementfor all microprocessor manufacturers. We observethat continuous device scaling and increasing temperaturesare making lifetime reliability ...

Article
Exploiting Resonant Behavior to Reduce Inductive Noise
Page 288

Inductive noise in high-performance microprocessors is a reliabilityissue caused by variations in processor current (di/dt)which are converted to supply-voltage glitches by impedances inthe power-supply network. Inductive noise has been addressed ...

Article
Use-Based Register Caching with Decoupled Indexing
Page 302

Wide, deep pipelines need many physical registersto hold the results of in-flight instructions. Simultaneously,high clock frequencies prohibit using largeregister files and bypass networks without a significantperformance penalty. Previously proposed ...

Article
A Content Aware Integer Register File Organization
Page 314

A register file is a critical component of a modernsuperscalar processor.It has a large number of entriesand read/write ports in order to enable high levels ofinstruction parallelism.As a result, the register file'sarea, access time, and energy ...

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Acceptance Rates

ISCA '04 Paper Acceptance Rate 31 of 217 submissions, 14%;
Overall Acceptance Rate 543 of 3,203 submissions, 17%
YearSubmittedAcceptedRate
ISCA '224006717%
ISCA '193656217%
ISCA '173225417%
ISCA '132885619%
ISCA '122624718%
ISCA '082593714%
ISCA '062343113%
ISCA '051944523%
ISCA '042173114%
ISCA '031843620%
ISCA '021802715%
ISCA '011632415%
ISCA '991352619%
Overall3,20354317%