Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/956417acmconferencesBook PagePublication PagesmicroConference Proceedingsconference-collections
MICRO 36: Proceedings of the 36th annual IEEE/ACM International Symposium on Microarchitecture
2003 Proceeding
Publisher:
  • IEEE Computer Society
  • 1730 Massachusetts Ave., NW Washington, DC
  • United States
Conference:
MICRO-36: The 36th Annual International Symposium on MicroarchitectureDecember 3 - 5, 2003
ISBN:
978-0-7695-2043-8
Published:
03 December 2003
Sponsors:

Reflects downloads up to 22 Nov 2024Bibliometrics
Abstract

No abstract available.

Article
Message from the General Chair
Page .09
Article
Message from the Program Chair
Page .10
Article
Committees
Page .11
Article
Reviewers
Page .13
Article
Microarchitecture on the MOSFET Diet
Page 3
Article
Razor: A Low-Power Pipeline Based on Circuit-Level Timing Speculation
Page 7

With increasing clock frequencies and silicon integration,power aware computing has become a critical concernin the design of embedded processors and systems-on-chip.One of the more effective and widely used methods for power-awarecomputing is dynamic ...

Article
VSV: L2-Miss-Driven Variable Supply-Voltage Scaling for Low Power
Page 19

Energy-efficient processor design is becoming moreand more important with technology scaling and with highperformance requirements. Supply-voltage scaling is anefficient way to reduce energy by lowering the operatingvoltage and the clock frequency of ...

Article
A Systematic Methodology to Compute the Architectural Vulnerability Factors for a High-Performance Microprocessor
Page 29

Single-event upsets from particle strikes have become akey challenge in microprocessor design. Techniques todeal with these transient faults exist, but come at a cost.Designers clearly require accurate estimates of processorerror rates to make ...

Article
TLC: Transmission Line Caches
Page 43

It is widely accepted that the disproportionate scalingof transistor and conventional on-chip interconnect performancepresents a major barrier to future high performancesystems. Previous research has focused on wire-centricdesigns that use parallelism, ...

Article
Distance Associativity for High-Performance Energy-Efficient Non-Uniform Cache Architectures
Page 55

Wire delays continue to grow as the dominant component oflatency for large caches.A recent work proposed an adaptive,non-uniform cache architecture (NUCA) to manage large, on-chipcaches.By exploiting the variation in access time acrosswidely-spaced ...

Article
Near-Optimal Precharging in High-Performance Nanoscale CMOS Caches
Page 67

High-performance caches statically pull up the bit-linesin all cache subarrays to optimize cache accesslatency. Unfortunately, such an architecture results in asignificant waste of energy in nanoscale CMOS implementationsdue to high leakage and bitline ...

Article
Single-ISA Heterogeneous Multi-Core Architectures: The Potential for Processor Power Reduction
Page 81

This paper proposes and evaluates single-ISA heterogeneousmulti-core architectures as a mechanism to reduceprocessor power dissipation. Our design incorporatesheterogeneous cores representing different points inthe power/performance design space; during ...

Article
Runtime Power Monitoring in High-End Processors: Methodology and Empirical Data
Page 93

With power dissipation becoming an increasingly vexingproblem across many classes of computer systems, measuringpower dissipation of real, running systems has becomecrucial for hardware and software system research and design.Live power measurements are ...

Article
Power-driven Design of Router Microarchitectures in On-chip Networks
Page 105

As demand for bandwidth increases in systems-on-a-chipand chip multiprocessors, networks are fast replacing busesand dedicated wires as the pervasive interconnect fabric foron-chip communication. The tight delay requirements facedby on-chip networks ...

Article
Optimum Power/Performance Pipeline Depth
Page 117

The impact of pipeline length on both the power andperformance of a microprocessor is explored boththeoretically and by simulation. A theory is presented fora wide range of power/performance metrics, BIPSm/W.The theory shows that the more important ...

Article
Processor Acceleration Through Automated Instruction Set Customization
Page 129

Application-specific extensions to the computational capabilities of a processor provide an efficient mechanism to meetthe growing performance and power demands of embeddedapplications. Hardware, in the form of new function units(or co-processors), and ...

Article
The Reconfigurable Streaming Vector Processor (RSVPTM)
Page 141

The need to process multimedia data places largecomputational demands on portable/embedded devices.These multimedia functions share commoncharacteristics: they are computationally intensive anddata-streaming, performing the same operation(s) onmany data ...

Article
Scaling and Charact rizing Database Workloads: Bridging the Gap between Research and Practice
Page 151

On-ine Transaction Processing (OLTP) workloads arecrucial benchmarks for the design and analysis of serverprocessors. Typical cached configurations used byresearchers to simulate OLTP workloads are orders ofmagnitude smaller than the fully scaled ...

Article
In Memory of Bob Rau
Page 165
Article
Generational Cache Management of Code Traces in Dynamic Optimization Systems
Page 169

A dynamic optimizer is a runtime software system thatgroups a program's instruction sequences into traces, optimizesthose traces, stores the optimized traces in a software-basedcode cache, and then executes the optimized code inthe code cache. To ...

Article
The Performance of Runtime Data Cache Prefetching in a Dynamic Optimization System
Page 180

Traditional software controlled data cache prefetching isoften ineffective due to the lack of runtime cache miss andmiss address information. To overcome this limitation, weimplement runtime data cache prefetching in the dynamicoptimization system ADORE ...

Article
IA-32 Execution Layer: a two-phase dynamic translator designed to support IA-32 applications on Itanium®-based systems
Page 191

IA-32 Execution Layer (IA-32 EL) is a newtechnology that executes IA-32 applications onIntel® Itanium® processor family systems.Currently, support for IA-32 applications onItanium-based platforms is achieved usinghardware circuitry on the Itanium ...

Article
LLVA: A Low-level Virtual Instruction Set Architecture
Page 205

A virtual instruction set architecture (V-ISA) implementedvia a processor-specific software translation layercan provide great flexibility to processor designers. Recentexamples such as Crusoe and DAISY, however, haveused existing hardware instruction ...

Article
Comparing Program Phase Detection Techniques
Page 217

Detecting program phase changes accurately is an importantaspect of dynamically adaptable systems. Threedynamic program phase detection techniques are compared- using instruction working sets, basic block vectors(BBV), and conditional branch counts. ...

Article
Using Interaction Costs for Microarchitectural Bottleneck Analysis
Page 228

Attacking bottlenecks in modern processors is difficultbecause many microarchitectural events overlap witheach other. This parallelism makes it difficult to both(a) assign a cost to an event (e.g., to one of two overlappingcache misses) and (b) assign ...

Article
Fast Path-Based Neural Branch Prediction
Page 243

Microarchitectural prediction based on neural learninghas received increasing attention in recent years. However,neural prediction remains impractical because its superioraccuracy over conventional predictors is not enough to offsetthe cost imposed by ...

Article
Hardware Support for Control Transfers in Code Caches
Page 253

Many dynamic optimization and/or binary translationsystems hold optimized/translated superblocks in a codecache. Conventional code caching systems suffer fromoverheads when control is transferred from one cachedsuperblock to another, especially via ...

Article
Exploiting Value Locality in Physical Register Files
Page 265

The physical register file is an important component of adynamically-scheduled processor. Increasing the amount of parallelismplaces increasing demands on the physical register file,calling for alternative file organization and management ...

Article
Macro-op Scheduling: Relaxing Scheduling Loop Constraints
Page 277

Ensuring back-to-back execution of dependent instructionsin a conventional out-of-order processor requiresscheduling logic that wakes up and selects instructions atthe same rate as they are executed. To sustain high performance,integer ALU instructions ...

Article
WaveScalar
Page 291

Silicon technology will continue to provide an exponential increasein the availability of raw transistors. Effectively translatingthis resource into application performance, however,is an open challenge. Ever increasing wire-delay relativeto switching ...

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Acceptance Rates

MICRO 36 Paper Acceptance Rate 35 of 134 submissions, 26%;
Overall Acceptance Rate 484 of 2,242 submissions, 22%
YearSubmittedAcceptedRate
MICRO-482836122%
MICRO-472795319%
MICRO-462393916%
MICRO 412104019%
MICRO 401663521%
MICRO 391744224%
MICRO 381472920%
MICRO 371582918%
MICRO 361343526%
MICRO 331103128%
MICRO 321312721%
MICRO 311082826%
MICRO 301033534%
Overall2,24248422%