Inherently lower-power high-performance superscalar architectures

January 2000

Author:
Victor V. Zyuban,
Director:
Peter M. Kogge

Publisher:

University of Notre Dame
275 Fitzpatrick Hall Notre Dame, IN
United States

ISBN:978-0-599-61169-6

Order Number:AAI9957899

Pages:

218

Purchase on ProQuest

Bibliometrics

Abstract

When it comes to performance, modern computer design has become a well structured art which starts with instruction sets that maximize opportunities for concurrency, follows through with fast organizational techniques such as pipelining and super scalar execution, and ends with clever macro and circuit designs that are based on inherently fast CMOS fabrication technologies. When it comes to low power, however, exactly the opposite is true. Current techniques start with lowering supply voltages and making process changes to minimize capacitance, followed by some relatively simple techniques for minimizing power for particular logic macros, and then utilizing relatively ad hoc techniques, such as ‘sleep modes’, at higher levels. This work attempts to reverse this by bringing the power issue to the earliest phase of high-performance microprocessor development. We propose a methodology for power- optimization of high-performance microprocessors at the microarchitecture level. In particular, our work explores solutions to the problem that do not compromise performance. First, major targets for power reduction are identified within microarchitecture, where power is heavily consumed, or will be heavily consumed in next- generation processors. This involves developing energy models for structures where power grows with increasing issue width, such as Register File, Issue Window, Memory Disambiguation Unit, etc. Then, a multicluster microarchitecture is developed that reduces energy in the identified critical design points, with minimal performance impact, Detailed simulation of the baseline and proposed multicluster architectures has been performed, optimizing both for the energy-delay metric. A comparison of the two microarchitectures, both optimized for energy efficiency, shows that the multicluster architecture is potentially up to twice as energy efficient for wide issue processors, with an advantage that grows with the issue width. Conversely, at the same power dissipation level the new multicluster architecture supports configurations with measurably higher performance than equivalent conventional designs.

Cited By

Contributors

Victor V Zyuban
IBM Thomas J. Watson Research Center
- Publication Years1998 - 2015
- Publication counts32
- Citation count1,141
- Available for Download16
- Downloads (cumulative)8,155
- Downloads (12 months)422
- Downloads (6 weeks)66
- Average Downloads per Article510
- Average Citation per Article36
View Full Profile
Peter Michael Kogge
University of Notre Dame
- Publication Years1988 - 2024
- Publication counts81
- Citation count938
- Available for Download33
- Downloads (cumulative)13,735
- Downloads (12 months)956
- Downloads (6 weeks)178
- Average Downloads per Article416
- Average Citation per Article12
View Full Profile

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Inherently Lower-Power High-Performance Superscalar Architectures

In recent years, reducing power has become an important design goal for high-performance microprocessors. This work attempts to bring the power issue to the earliest phases of microprocessor development, in particular, the stage of defining a chip ...
Efficient superscalar performance through boosting

The foremost goal of superscalar processor design is to increase performance through the exploitation of instruction-level parallelism (ILP). Previous studies have shown that speculative execution is required for high instruction per cycle (IPC) rates ...
Efficient superscalar performance through boosting
ASPLOS V: Proceedings of the fifth international conference on Architectural support for programming languages and operating systems

The foremost goal of superscalar processor design is to increase performance through the exploitation of instruction-level parallelism (ILP). Previous studies have shown that speculative execution is required for high instruction per cycle (IPC) rates ...

Browse Theses

Sections

Cited By