Loop optimization techniques on multi-issue architectures

September 1995

Author:
Dan Richard Kaiser
The Univ. of Michigan

Publisher:
University of Michigan
Dept. 72 Ann Arbor, MI
United States

Order Number: UMI Order No. GAX95-27657

Abstract

This work examines the interaction of compiler scheduling techniques with processor features such as the instruction issue policy. Scheduling techniques designed to exploit instruction-level parallelism are employed to schedule instructions for a set of multi-issue architectures. A compiler is developed which supports block scheduling, loop unrolling, and software pipelining for a range of target architectures. The compiler supports aggressive loop optimizations such as induction variable detection, strength reduction, and code hoisting. A set of machine configurations based on the MIPS R3000 ISA is simulated, allowing the performance of the combined compiler-processor to be studied. The Aurora III, a prototype superscalar processor, is used as a case study for the interaction of compiler scheduling techniques with processor architecture.

Our results show that the scheduling technique chosen for the compiler has a significant impact on the overall system performance and can even change the rank ordering when comparing the performance of VLIW, DAE and superscalar architectures. Our results further show that, while significant, the performance effects of the instruction issue policy may not be as large as the effects of other processor features that may be less costly to implement, such as 64-bit-wide data paths or store buffers.
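For readers unfamiliar with the loop optimizations named in the abstract, the sketch below applies three of them by hand to a small strided loop: strength reduction of the induction expression, hoisting of a loop-invariant value, and unrolling by a factor of two. It is an illustration only and is not taken from the thesis or its compiler; the function names, the strided-SAXPY loop, and the unroll factor are all assumptions chosen for brevity.

```python
# Illustrative only: hand-applied versions of transformations a compiler
# might perform. All names here are hypothetical, not from the thesis.

def saxpy_baseline(a, x, y, stride):
    """Reference loop: the index i * stride is recomputed every iteration."""
    n = len(y)
    out = list(y)
    for i in range(n):
        out[i] = a * x[i * stride] + y[i]
    return out


def saxpy_optimized(a, x, y, stride):
    """Same computation after strength reduction, hoisting, and 2x unrolling."""
    n = len(y)
    out = list(y)
    step2 = 2 * stride   # code hoisting: loop-invariant value computed once
    idx = 0              # strength reduction: idx tracks i * stride by addition
    i = 0
    # Unrolled body: two original iterations per trip, giving a scheduler
    # two independent multiply-add chains to interleave.
    while i + 1 < n:
        out[i] = a * x[idx] + y[i]
        out[i + 1] = a * x[idx + stride] + y[i + 1]
        idx += step2
        i += 2
    # Remainder iteration when the trip count is odd.
    if i < n:
        out[i] = a * x[idx] + y[i]
    return out
```

Both functions return the same list for any input where `x` holds at least `(len(y) - 1) * stride + 1` elements. The unrolled form simply exposes more independent work per loop trip, which is the kind of parallelism a multi-issue machine can exploit.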

Cited By

Das A, Dally W and Mattson P. Compiling for stream processing. Proceedings of the 15th International Conference on Parallel Architectures and Compilation Techniques, (33-42)


