DOI: 10.1145/36206.36201

A VLIW architecture for a trace scheduling compiler

Published: 01 October 1987

Abstract

Very Long Instruction Word (VLIW) architectures were promised to deliver far more than the factor of two or three that current architectures achieve from overlapped execution. Using a new type of compiler which compacts ordinary sequential code into long instruction words, a VLIW machine was expected to provide from ten to thirty times the performance of a more conventional machine built of the same implementation technology.

Multiflow Computer, Inc., has now built a VLIW called the TRACE™ along with its companion Trace Scheduling™ compacting compiler. This new machine has fulfilled the performance promises that were made. Using many fast functional units in parallel, this machine extends some of the basic Reduced-Instruction-Set precepts: the architecture is load/store, the microarchitecture is exposed to the compiler, there is no microcode, and there is almost no hardware devoted to synchronization, arbitration, or interlocking of any kind (the compiler has sole responsibility for runtime resource usage).

This paper discusses the design of this machine and presents some initial performance results.

Published In

ASPLOS II: Proceedings of the second international conference on Architectural support for programming languages and operating systems
October 1987
205 pages
ISBN: 0818608056
DOI: 10.1145/36206
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

  • Article

Conference

ASPLOS II

Acceptance Rates

Overall Acceptance Rate 535 of 2,713 submissions, 20%


Article Metrics

  • Downloads (Last 12 months): 425
  • Downloads (Last 6 weeks): 47

Reflects downloads up to 02 Oct 2024.


Cited By

  • (2016) The Case for VLIW-CMP as a Building Block for Exascale. IEEE Computer Architecture Letters 15:1 (54-57). DOI: 10.1109/LCA.2015.2424699. Online publication date: 1-Jan-2016
  • (2015) Branch vanguard. ACM SIGARCH Computer Architecture News 43:3S (323-335). DOI: 10.1145/2872887.2750400. Online publication date: 13-Jun-2015
  • (2015) Branch vanguard. Proceedings of the 42nd Annual International Symposium on Computer Architecture (323-335). DOI: 10.1145/2749469.2750400. Online publication date: 13-Jun-2015
  • (2011) From C to Silicon. Anatomy of a Silicon Compiler (251-268). DOI: 10.1007/978-1-4615-3570-6_17. Online publication date: 31-Mar-2011
  • (2010) Power-Aware Embedded Computing. Embedded Systems Handbook, Second Edition (11-1-11-25). DOI: 10.1201/9781439807637.ch11. Online publication date: 30-Apr-2010
  • (2010) Extension of Superblock Technique to Hyperblock Using Predicate Hierarchy Graph. Contemporary Computing (217-229). DOI: 10.1007/978-3-642-14825-5_19. Online publication date: 2010
  • (2007) Cluster-level simultaneous multithreading for VLIW processors. 2007 25th International Conference on Computer Design (121-128). DOI: 10.1109/ICCD.2007.4601890. Online publication date: Oct-2007
  • (2007) Merge Logic for Clustered Multithreaded VLIW Processors. 10th Euromicro Conference on Digital System Design Architectures, Methods and Tools (DSD 2007) (353-360). DOI: 10.1109/DSD.2007.4341492. Online publication date: Aug-2007
  • (2004) Software pipelining. ACM SIGPLAN Notices 39:4 (244-256). DOI: 10.1145/989393.989420. Online publication date: 1-Apr-2004
  • (2001) Compiler-Assisted Multiple Instruction Word Retry for VLIW Architectures. IEEE Transactions on Parallel and Distributed Systems 12:12 (1293-1304). DOI: 10.1109/71.970564. Online publication date: 1-Dec-2001