Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/70082.68185acmconferencesArticle/Chapter ViewAbstractPublication PagesasplosConference Proceedingsconference-collections
Article
Free access

Overlapped loop support in the Cydra 5

Published: 01 April 1989 Publication History

Abstract

The CydraTM 5 architecture adds unique support for overlapping successive iterations of a loop to a very long instruction word (VLIW) base. This architecture allows highly parallel loop execution for a much larger class of loops than can be vectorized, without requiring the unrolling of loops usually used by compilers for VLIW machines. This paper discusses the Cydra 5 loop scheduling model, the special architectural features which support it, and the loop compilation techniques used to take full advantage of the architecture.

References

[1]
Allen, J.R., "Dependence Analysis for Subscripted Variables and its Application to Program Transformations," Ph.D. Thesis, Rice University, April 1983.
[2]
Allen, J.R., and K. Kennedy, "Automatic Translation of FORTRAN Programs to Vector Form," Transactions on Programming Languages and Systems, October 1987.
[3]
Allen, J.R., K. Kennedy, C. Portertield, and J. Warren, "Conversion of Control Dependence to Data Dependence," Proc. of the 10th Annual ACM Syrup. on Principals of Programming Languages, January 1983.
[4]
Banerjee, U., "Data Dependence in Ordinary Programs," Report No. UiUCDCS-R-76-837 (M.S. Thesis), Dept. of Computer Science, University of Illinois at Urbma-Champaign, November 1976.
[5]
Callahan, D., I. Cocko, and K. Konn~y, "Estimating Interlock and improving Balance for Pipclined Architectures," Journal of Parallel and Distributed Computing, August 1988, pp. 334-358.
[6]
Charleswonh, A.E., "An Approach to Scientific Array Processing: The Architectural Design of the AP-120BIFPS-164 Family," iNEE Computer, September 1981, pp. 18-27.
[7]
Colwell, R.P., et al, "A VLIW Architecture for a Trace Scheduling Compiler," SIGPLAN Notices 22, 10 (Proc. of ASPLOS II), Octobcr 1987.
[8]
Ellis, J.R., Bulldog: A Compiler for VI2W Architectures, MIT Press, Cambridge, Mass., 1986.
[9]
Ferrante, J., "What's in a Name, Or The Value of Renaming for ParaUelism De~on and Storage AHocation," Technical Report //12157, IBM Thomas J. Watson Research Center, January 1987.
[10]
Fisher, J.A., "Very Long Instruction Word Architectures and the ELI-512," Proc. of the 10th Annual Int'l Syrup. on Computer Architecture, june 1983.
[11]
Hennessy, J., N. Iouppi, S. Przbysld, C. Rowen, T. Gross, F. Baakett, and J. Gill, "MIPS: A Microprocessor Architecture," 15th Annual Workshop on Microprogrammlng, October 1982, pp. 17-22.
[12]
Hsu, Peter Y.-T., Highly Concurrent Scalar Processing, Ph.D. Thesis, Univ~sity of Illinois at Urbana- Champaign, 1986.
[13]
Joshi, S.M., and D.M. Dhamdhere, "A Composite Hoisting - Strength Reduction Transformation for Global Program Optimization," Parts I and II, Int'l J. Computer Math., VoL 11, 1982, pp. 21=41, 111-126.
[14]
Kuck, DJ., et al, "Dependence Graphs and Compiler Optimizations," Proc. of the 8th Annual ACM Syrup. on Principles of Programming Languages, January 1981.
[15]
Lain, M.S.-L., "A Systolic Array Optimizing Compiler," Ph.D. Thesis, Carnegie Mellon University, May 1987.
[16]
McMahon, F.H., '~he Livemx)re Fortran Kernels: A Computer Test of the Numcrical Performance Range," Technical Report UCRL-53745, Lawrence Livcrmorc National Laboratory, December 1986.
[17]
Rau, B.R., and C.D. Glaeser, "Some Scheduling Techniques and an Easily Schedulable Horizontal Architecture for High Performance Scientific Computing," Proc. of the 14th Annual Microprogramming Workshop, October 1982.
[18]
Rau, B.R., C.D. Glaescr, and R.L. Picard, "Efficient Code Generation for Horizontal Architectures: Compiler Techniques and Architectural Support," Pt~. of the 9th Annual Int'l Syrup. on Computer Architecture, 1982.
[19]
Rau, B.R., "Cydra 5 Dlrect~ Dataflow Architecture," Proc. of COMPCON 88, San Francisco, California, t988.
[20]
Rau, B.R., D.W.L. Yen, W. Yen, and R.A. Towle, "l'he Cydra 5 Departmental Supercomputer: Design Philosophies, Decisions and Trade-offs," to appear in IEEE Computer special issue "Real Machines," 1989.
[21]
Schlansker, M., and M. McNamara, "The Cydra 5 Computer System Architecture," Proc. of ICCD'88, October, 1988.
[22]
Touzeau, R.F., "A Fortran Compiler for the FPS-164 Scientific Computer," SIGPLAN Notices (Proc. of the SiGPLAN '84 Syrup. on Compiler Construction), June 1984, pp. 48-57.
[23]
Towle, R.A., "Control and Data Dependence for Program Transformations," Ph.D. Thesis, University of Illinois at Urbana-Champaign, March 1976.

Cited By

View all
  • (2016)Array Data Flow Analysis for Load-Store Optimizations in Fine-Grain ArchitecturesInternational Journal of Parallel Programming10.1007/BF0335675724:6(481-512)Online publication date: 26-May-2016
  • (2016)Evaluating the Effects of Predicated Execution on Branch PredictionInternational Journal of Parallel Programming10.1007/BF0335674624:2(159-186)Online publication date: 26-May-2016
  • (2016)Modulo SchedulingInstruction Level Parallelism10.1007/978-1-4899-7797-7_6(133-165)Online publication date: 30-Nov-2016
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ASPLOS III: Proceedings of the third international conference on Architectural support for programming languages and operating systems
April 1989
303 pages
ISBN:0897913000
DOI:10.1145/70082
  • cover image ACM SIGARCH Computer Architecture News
    ACM SIGARCH Computer Architecture News  Volume 17, Issue 2
    Special issue: Proceedings of ASPLOS-III: the third international conference on architecture support for programming languages and operating systems
    April 1989
    291 pages
    ISSN:0163-5964
    DOI:10.1145/68182
    Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 April 1989

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

ASPLOS89
Sponsor:

Acceptance Rates

Overall Acceptance Rate 535 of 2,713 submissions, 20%

Upcoming Conference

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)81
  • Downloads (Last 6 weeks)11
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2016)Array Data Flow Analysis for Load-Store Optimizations in Fine-Grain ArchitecturesInternational Journal of Parallel Programming10.1007/BF0335675724:6(481-512)Online publication date: 26-May-2016
  • (2016)Evaluating the Effects of Predicated Execution on Branch PredictionInternational Journal of Parallel Programming10.1007/BF0335674624:2(159-186)Online publication date: 26-May-2016
  • (2016)Modulo SchedulingInstruction Level Parallelism10.1007/978-1-4899-7797-7_6(133-165)Online publication date: 30-Nov-2016
  • (2016)Overview of ILP ArchitecturesInstruction Level Parallelism10.1007/978-1-4899-7797-7_2(9-42)Online publication date: 30-Nov-2016
  • (2014)BibliographyAdvanced Backend Code Optimization10.1002/9781118625446.biblio(327-343)Online publication date: 3-Jun-2014
  • (2013)The code size advantage of predicted execution for software pipelining9th Computing in Aerospace Conference10.2514/6.1993-4671Online publication date: 18-Feb-2013
  • (2012)On the effectiveness of register moves to minimise post-pass unrolling in software pipelined loops2012 International Conference on High Performance Computing & Simulation (HPCS)10.1109/HPCSim.2012.6266972(551-558)Online publication date: Jul-2012
  • (2012)Minimal Unroll Factor for Code Generation of Software PipeliningInternational Journal of Parallel Programming10.1007/s10766-012-0203-z41:1(1-58)Online publication date: 17-Jul-2012
  • (2010)Software PipeliningThe Compiler Design Handbook10.1201/9781420040579.ch18Online publication date: 7-Mar-2010
  • (2010)Instruction SchedulingThe Compiler Design Handbook10.1201/9781420040579.ch17Online publication date: 7-Mar-2010
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media