Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/144953.145795acmconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
Article
Free access

Code generation schema for modulo scheduled loops

Published: 10 December 1992 Publication History
First page of PDF

References

[1]
(Special issue on IBM RISC System/6000 processor). IBM Journal of Research and Development 34, i (i990).
[2]
Allen, J.R., Kennedy, K., Porterfield, C., and Warren, J. Conversion of control dependence to data dependence. In Proceedings of Tenth Annual ACM Symposium of Principles of Programming languages, (1983).
[3]
Butler, M., et al. Single instruction stream parallelism is greater than two. In Proceedings of Eighteenth Annual International Symposium on Computer Architecture, (Toronto, 1991).
[4]
Char!esworth, A.E. An approach to scientific array processing: the architectural design of the AP-120B/FPS-164 family. IEEE Computer 14, 9 (1981), 18-27.
[5]
Dehnert, J.C., Hsu, P.Y.-T., and Bratt, J.P. Overlapped loop support in the Cydra 5. In Proceedings of the Third International Conference on Architectural Support for Program tm'ng~guages and Operating Systems, (Boston, Mass., 1989), 26-38.
[6]
Ebcioglu, K., and Nakatanl, T. A new compilation technique for Raralle!!zi.ng loops with unpredictable branches on a v 14 w arcmtecmre, m canguages and Compilers for Parallel Computing, Gelernter, D., Nicolau, A., and Padua, D., Editor. 1989, Pitman/The MIT Press: London. p. 213-229.
[7]
Fisher, J.A. Trace scheduling: a technique for global microcode compaction. IEEE Transactions on Computers C- 30, 7 (1981).
[8]
Foster, C.C., and Riseman, E.M. Percolation of code to enhance parallel dispatching and execution. IEEE Transactions on Computers C-21, 12 (1972), 1411-1415.
[9]
Hsu, P.Y.-T. Highly Concurrent Scalar Processing. Coordinated Science Lab. Technical Report CSG-49. University of Illinois, 1986.
[10]
Jain, S. Circular scheduling: a new technique to perform software pipelining,m' rroceeamgs oj ~ne ~c. ln olurt~,~v ~I Conference on Programming Language Design and Implementation, (1991), 219-228.
[11]
Dam, M. Software pipelining; an effective scheduling technique for VLIW machines. In Proceedings of the ACM SIGPLAN '88 Conference on Programming Language Design and_ !mp!en~ntation; (!988), 3!8-327:
[12]
Lee, R.L., Kwok, A.Y., and Briggs, F.A. The floating point performance of a superscalar SPARC processor. In Proceedings of the Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, (Santa Clara, California, 1991), 28-37.
[13]
Mahlke, S.A., et al. Sentinel scheduling for VLIW and superscaiar processors, in Proceedings of the The Fifth international Conference on Architectural Support for Programming Languages and Operating Systems, (Boston, IVIaSSaC, IIU$St~LI~, 1 ~,~,).
[14]
Nicolau, A., and Fisher, J.A. Measuring the parallelism available for very long instruction word architectures. IEEE Transactions on Comp,ters C-33.
[15]
Nicolau, A., and Potasman, R. Realistic scheduling: compaction for pipelined axchitecmres. In Proceedings of the 23th Ann_u.a! Workshop on Microprogramming and Microarchitecture, (Orlando, Florida, 1990), 69-79.
[16]
Rau, B.R., and Glaeser, C.D. Some scheduling techniques and an easily schedulahle horizontal architecture for high performance scientific computing. In Proceedings of the Fourteenth Annual Workshop on Microprogramming, (1981), 183-198.
[17]
Rau, B.R., Lee, M., Tirumalai, P., and Schlansker, M.S. Register allocation for software pipelined loops. In Proceedings of the SIGPLAN'92 Conference on Programming Language Design and implementation, (San Francisco, 1992).
[18]
Rau, B.R., Schlansker, M.S., and Tirumalai, P.P. Code generation schemas for modulo scheduled DO-loops and Laboratories, 1992.
[19]
Rau, B.R., Yen, D.W.L., Yen, W., and Towle, R.A. The Cydra s; departmental mpercomputer: design nhiln.~cmhles~ decisions and trade-offs. IEEE Computer 22, i (1989).
[20]
Riseman, E.M., and Foster, C.C. The inhibition of potential parallelism by conditional jumps, iEEE Transactions on Computers C-21, 12 (1972), 1405-1411.
[21]
Su, B., and Wang, J. GURPR*: a new global software pipelining algorithm. In Proceedings of the 24th Annual International Symposium on Microarchitecture, (Albuquerque, New Mexico, 1991), 212-216.
[22]
Tirumalai, P., Lee, M., and Schlansker, M.S. Parallelizatioa of loops with exits on pipeiined architectures, in Proceedings of the Supercomputing '90, (1990), 200-212.
[23]
Tjaden, G.S., and Flynn, M.J. Detection and parallel execution OI pigasiel iu~tructlun~, tg~,f-, Transactions on Computers C-19 10 (1970), 889-895.
[24]
Touzeau, R.F. A FORTRAN compiler for the FPS-164 Scientific computer. In proceedings of the ACM SIGPLAN'84 Symposium on Compiler Construction, (1984), 48-57.
[25]
Wall, D.W. Limits of instruction-level parallelism. In Prncn~.dln~,s nf the Fourth International Conference on ~4rchite-ctu~al-Support-for'-Programming Languages and Operating Systems, (1991), 176-188.

Cited By

View all
  • (2010)Minimizing communication in rate-optimal software pipelining for stream programsProceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization10.1145/1772954.1772984(210-217)Online publication date: 24-Apr-2010
  • (2009)Synergistic execution of stream programs on multicores with acceleratorsACM SIGPLAN Notices10.1145/1543136.154246644:7(99-108)Online publication date: 19-Jun-2009
  • (2009)Synergistic execution of stream programs on multicores with acceleratorsProceedings of the 2009 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems10.1145/1542452.1542466(99-108)Online publication date: 19-Jun-2009
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
MICRO 25: Proceedings of the 25th annual international symposium on Microarchitecture
December 1992
301 pages
ISBN:0818631759

Sponsors

Publisher

IEEE Computer Society Press

Washington, DC, United States

Publication History

Published: 10 December 1992

Check for updates

Qualifiers

  • Article

Conference

MICRO92
Sponsor:

Acceptance Rates

Overall Acceptance Rate 484 of 2,242 submissions, 22%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)139
  • Downloads (Last 6 weeks)38
Reflects downloads up to 24 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2010)Minimizing communication in rate-optimal software pipelining for stream programsProceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization10.1145/1772954.1772984(210-217)Online publication date: 24-Apr-2010
  • (2009)Synergistic execution of stream programs on multicores with acceleratorsACM SIGPLAN Notices10.1145/1543136.154246644:7(99-108)Online publication date: 19-Jun-2009
  • (2009)Synergistic execution of stream programs on multicores with acceleratorsProceedings of the 2009 ACM SIGPLAN/SIGBED conference on Languages, compilers, and tools for embedded systems10.1145/1542452.1542466(99-108)Online publication date: 19-Jun-2009
  • (2009)Software Pipelined Execution of Stream Programs on GPUsProceedings of the 7th annual IEEE/ACM International Symposium on Code Generation and Optimization10.1109/CGO.2009.20(200-209)Online publication date: 22-Mar-2009
  • (2008)VEALACM SIGARCH Computer Architecture News10.1145/1394608.138215536:3(389-400)Online publication date: 1-Jun-2008
  • (2008)Orchestrating the execution of stream programs on multicore platformsACM SIGPLAN Notices10.1145/1379022.137559643:6(114-124)Online publication date: 7-Jun-2008
  • (2008)Orchestrating the execution of stream programs on multicore platformsProceedings of the 29th ACM SIGPLAN Conference on Programming Language Design and Implementation10.1145/1375581.1375596(114-124)Online publication date: 7-Jun-2008
  • (2008)VEALProceedings of the 35th Annual International Symposium on Computer Architecture10.1109/ISCA.2008.33(389-400)Online publication date: 21-Jun-2008
  • (2008)Timing optimization via nest-loop pipelining considering code sizeMicroprocessors & Microsystems10.1016/j.micpro.2008.02.00232:7(351-363)Online publication date: 1-Oct-2008
  • (2005)Exploiting Vector Parallelism in Software Pipelined LoopsProceedings of the 38th annual IEEE/ACM International Symposium on Microarchitecture10.1109/MICRO.2005.20(119-129)Online publication date: 12-Nov-2005
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media