Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/360128.360142acmconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
Article
Free access

Modulo scheduling for a fully-distributed clustered VLIW architecture

Published: 01 December 2000 Publication History
First page of PDF

Supplementary Material

PS File (p124-sanchez.ps)

References

[1]
V. Agarwal, M.S. Hrishikesh, S.W. Keckler and D. Burger, "Clock Rate versus IPC: The End of the Road For ConventionalMicroarchitectures, in Procs. of the 27th. Int. Symp. on Computer Architecture, pp. 248-259, June 2000
[2]
E. Ayguade, C. Barrado, A. Gonzalez, J. Labarta, D. Lopez, S. Moreno, D. Padua, F. Reig, Q. Riera and M. Valero, Ictineo: a Tool for Research on ILP", in Supercomputing'96 (SC'96), Research Exhibit Polaris at Work, 1996
[3]
N. Bermudo, X. Vera, A. Gonzalez and J. Llosa, An Efficient Solver for Cache Miss Equations, in Procs. of Int. Symp. on Performance Analysis and System Software, April 2000
[4]
A. Capitanio, D. Dytt and A. Nicolau,Partitioned Register Files for VLIWs: A Preliminary Analysis of Tradeoffs, in Procs. of 25th. Int. Symp. on Microarchitecture, pp. 192- 300, 1992
[5]
D. Culler and J.P. Singh, Parallel Computer Architecture. A Hardware/Software Approach, Morgan Kaufmann Publishers, Inc., 1999
[6]
J. R. Ellis, Bulldog: A Compiler for VLIW Architectures, MIT Press, pp. 180-184, 1986
[7]
M.M. Fernandes, J. Llosa and N. Topham, Distributed Modulo Scheduling, in Procs. of Int. Symp. on High-Performance Computer Architecture, pp. 130-134, Jan. 1999
[8]
J. Fridman and Zvi Greefield, The TigerSharc DSP Architecture, IEEE Micro, pp. 66-76, Jan-Feb. 2000
[9]
S. Ghosh, M. Martonosi and S. Malik, Cache Miss Equations: an Analytical Representation of Cache Misses, in Procs. of Int. Conf. on Supercomputing (ICS'97), pp. 317- 324, July 1997
[10]
L. Gwennap, Digital 21264 Sets New Standard, Microprocessor Report, 10(14), Oct. 1996
[11]
S. Jang, S. Carr, P. Sweany and D. Kuras, A Code Generation Framework for VLIW Architectures with Partitioned Register Banks, in Procs. of 3rd. Int. Conf. on Massively Parallel Computing Systems, April 1998
[12]
D. Kroft, Lockup-Free Instruction Fetch/Prefetch Cache Organization, in Procs. 8th Int. Symp. on Computer Architecture, pp. 81-87, 1981
[13]
M. Lam, Software pipelining: An Effective scheduling technique for VLIW Machines, in Procs. on Conf. on Programming Languages and Implementation Design, pp. 258- 267, June 1993
[14]
D. Matzke, Will Physical Scalability Sabotage Performance Gains, IEEE Computer, Vol. 30, No. 9, pp. 37-39, Sept. 1997
[15]
MAP1000 unfolds at Equator, Microprocessor Report, 12(16), Dec. 1998
[16]
T.C. Mowry, M.S. Lam and A. Gupta, Design and Evaluation of a Compiler Algorithm for Prefetching, in Procs. of the 5th. Ann. Symp. on Programming Languages and Operating Systems (ASPLOS-V), pp.62-73, Oct. 1992
[17]
E. Nystrom and A. E. Eichenberger, Effective Cluster Assingment for Modulo Scheduling, in Procs. of 31th. Int. Symp. on Microarchitecture, pp. 103-114, 1998
[18]
E. Ozer, S. Banerjia and T.M. Conte, Unified Assign and Schedule: A New Approach to Scheduling for Clustered Register File Microarchitectures, in Procs. of 31st Int. Symp. on Microarchitecture, pp. 308-315, Nov. 1998
[19]
S. Palacharla, N.P. Jouppi, and J.E. Smith, Complexity- Effective Superscalar Processors, in Procs. of the 24th. Int. Symp. on Computer Architecture, pp. 1-13, June 1997
[20]
B.R. Rau and C.D. Glaeser, Some Scheduling Techniques and an Easily Schedulable Horizontal Architecture for High Performance Scientific Computing, in Procs. on the 14th Ann. Workshop on Microprogramming, pp. 183-198, Oct. 1981
[21]
J. Sanchez and A. Gonzalez, Cache Sensitive Modulo Scheduling, in Procs. of 30th. Int. Symp. on Microarchitecture, pp. 338-348, Dec. 1997
[22]
J. Sanchez and A. Gonz~lez, The Effectiveness of Loop Unrolling for Modulo Scheduling in Clustered VLIW Architectures, in Procs. of the 29th. Int. Conf. on Parallel Processing, pp. 555-562, Aug. 2000
[23]
Semiconductor Industry Association, The National Technology Roadmap for Semiconductors: Technology Needs, 1997
[24]
Texas Instruments Inc., TMS320C62x/67x CPU and Instruction Set Reference Guide, 1998
[25]
X. Vera, J. Llosa, A. Gonzalez and C. Ciuraneta, A Fast Implementation of Cache Miss Equations, in Procs. of the 8th. Int. Workshop on Compilers for Parallel Computers, pp. 319-326, Jan. 2000
[26]
V.V. Zyuban, Low-Power High-Performance Superscalar Architectures, PhD Thesis, Dept. of Computer Science and Engineering, University of Notre Dame, Jan. 2000

Cited By

View all

Index Terms

  1. Modulo scheduling for a fully-distributed clustered VLIW architecture

                            Recommendations

                            Comments

                            Please enable JavaScript to view thecomments powered by Disqus.

                            Information & Contributors

                            Information

                            Published In

                            cover image ACM Conferences
                            MICRO 33: Proceedings of the 33rd annual ACM/IEEE international symposium on Microarchitecture
                            December 2000
                            357 pages
                            ISBN:1581131968
                            DOI:10.1145/360128
                            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                            Sponsors

                            Publisher

                            Association for Computing Machinery

                            New York, NY, United States

                            Publication History

                            Published: 01 December 2000

                            Permissions

                            Request permissions for this article.

                            Check for updates

                            Qualifiers

                            • Article

                            Conference

                            MICRO00
                            Sponsor:

                            Acceptance Rates

                            MICRO 33 Paper Acceptance Rate 31 of 110 submissions, 28%;
                            Overall Acceptance Rate 484 of 2,242 submissions, 22%

                            Contributors

                            Other Metrics

                            Bibliometrics & Citations

                            Bibliometrics

                            Article Metrics

                            • Downloads (Last 12 months)85
                            • Downloads (Last 6 weeks)27
                            Reflects downloads up to 25 Nov 2024

                            Other Metrics

                            Citations

                            Cited By

                            View all
                            • (2021)Selective Code Duplication for Soft Error Protection on VLIW ArchitecturesElectronics10.3390/electronics1015183510:15(1835)Online publication date: 30-Jul-2021
                            • (2013)Fast modulo scheduler utilizing patternized routes for coarse-grained reconfigurable architecturesACM Transactions on Architecture and Code Optimization10.1145/2541228.255531410:4(1-24)Online publication date: 1-Dec-2013
                            • (2010)Instruction SchedulingThe Compiler Design Handbook10.1201/9781420040579.ch17Online publication date: 7-Mar-2010
                            • (2010)An Asymmetrical Register File: The VWRUltra-Low Energy Domain-Specific Instruction-Set Processors10.1007/978-90-481-9528-2_8(199-222)Online publication date: 3-Jul-2010
                            • (2009)Energy-aware register file re-partitioning for clustered VLIW architecturesProceedings of the 2009 Asia and South Pacific Design Automation Conference10.5555/1509633.1509813(805-810)Online publication date: 19-Jan-2009
                            • (2009)Tiled Multicore ProcessorsMulticore Processors and Systems10.1007/978-1-4419-0263-4_1(1-33)Online publication date: 3-Aug-2009
                            • (2008)Edge-centric modulo scheduling for coarse-grained reconfigurable architecturesProceedings of the 17th international conference on Parallel architectures and compilation techniques10.1145/1454115.1454140(166-176)Online publication date: 25-Oct-2008
                            • (2008)Orchestrating the execution of stream programs on multicore platformsACM SIGPLAN Notices10.1145/1379022.137559643:6(114-124)Online publication date: 7-Jun-2008
                            • (2008)Orchestrating the execution of stream programs on multicore platformsProceedings of the 29th ACM SIGPLAN Conference on Programming Language Design and Implementation10.1145/1375581.1375596(114-124)Online publication date: 7-Jun-2008
                            • (2008)Modulo scheduling for highly customized datapaths to increase hardware reusabilityProceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization10.1145/1356058.1356075(124-133)Online publication date: 6-Apr-2008
                            • Show More Cited By

                            View Options

                            View options

                            PDF

                            View or Download as a PDF file.

                            PDF

                            eReader

                            View online with eReader.

                            eReader

                            Login options

                            Media

                            Figures

                            Other

                            Tables

                            Share

                            Share

                            Share this Publication link

                            Share on social media