Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/335231.335263acmconferencesArticle/Chapter ViewAbstractPublication PagesicsConference Proceedingsconference-collections
Article
Free access

A low-complexity issue logic

Published: 08 May 2000 Publication History

Abstract

One of the main concerns in today's processor design is the issue logic. Instruction-level parallelism is usually favored by an out-of-order issue mechanism where instructions can issue independently of the program order. The out-of-order scheme yields the best performance but at the same time introduces important hardware costs such as an associative look-up, which might be prohibitive for wide issue processors with large instruction windows. This associative search may slow-down the clock-rate and it has an important impact on power consumption. In this work, two new issue schemes that reduce the hardware complexity of the issue logic with minimal impact on the average number of instructions executed per cycle are presented.

References

[1]
M.T. Bohr, "Interconnect Scaling - The Real Limiter to High Performance VLSI", in Proc. of the 1995 IEEE Int. Electron Devices Meeting, pp. 241-244, 1995.
[2]
D. Burger, T.M. Austin, S. Bennett, "Evaluating Future Microprocessors: The SimpleScalar Tool Set", Technical Report CS-TR-96-1308, University of Wisconsin-Madison, 1996.
[3]
R. Canal, J.M. Parcerisa, A. Gonzalez, "Dynamic Cluster Assignment Mechanisms", in Proc. of the Int. Symp. on High-Performance Computer Architecture, pp. 133-142, 2000.
[4]
J.LI. Cruz, A. Gonzalez, M. Valero, N. Topham, "Multiple-Banked Register File Architectures" in Proc. of the 27nd lnt. Syrup. on Computer Architecture, 2000.
[5]
K.I. Farkas, E Chow, N.P. Jouppi, Z. Vranesic, "The Multicluster Architecture: Reducing Cycle Time Through Partitioning", in Proc of the 30th. Ann. Syrup. on Microarchitecture, pp. 149-159, 1997.
[6]
J.A. Fisher, "Very Long Instruction Word and ELI- 512", in Proc. of the lOth Syrup. on Computer Architecture, Stockholm, Sweden, pp. 140-150, 1983.
[7]
M. Franklin, "The Multiscalar Architecture", Ph.D. Thesis, Technical Report TR 1196, Computer Sciences Department, Univ. of Wisconsin-Madison, 1993.
[8]
G.A. Kemp, M. Franklin, "PEWs: A Decentralized Dynamic Scheduler for ILP Processing", in Proc. of the Int. Conf. on Parallel Processing, v.1, pp 239-246, 1996.
[9]
P. Marcuello, A. Gonzalex, J. Tubella, "Speculative Multithreaded Processors", in Proc. of the Int. Conf. on Supercomputing, pp. 77-84, 1998.
[10]
D. Matzke, "Will Physical Scalability Sabotage Performance Gains", IEEE Computer Vol. 30, num. 9, pp.37-39, 1997.
[11]
S. 0nder, R. Gupta, "Superscalar Execution with Dynamic Data Forwarding", in Proc. Int. Conference on Parallel Architectures and Compilation Techniques, pp. 130-135, 1998.
[12]
S. Palacharla, N.E Jouppi, and J.E. Smith, "Complexity-Effective Superscalar Processors", in Proc of the 24th. Int. Symp. on Comp. Architecture, pp 1-13, 1997.
[13]
E. Rotenberg, Q. Jaeobson, Y. Sazeides and J.E. Smith, ''Trace Processors", in Proc of the 30th. Ann. Symp. on Microarchitecture, 1997.
[14]
Semiconductor Industry Association, "The National Technology Roadmap for Semiconductors", 1997.
[15]
G.S. Sohi, S.E. Breach, and T.N. Vijaykumar, "Multiscalar Processors", in Proc. of the 22nd Int. Symp. on Computer Architecture, pp. 414-425, 1995.
[16]
R.M. Tomasulo, "An efficient algorithm for exploiting multiple arithmetic units", IBM Journal of Research and Development vol 11, pp. 25-33, 1967.
[17]
D.M. Tullsen, S.J. Eggers, H.M. Levy, "Simultaneous Multithreading: Maximizing On-Chip Parallelism", in Proc. of the Int. Symp. on Computer Architecture, pp. 392-403, 1995.
[18]
D.W. Wall, "Limits of Instruction-Level Parallelism", Techincal Report WRL 93/6, Digital Western Research Lab, 1993.
[19]
S. Weiss, J.E. Smith, "Instruction Issue Logic in Pipelined Supercomputers", in the IEEE transactions on computers, vol. Co33, no.l l, pp 1013-1022, November 1984.
[20]
W. Yamamoto, M. Nemirovsky, "Increasing superscalar performance through multistreaming", in Proc. of the Int. Conf. on Parallel Architectures and Compilation Techniques, pp. 49-58, 1995

Cited By

View all
  • (2023)Referring Expression Comprehension Via Enhanced Cross-modal Graph Attention NetworksACM Transactions on Multimedia Computing, Communications, and Applications10.1145/354868819:2(1-21)Online publication date: 6-Feb-2023
  • (2023)Code Structure–Guided Transformer for Source Code SummarizationACM Transactions on Software Engineering and Methodology10.1145/352267432:1(1-32)Online publication date: 13-Feb-2023
  • (2022)Physics-directed Data Augmentation for Deep Model Transfer to Specific SensorACM Transactions on Sensor Networks10.1145/354907619:1(1-30)Online publication date: 8-Dec-2022
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICS '00: Proceedings of the 14th international conference on Supercomputing
May 2000
347 pages
ISBN:1581132700
DOI:10.1145/335231
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 08 May 2000

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. in-order issue
  2. instruction issue logic
  3. out-of-order issue
  4. wide-issue superscalar

Qualifiers

  • Article

Conference

ICS00
Sponsor:
ICS00: International Conference on Supercomputing
May 8 - 11, 2000
New Mexico, Santa Fe, USA

Acceptance Rates

ICS '00 Paper Acceptance Rate 33 of 122 submissions, 27%;
Overall Acceptance Rate 629 of 2,180 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)62
  • Downloads (Last 6 weeks)7
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2023)Referring Expression Comprehension Via Enhanced Cross-modal Graph Attention NetworksACM Transactions on Multimedia Computing, Communications, and Applications10.1145/354868819:2(1-21)Online publication date: 6-Feb-2023
  • (2023)Code Structure–Guided Transformer for Source Code SummarizationACM Transactions on Software Engineering and Methodology10.1145/352267432:1(1-32)Online publication date: 13-Feb-2023
  • (2022)Physics-directed Data Augmentation for Deep Model Transfer to Specific SensorACM Transactions on Sensor Networks10.1145/354907619:1(1-30)Online publication date: 8-Dec-2022
  • (2022)Efficient Instruction Scheduling Using Real-time Load Delay TrackingACM Transactions on Computer Systems10.1145/354868140:1-4(1-21)Online publication date: 24-Nov-2022
  • (2022)Robust level-3 BLAS Inverse Iteration from the Hessenberg MatrixACM Transactions on Mathematical Software10.1145/354478948:3(1-30)Online publication date: 10-Sep-2022
  • (2022)Design and optimization of Issue queue in Out-of-Order superscalar microprocessor2022 Asia Conference on Algorithms, Computing and Machine Learning (CACML)10.1109/CACML55074.2022.00056(294-298)Online publication date: Mar-2022
  • (2021)OmegaflowProceedings of the 35th ACM International Conference on Supercomputing10.1145/3447818.3460367(152-163)Online publication date: 3-Jun-2021
  • (2020)Delay and Bypass: Ready and Criticality Aware Instruction Scheduling in Out-of-Order Processors2020 IEEE International Symposium on High Performance Computer Architecture (HPCA)10.1109/HPCA47549.2020.00042(424-434)Online publication date: Feb-2020
  • (2018)High-Performance Instruction Scheduling Circuits for Superscalar Out-of-Order Soft ProcessorsACM Transactions on Reconfigurable Technology and Systems10.1145/309374111:1(1-22)Online publication date: 9-Jan-2018
  • (2016)High Performance Instruction Scheduling Circuits for Out-of-Order Soft Processors2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)10.1109/FCCM.2016.11(9-16)Online publication date: May-2016
  • Show More Cited By

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media