Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Improving instruction level parallelism through reconfigurable units in superscalar processors

Published: 01 June 2007 Publication History

Abstract

With reducing feature sizes, more transistors can be integrated on the chip. The increased transistor budget can be utilized to improve the instruction level parallelism (ILP) exploited from the processor. However, the transistors cannot be used to arbitrarily increase the processor width and size in the hope of exploiting better ILP. In this paper, we propose an architecture where the superscalar datapath is tightly coupled with a reconfigurable unit (RFU). The reconfiguration unit is configured to execute the traces of dynamic instructions that are frequently executed. To address the data dependency issues between the instructions in the superscalar and the RFU, we propose to execute the trace on the RFU with predicted values. When the trace instructions reach the issue queue in the superscalar, the predictions are validated. In this technique, performance improvement is obtained for correct prediction, whereas no performance degradation is incurred for mispredictions. With this architecture, we observe an average instructions per cycle (IPC) improvement of about 11% over the simulated SPEC 2000 benchmarks, using a very small last value data value predictor.

References

[1]
K. Wang and M. Franklin, "Highly accurate data value prediction using hybrid predictors," in International Symposium on Microarchitecture, 1997.
[2]
P. Athanas and H. Silverman, "Processor reconfiguration through instruction-set metamorphosis," in IEEE Computer, 26(3), March 1995.
[3]
C. Iseli and E. Sanchez, "Spyder: a sure (superscalar and reconfigurable) processor," in Journal of Supercomputing, 9(3):231--252, 1995.
[4]
R. Razdan and M. Smith, "A high-performance microarchitecture with hardware-programmable functional units," in In Proceedings of the 27th Annual International Symposium on Microarchitecture, 1994.
[5]
S. Hauck, T. Fry, M. Hosler, and J. Kao, "The chimaera reconfigurable functional unit," in In Proceedings of the IEEE Symposium on FPGAs for Custom Computing Machines, pp.87--96, Napa Valley, California, April 1997.
[6]
Z. Ye, N. Shenoy, and P. Banerjee, "A c compiler for a processor with a reconfigurable functional unit," in In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays, February 2000.
[7]
B. Kastrup, A. Bink, and J. Hoogerbrugge, "Concise: A compiler-driven cpld-based instruction set accelerator," in In Proceedings of the Seventh Annual IEEE Symposium on Field-Programmable Custom Computing Machines, pp.92--100, April 1999.
[8]
R. Wittig and P. Chow, "Onechip: An fpga processor with reconfigurable logic," in In Proceedings of the IEEE Symposium on Field-Programmable Custom Computing Machines, pp.126--135, Napa Valley, California, March 1996.
[9]
T. Callahan, J. Hauser, and J. Wawrzynek, "The garp architecture and c compiler," in IEEE Computer, 33(4):62--69, April 2000.
[10]
T. Callahan and J. Wawrzynek, "Adapting software pipelining for reconfigurable computing," in In Proceedings of the International Conference on Compilers, Architecture, and Synthesis for Embedded Systems (CASES), 2000.
[11]
C. R. Rupp, M. Landguth, T. Garverick, E. Gomersall, H. Holt, J. M. Arnold, and M. Gokhale., "The napa adaptive processing architecture," in In Proceeding of the IEEE symposium on FPGAs for computing machines. (FCCM/98), Pages 28--37, 1998.
[12]
T. Miyamori and K. Olukotun, "Remarc: Reconfigurable multimedia array co-processor," in IEICE Transactions on information and systems, E82-D(2):389--397, 1999.
[13]
H. Singh, M.-H. Lee, G. Lu, F. J. Kurdahi, N. Bagherzadeh, and E. M. C. Filho, "Morphosys: An integrated reconfigurable system for data-parallel and computation-intensive applications," in IEEE Transactions on Computers, 49(5): 465--481, 2000.
[14]
J. A. Jacob and P. Chow, "Memory interfacing an instruction specification for reconfigurable processors," in In Proceedings of the ACM/SIGDA International Symposium on Field Programmable Gate Arrays. (FPGA/99, Pages 145--154), 1999.
[15]
E. Waingold, M. Taylor, D. Srikrishna, V. Sarkar, W. Lee, V. Lee, J. Kim, M. Frank, P. Finch, R. Barua, J. Babb, S. Amarasinghe, and A. Agarwal, "Baring it all to software: Raw machines," in IEEE Computer, pp. 86--93, 1997.
[16]
S. Vassiliadis, S. Wong, G. N. Gaydadjiev, K. Bertels, G. Kuzmanov, and E. M. Panainte, "The molen polymorphic processor," in, IEEE Transactions on Computers, pp. 1363-- 1375, Volume 53, Issue 11, 2004.
[17]
N. Clark, J. Blome, M. Chu, S. Mahlke, S. Biles, and K. Flautner, "An architecture framework for transparent instruction set customization in embedded processors," in Proceedings of the 32nd Annual International Symposium on Computer Architecture. Pages: 272--283, 2005.
[18]
"Simple scalar toolset, version 2," http://www.simplescalar.com/.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM SIGARCH Computer Architecture News
ACM SIGARCH Computer Architecture News  Volume 35, Issue 3
Special issue on the 2006 reconfigurable and adaptive architecture workshop
June 2007
55 pages
ISSN:0163-5964
DOI:10.1145/1294313
Issue’s Table of Contents

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 June 2007
Published in SIGARCH Volume 35, Issue 3

Check for updates

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 273
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media