Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/602902.603016acmconferencesArticle/Chapter ViewAbstractPublication PagesiccadConference Proceedingsconference-collections
Article
Free access

Exploring performance tradeoffs for clustered VLIW ASIPs

Published: 05 November 2000 Publication History

Abstract

VLIW ASIPs provide an attractive solution for increasingly pervasive real-time multimedia and signal processing embedded applications. In this paper we propose an algorithm to support trade-off exploration during the early phases of the design/specialization of VLIW ASIPs with clustered datapaths. For purposes of an early exploration step, we define a parameterized family of clustered datapaths D(m,n), where m and n denote interconnect capacity and cluster capacity constraints on the family. Given a kernel, the proposed algorithm explores the space of feasible clustered datapaths and returns: a datapath configuration; a binding and scheduling for the operations; and a corresponding estimate for the best achievable latency over the specified family. Moreover, we show how the parameters m and n, as well as a target latency optionally specified by the designer, can be used to effectively explore trade-offs among delay, power/energy, and latency. Extensive empirical evidence is provided showing that the proposed approach is strikingly effective at attacking this complex optimization problem.

Supplementary Material

ZIP File (a504-jacome.zip)
Presentations from the 2000 ICCAD conference: systems design automation for network processors and wireless chipsets

References

[1]
D.C. Burger and J.R. Goodman. Billion-transistor architectures. IEEE Computer, 30(9), 1997.
[2]
C.M.Chu and J.Rabaey. Hardware selection and clustering in the HYPER synthesis system. In Proc. IEEE European Conference of Design Automation, March 1992.
[3]
G. de Micheli. Synthesis and Optimization of Digital Ciruits. McGraw-Hill, Inc, 1994.
[4]
D.S.Rao and F.J.Kurdahi. Partitioning by regularity extraction. In Proc. IEEE/ACM Design Automation Conference, June 1992.
[5]
W. Geurts, F. Catthor, S. Vernalde, and H. DeMan. Accelerator Data-Path Synthesis for High-Throughput Signal Processing Applications. Kluwer Academic Publishers, 1997.
[6]
E. Ifeachor and B. Jervis. Digital signal processing: A practical approach. Addison-Wesley, 1993.
[7]
M. Jacome and G. de Veciana. Lower bound on latency for VLIW ASIPs. In Proc. of ACM/IEEE International Conference on Computer Aided Design (ICCAD), Nov 1999.
[8]
K.R.Rao and P.Yip. Discrete Cosine Transform: Algorithms, Advantages, Applications. Academic Press, 1990.
[9]
C. Liem. Retargetable compilers for embedded core processors. Kluwer Academic Publishers, 1997.
[10]
P. Marwedel and Gert Goossens, editors. Code Generation for Embedded Processors. Kluwer Academic Publishers, 1995.
[11]
NOVA Project: ASIPs and retargetable compilers; CAD for embedded systems. Department of ECE, U.T. Austin. http://horizon.ece.utexas.edu/~jacome/nova/.
[12]
M. Rim and R. Jain. Lower bound performance estimation for the high-level synthesis scheduling problem. IEEE Trans. on CAD of ICs and Systems, 13(4):451-58, 1994.
[13]
S. Rixner, W. Dally, U. Kapasi, B. Khailany, A. Lopez-Lagunas, P. Mattson, and J. Owens. A bandwidth-efficient architecture for media processing. In Proc. 31st Annual International Symposium on Microarchitecture, pages 3-13., Nov.-Dec. 1998.
[14]
S. Rixner, W. Dally, B. Khailany, P. Mattson, U. Kapasi, and J. Owens. Register organization for media processing. In Proc. 26th International Symposium on High-Performance Computer Architecture, May 1999.
[15]
E. A. Rundensteiner, D. Gajski, and L. Bic. Component synthesis from functional descriptions. IEEE Transactions on Computer Aided Design, 12(9), 1993.
[16]
G. Tiruvuri and M. Chung. Estimation of lower bounds in scheduling algorithms for high-level synthesis. ACM Trans. on DAES (TODAES), 3(2):162-80, 1998.

Cited By

View all
  • (2007)Design of a low power pre-synchronization ASIP for multimode SDR terminalsProceedings of the 7th international conference on Embedded computer systems: architectures, modeling, and simulation10.5555/1776200.1776244(322-332)Online publication date: 16-Jul-2007
  • (2007)A low power VLIW processor generation method by means of extracting non-redundant activation conditionsProceedings of the 5th IEEE/ACM international conference on Hardware/software codesign and system synthesis10.1145/1289816.1289872(227-232)Online publication date: 30-Sep-2007
  • (2006)Customization of application specific heterogeneous multi-pipeline processorsProceedings of the conference on Design, automation and test in Europe: Proceedings10.5555/1131481.1131693(746-751)Online publication date: 6-Mar-2006
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
ICCAD '00: Proceedings of the 2000 IEEE/ACM international conference on Computer-aided design
November 2000
558 pages
ISBN:0780364481

Sponsors

Publisher

IEEE Press

Publication History

Published: 05 November 2000

Check for updates

Qualifiers

  • Article

Conference

ICCAD '00
Sponsor:
ICCAD '00: International Conference on Computer Aided Design
November 5 - 9, 2000
California, San Jose

Acceptance Rates

Overall Acceptance Rate 457 of 1,762 submissions, 26%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)19
  • Downloads (Last 6 weeks)6
Reflects downloads up to 14 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2007)Design of a low power pre-synchronization ASIP for multimode SDR terminalsProceedings of the 7th international conference on Embedded computer systems: architectures, modeling, and simulation10.5555/1776200.1776244(322-332)Online publication date: 16-Jul-2007
  • (2007)A low power VLIW processor generation method by means of extracting non-redundant activation conditionsProceedings of the 5th IEEE/ACM international conference on Hardware/software codesign and system synthesis10.1145/1289816.1289872(227-232)Online publication date: 30-Sep-2007
  • (2006)Customization of application specific heterogeneous multi-pipeline processorsProceedings of the conference on Design, automation and test in Europe: Proceedings10.5555/1131481.1131693(746-751)Online publication date: 6-Mar-2006
  • (2006)Platform-based resource binding using a distributed register-file microarchitectureProceedings of the 2006 IEEE/ACM international conference on Computer-aided design10.1145/1233501.1233648(709-715)Online publication date: 5-Nov-2006
  • (2006)Application specific forwarding network and instruction encoding for multi-pipe ASIPsProceedings of the 4th international conference on Hardware/software codesign and system synthesis10.1145/1176254.1176313(241-246)Online publication date: 22-Oct-2006
  • (2004)Dual-pipeline heterogeneous ASIP designProceedings of the 2nd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis10.1145/1016720.1016727(12-17)Online publication date: 8-Sep-2004
  • (2001)Tailoring pipeline bypassing and functional unit mapping to application in clustered VLIW architecturesProceedings of the 2001 international conference on Compilers, architecture, and synthesis for embedded systems10.1145/502217.502241(141-148)Online publication date: 16-Nov-2001
  • (2001)High-quality operation binding for clustered VLIW datapathsProceedings of the 38th annual Design Automation Conference10.1145/378239.379051(702-707)Online publication date: 22-Jun-2001

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media