Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1176254.1176297acmconferencesArticle/Chapter ViewAbstractPublication PagesesweekConference Proceedingsconference-collections
Article

SHAPES:: a tiled scalable software hardware architecture platform for embedded systems

Published: 22 October 2006 Publication History

Abstract

Nanoscale systems on chip will integrate billion-gate designs. The challenge is to find a scalable HW/SW design style for future CMOS technologies. Tiled architectures suggest a possible path: "small" processing tiles connected by "short wires". A typical SHAPES tile contains a VLIW floating-point DSP, a RISC, a DNP (Distributed Network Processor), distributed on chip memory, the POT (a set of Peripherals On Tile) plus an interface for DXM (Distributed External Memory). The SHAPES routing fabric connects on-chip and off-chip tiles, weaving a distributed packet switching network. 3D next-neighbours engineering methodologies is adopted for off-chip networking and maximum system density. The SW challenge is to provide a simple and efficient programming environment for tiled architectures. SHAPES will investigate a layered system software, which does not destroy algorithmic and distribution info provided by the programmer and is fully aware of the HW paradigm. For efficiency and QoS, the system SW manages intra-tile and inter-tile latencies, bandwidths, computing resources, using static and dynamic profiling. The SW accesses the on-chip and off-chip networks through a homogeneous interface.

References

[1]
R. Ho, K. Mai and M. Horowitz, "The Future of Wires", Proc. IEEE, 89-4 (2001)490--504.
[2]
D. Sylvester and K. Keutzer, "Impact of Small Process Geometries on Microarchitectures in Systems on a Chip", Proc. IEEE, 89-4(2001)467--489.
[3]
L. P. Carloni, A. L. Sangiovanni-Vincentelli, "Coping with latency in SOC Design", IEEE Micro 22-5 (2002) 24--35.
[4]
A. Allan et al., "2001 Technology Roadmap for Semiconductors", IEEE Computer 35-1(2002)42--53.
[5]
W. J. Dally and S. Lacy, "VLSI Architectures: Past, Present and Future", Proc. Advanced Research in VLSI Conf., IEEE Press (1999)232--241.
[6]
M. B. Taylor et al., "The Raw Microprocessor: A Computational Fabric for Software Circuits and General-Purpose Programs", IEEE Micro 22-2(2002)25--35.
[7]
P. S. Paolucci et al. "Janus: A gigaflop VLIW+RISC Soc Tile", Hot Chips 15 IEEE Stanford Conference (2003). http://www.hotchips.org (note: Janus was the development name of Diopsis. see www.atmelroma.it).
[8]
J. Ying Fai Tong et al. "Reducing Power by Optimizing the Necessary Precision Range of Floating Point Arithmetic", IEEE Trans. On VLSI Systems, 8-3 (2000)273--286.
[9]
P. Faraboschi, G. Desoli, J. A. Fisher, "The Latest Word in Digital and Media Processing", IEEE Signal Processing Mag. 15-2(1998)59--85.
[10]
R. P. Clowell, J. O'Donnell, D. P. Papworth, P. K. Rodman, "Instruction Storage Method with a Compressed Format Using a Mask Word", U.S. Patent 5057837, (Oct 1991).
[11]
P. S. Paolucci, P. Kajfasz et al., "mAgic-FPU and MADE: A customizable VLIW core and the modular VLIW processor architecture description environment", Computer Physics Communication 139(2001)132--143.
[12]
A. Bartoloni, P. S. Paolucci et al., "A Hardware Implementation of the APE100 Architecture", Int. Journ. Mod. Phys. C 4(1993)969.
[13]
N. Cabibbo and P. S. Paolucci, "SIMD algorithm for Matrix Transposition", Int. Journ. Mod. Phys. C 6(1995)183.
[14]
F. Aglietti, P. S. Paolucci, et al., "The teraflop supercomputer APEmille: architecture, software and project status report" Computer Physics Communications, 110,1-3 (May 1998) 216--219
[15]
J. Rabaey, A. Chandrakasan, B. Nikolic, Digital Integrated Circuits, 2nd Edition, Prentice-Hall (2003) Chapter 4 and 9.
[16]
E. de Kock, G. Essink, W. Smits, P. van der Wolf, J.-Y. Brunel, W. Kruijtzer, P. Lieverse, and K. Vissers. YAPI: Application modeling for signal processing systems. In Proc. 37th Design Automation Conference (DAC'2000), pages 402--405, Los Angeles, CA, June 5-9 2000.
[17]
G. E. Erwin de Kock. Y-Chart Application Programmer's Interface: The YAPI Programmer's and Reference Guide Version 2.0.0.
[18]
P. van derWolf, E. de Kock, T. Henriksson, W. Kruijtzer, and G. Essink. Design and Programming of Embedded Multiprocessors: An Interface-Centric Approach. In Proc. IEEE-ACM-IFIP International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS'04), Stockholm, Sweden, Sept. 8-10 2004.
[19]
T. Kogel, M. Doerper, A. Wieferink, R. Leupers, G. Ascheid, H. Meyr, S. Goossens "A Modular Simulation Framework for Architectural Exploration of On-Chip Interconnection Networks" In The First IEEE/ACM/IFIP International Conference on HW/SW Codesign and System Synthesis, Newport Beach (California USA), Oct 2003
[20]
T. Kempf, M. Dörper, R. Leupers, G. Ascheid, H. Meyr "A Modular Simulation Framework for Spatial and Temporal Task Mapping onto Multi-Processor SoC Platforms", In Proc.of the Conference on Design, Automation & Test in Europe (DATE), Munich, Germany, March 2005
[21]
T. Kempf, K. Karuri, S. Wallentowitz, G. Ascheid, R. Leupers, H. Meyer "A SW Performance Estimation Framework for Early System-Level-Design Using Fine-Grained Instrumentation" In Proc.of the Conference on Design, Automation & Test in Europe (DATE), Munich, Germany, March 2006
[22]
S. Chakraborty, S. Künzli, and L. Thiele. "A general framework for analyzing system properties in platform-based embedded system designs," In Proc. 6th Design, Automation and Test in Europe (DATE), pages 190--195, March 2003.
[23]
K. Keutzer, S. Malik, A. Newton, J. Rabaey, and A. Sangiovanni-Vincentelli, "System level design: Orthogonalization of concerns and platform-based design," IEEE Trans. on Computer-Aided Design of Integrated Circuits and Systems, vol. 19, no. 12, Dec. 2000.
[24]
"Virtual Platform Designer" http://www.CoWare.com
[25]
"Chess/Checkers, a retargetable tool-suite for embedded processors", Target Compiler Technologies, http://www.retarget.com/doc/target-whitepaper.pdf.
[26]
L. Thiele, S. Chakraborty, and M. Naedele. "Real-time calculus for scheduling hard real-time systems," In Proc. IEEE International Symposium on Circuits and Systems (ISCAS), volume 4, pages 101--104, 2000.
[27]
G. Kahn, "The semantics of a simple language for parallel programming," in Proc. of the IFIP Congress 74, (1974).
[28]
Belletti, F. et al. "Computing for LQCD: apeNEXT", Computing in Science and Engineering, 8-1, pp. 18--29, Jan/Feb, 2006

Cited By

View all
  • (2019)A Hybrid Execution Approach to Improve the Performance of Dataflow Applications2019 International SoC Design Conference (ISOCC)10.1109/ISOCC47750.2019.9027648(107-108)Online publication date: 6-Oct-2019
  • (2018)Software Compilation Techniques for Heterogeneous Embedded Multi-Core SystemsHandbook of Signal Processing Systems10.1007/978-3-319-91734-4_28(1021-1062)Online publication date: 14-Oct-2018
  • (2017)Towards Automated Variant Selection for Heterogeneous Tiled ArchitecturesModels, Algorithms, Logics and Tools10.1007/978-3-319-63121-9_19(382-399)Online publication date: 25-Jul-2017
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CODES+ISSS '06: Proceedings of the 4th international conference on Hardware/software codesign and system synthesis
October 2006
328 pages
ISBN:1595933700
DOI:10.1145/1176254
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 22 October 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. MP-SOC
  2. RISC
  3. VLIW
  4. application mapping
  5. binding
  6. distributed network processors
  7. embedded systems
  8. hardware dependent software
  9. model based design
  10. network of processes
  11. retargetable compiler
  12. scheduling
  13. simulation
  14. tiled parallel architectures

Qualifiers

  • Article

Conference

ESWEEK06
ESWEEK06: Second Embedded Systems Week 2006
October 22 - 25, 2006
Seoul, Korea

Acceptance Rates

Overall Acceptance Rate 280 of 864 submissions, 32%

Upcoming Conference

ESWEEK '24
Twentieth Embedded Systems Week
September 29 - October 4, 2024
Raleigh , NC , USA

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 22 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2019)A Hybrid Execution Approach to Improve the Performance of Dataflow Applications2019 International SoC Design Conference (ISOCC)10.1109/ISOCC47750.2019.9027648(107-108)Online publication date: 6-Oct-2019
  • (2018)Software Compilation Techniques for Heterogeneous Embedded Multi-Core SystemsHandbook of Signal Processing Systems10.1007/978-3-319-91734-4_28(1021-1062)Online publication date: 14-Oct-2018
  • (2017)Towards Automated Variant Selection for Heterogeneous Tiled ArchitecturesModels, Algorithms, Logics and Tools10.1007/978-3-319-63121-9_19(382-399)Online publication date: 25-Jul-2017
  • (2014)Design of an NoC Interface Macrocell with Hardware Support of Advanced Networking FunctionalitiesIEEE Transactions on Computers10.1109/TC.2012.7063:3(609-621)Online publication date: 1-Mar-2014
  • (2014)EURETILE Design FlowProceedings of the 2014 IEEE International Symposium on Parallel and Distributed Processing with Applications10.1109/ISPA.2014.32(182-189)Online publication date: 26-Aug-2014
  • (2013)Checkpointing SystemC-Based Virtual PlatformsAdoption and Optimization of Embedded and Real-Time Communication Systems10.4018/978-1-4666-2776-5.ch012(222-237)Online publication date: 2013
  • (2013)Predictability for timing and temperature in multiprocessor system-on-chip platformsACM Transactions on Embedded Computing Systems10.1145/2435227.243524412:1s(1-25)Online publication date: 21-Mar-2013
  • (2013)legaSCiProceedings of the 2013 IEEE 27th International Symposium on Parallel and Distributed Processing Workshops and PhD Forum10.1109/IPDPSW.2013.34(2188-2193)Online publication date: 20-May-2013
  • (2013)Software Compilation Techniques for MPSoCsHandbook of Signal Processing Systems10.1007/978-1-4614-6859-2_37(1215-1257)Online publication date: 10-May-2013
  • (2013)Methods and Tools for Mapping Process Networks onto Multi-Processor Systems-On-ChipHandbook of Signal Processing Systems10.1007/978-1-4614-6859-2_27(867-903)Online publication date: 10-May-2013
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media