A Novel Memory Subsystem and Computational Model for Parallel Reconfigurable Architectures

Yamuna Rajasekhar²⁷ &
Ron Sass²⁷

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 8374))

Included in the following conference series:

European Conference on Parallel Processing

1783 Accesses

Abstract

While FPGA and other reconfigurable technologies have dramatically increased in size and speed, memory technology has had only modest improvements. Relative to logic speeds, memory latency is virtually flat and physical constraints on external pins limit memory bandwidth. Unfortunately, the traditional cache hierarchy found in fixedfunction integrated circuits has evolved to support sequential processors and is ineffective for highly parallel architectures. This paper proposes a novel memory subsystem and computational model for reconfigurable architectures.

It envisions a system where computational cores are oversubscribed with atomic tasks and the memory subsystem enables (1) hiding of latency by enabling the cores to overlap computation and memory transactions and (2) the system to fully utilize the available memory bandwidth. The first step in this grand vision is to change the memory model. Instead of a byte-addressable, global address space, a named segment memory controller is introduced and an FPGA-based implementation presented in this paper.

Download to read the full chapter text

Chapter PDF

Reconfigurable Memories

FPGA-Extended General Purpose Computer Architecture

A New Memory Address Transformation for Continuous-Flow FFT Processors with SIMD Extension

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Albonesi, D.: Selective cache ways: on-demand cache resource allocation. In: Proceedings of the 32nd Annual International Symposium on Microarchitecture, MICRO-32, pp. 248–259 (1999)
Google Scholar
Bertsimas, D., Nakazato, D.: The distributional little’s law and its applications. Operations Research 43(2), 298–310 (1995)
Article MATH MathSciNet Google Scholar
Frigo, M., Johnson, S.: Fftw: an adaptive software architecture for the fft. In: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 3, pp. 1381–1384 (1998)
Google Scholar
Huang, B., Sass, R., DeBardeleben, N., Blanchard, S.: Pydac: A resilient run-time framework for divide-and-conquer applications on a heterogeneous many-core architecture. In: The 6th Workshop on UnConventional High Performance Computing, UCHPC at Euro-Par 2013 (2013)
Google Scholar
Johnson, J.R.: Automated performance tuning. In: Proceedings of the 4th International Workshop on Parallel and Symbolic Computation, PASCO 2010, pp. 20–21. ACM, New York (2010), http://doi.acm.org/10.1145/1837210.1837215
Kozyrakis, C., Patterson, D.: A new direction for computer architecture research. Computer 31(11), 24–32 (1998)
Article Google Scholar
Njoroge, N., Casper, J., Wee, S., Teslyar, Y., Ge, D., Kozyrakis, C., Olukotun, K.: Atlas: a chip-multiprocessor with transactional memory support. In: Proceedings of the Conference on Design, Automation and Test in Europe, DATE 2007, pp. 3–8, EDA Consortium, San Jose (2007), http://dl.acm.org/citation.cfm?id=1266366.1266370
Kogge, P., et al.: Exascale computing study: Technology challenges in achieving exascale systems. Tech. Rep. TR-2008-13, DARPA Information Processing Techniques Office (IPTO) sponsored study (2008), http://www.cse.nd.edu/Reports/2008TR-2008-13.pdf
Rajasekhar, Y., Sass, R.: A first analysis of a dynamic memory allocation controller (dmac) core. In: Proceedings of the 2011 Symposium on Application Accelerators in High-Performance Computing, SAAHPC 2011, pp. 64–67. IEEE Computer Society, Washington, DC (2011), http://dx.doi.org/10.1109/SAAHPC.2011.23
Chapter Google Scholar
Wulf, W.A., McKee, S.A.: Hitting the memory wall: implications of the obvious. SIGARCH Comput. Archit. News 23(1), 20–24 (1995), http://doi.acm.org/10.1145/216585.216588
Article Google Scholar

Download references

Author information

Authors and Affiliations

Reconfigurable Computing Systems Laboratory, University of North Carolina at Charlotte, USA
Yamuna Rajasekhar & Ron Sass

Authors

Yamuna Rajasekhar
View author publications
You can also search for this author in PubMed Google Scholar
Ron Sass
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Rechen- und Kommunikationszentrum, RWTH Aachen, Seffenter Weg 23, 52074, Aachen, Germany
Dieter an Mey
TU Vienna, 1040, Vienna, Austria
Michael Alexander
RWTH Aachen University, Seffenter Weg 23, 52074, Aachen, Germany
Paolo Bientinesi & Carsten Clauss &
University Magna Graecia of Catanzaro, 88100, Catanzaro, Italy
Mario Cannataro
Inria Rennes - Bretagne Atlantique, 35042, Rennes, France
Alexandru Costan & Christine Morin &
University of Innsbruck, 6020, Innsbruck, Austria
Gabor Kecskemeti
Department of Computer Science, University of Pisa, 56126, Pisa, Italy
Laura Ricci
Universitat Politècnica de València, 46022, València, Spain
Julio Sahuquillo
LLNL, USA
Martin Schulz
Dipartimento di Informatica, Università di Salerno, 84084, Salerno, Italy
Vittorio Scarano
Tennessee Tech University and Oak Ridge National Laboratory, 38505, Cookeville, TN, USA
Stephen L. Scott
Technische Universität München, 80333, Munich, Germany
Josef Weidendorfer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rajasekhar, Y., Sass, R. (2014). A Novel Memory Subsystem and Computational Model for Parallel Reconfigurable Architectures. In: an Mey, D., et al. Euro-Par 2013: Parallel Processing Workshops. Euro-Par 2013. Lecture Notes in Computer Science, vol 8374. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-54420-0_44

Download citation

DOI: https://doi.org/10.1007/978-3-642-54420-0_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-54419-4
Online ISBN: 978-3-642-54420-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

A Novel Memory Subsystem and Computational Model for Parallel Reconfigurable Architectures

Abstract

Chapter PDF

Similar content being viewed by others

Reconfigurable Memories

FPGA-Extended General Purpose Computer Architecture

A New Memory Address Transformation for Continuous-Flow FFT Processors with SIMD Extension

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A Novel Memory Subsystem and Computational Model for Parallel Reconfigurable Architectures

Abstract

Chapter PDF

Similar content being viewed by others

Reconfigurable Memories

FPGA-Extended General Purpose Computer Architecture

A New Memory Address Transformation for Continuous-Flow FFT Processors with SIMD Extension

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation