Thermal Management of a Many-Core Processor under Fine-Grained Parallelism

Fuat Keceli³⁰,
Tali Moreshet³¹ &
Uzi Vishkin³⁰

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7155))

Included in the following conference series:

European Conference on Parallel Processing

1530 Accesses

Abstract

In this paper, we present the work in progress that studies the run-time impact of various DTM techniques on a proposed 1024-core XMT chip. XMT aims to improve single task performance using fine-grained parallelism. Via simulations, we show that relative to a general global scheme, speedups of up to 46% with a dedicated interconnection controller and 22% with distributed control of computing clusters are possible. Our findings lead to several high level insights that can impact the design of a broader family of shared memory many-core systems.

Download to read the full chapter text

Chapter PDF

NuPow: Managing Power on NUMA Multiprocessors with Domain-Level Voltage and Frequency Control

An energy-efficient scheduling approach for memory-intensive tasks in multi-core systems

Article 01 August 2022

Prefetching Mechanism for Distributed Cache Architecture: Trends and Challenges

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Balkan, A.O., Horak, M.N., Qu, G., Vishkin, U.: Layout-accurate design and implementation of a high-throughput interconnection network for single-chip parallel processing. In: Proc. Hot Interconnects (2007)
Google Scholar
Bell, N., Garland, M.: Implementing sparse matrix-vector multiplication on throughput-oriented processors. In: Proc. SC (2009)
Google Scholar
Caragea, G., Keceli, F., Tzannes, A., Vishkin, U.: General-purpose vs. GPU: Comparison of many-cores on irregular workloads. In: Proc. HotPar (2010)
Google Scholar
Che, S., Boyer, M., Meng, J., Tarjan, D., Sheaffer, J.W., Lee, S.H., Skadron, K.: Rodinia: A benchmark suite for heterogeneous computing. In: Proc. IISWC (2009)
Google Scholar
Donald, J., Martonosi, M.: Techniques for multicore thermal management: Classification and new exploration. In: Proc. ISCA (2006)
Google Scholar
Ge, Y., Malani, P., Qiu, Q.: Distributed task migration for thermal management in many-core systems. In: Proc. DAC (2010)
Google Scholar
Ginosar, R.: The plural architecture (2011), www.plurality.com , also see course on Parallel Computing, Electrical Engineering, Technion, http://webee.technion.ac.il/courses/048874
Hoberock, J., Bell, N.: Thrust: A parallel template library version 1.1 (2009), http://www.meganewtons.com/
Howard, J., Dighe, S., et al.: A 48-core IA-32 message-passing processor with DVFS in 45nm CMOS. In: Proc. ISSCC (2010)
Google Scholar
Huang, W., Stan, M.R., Sankaranarayanan, K., Ribando, R.J., Skadron, K.: Many-core design from a thermal perspective. In: Proc. DAC (2008)
Google Scholar
Isci, C., Martonosi, M.: Runtime power monitoring in high-end processors: Methodology and empirical data. In: Proc. MICRO (2003)
Google Scholar
Kadin, M., Reda, S., Uht, A.: Central vs. distributed dynamic thermal management for multi-core processors: which one is better? In: Proceedings of the Great Lakes Symposium on VLSI (2009)
Google Scholar
Kaxiras, S., Martonosi, M.: Computer Architecture Techniques for Power Efficiency. Morgan and Claypool Publishers (2008)
Google Scholar
Keceli, F.: Power and Performance Studies of the Explicit Multi-Threading (XMT) Architecture. Ph.D. thesis, University of Maryland (2011)
Google Scholar
Keceli, F., Moreshet, T., Vishkin, U.: Power-performance comparison of single-task driven many-cores, submitted for publication
Google Scholar
Keceli, F., Tzannes, A., Caragea, G., Vishkin, U., Barua, R.: Toolchain for programming, simulating and studying the XMT many-core architecture. In: Proc. HIPS (2011), in conj. with IPDPS
Google Scholar
Keceli, F., Vishkin, U.: XMTSim: Cycle-accurate Simulator of the XMT Many-Core Architecture. Tech. Rep. UMIACS-TR-2011-02, Univ. of Maryland (2011)
Google Scholar
Keller, J., Kessler, C., Traeff, J.L.: Practical PRAM Programming. John Wiley & Sons, Inc., New York (2001)
Google Scholar
Kumar, R., Hinton, G.: A family of 45nm IA processors. In: Proc. ISSCC (2009)
Google Scholar
Li, S., Ahn, J.H., Strong, R.D., Brockman, J.B., Tullsen, D.M., Jouppi, N.P.: McPAT: an integrated power, area, and timing modeling framework for multicore and manycore architectures. In: Proc. MICRO (2009)
Google Scholar
Liu, S., Zhang, J., Wu, Q., Qiu, Q.: Thermal-aware job allocation and scheduling for three dimensional chip multiprocessor. In: Proceedings of the International Symposium on Quality Electronic Design (2010)
Google Scholar
Ma, K., Li, X., Chen, M., Wang, X.: Scalable power control for many-core architectures running multi-threaded applications. In: Proc. ISCA (2011)
Google Scholar
NVIDIA: CUDA SDK 2.3 (2009), www.nvidia.com/cuda
Padua, D., Vishkin, U.: Joint UIUC/UMD parallel algorithms/ programming course. In: Proc. EduPar (2011), in conj. with IPDPS
Google Scholar
Patterson, D.: The trouble with multicore: Chipmakers are busy designing microprocessors that most programmers can’t handle. IEEE Spectrum (July 2010)
Google Scholar
Satish, N., Harris, M., Garland, M.: Designing efficient sorting algorithms for manycore GPUs. In: Proc. IPDPS (2009)
Google Scholar
Skadron, K., Stan, M.R., Huang, W., Velusamy, S., Sankaranarayanan, K., Tarjan, D.: Temperature-aware microarchitecture. In: Proc. ISCA (2003)
Google Scholar
Wen, X., Vishkin, U.: FPGA-based prototype of a PRAM on-chip processor. In: Proc. Comp. Front. (2008)
Google Scholar
Wilton, S., Jouppi, N.: CACTI: an enhanced cache access and cycle time model. IEEE J. Solid-State Circuits 31(5), 677–688 (1996)
Article Google Scholar

Download references

Author information

Authors and Affiliations

University of Maryland, College Park, MD, USA
Fuat Keceli & Uzi Vishkin
Swarthmore College, Swarthmore, PA, USA
Tali Moreshet

Authors

Fuat Keceli
View author publications
You can also search for this author in PubMed Google Scholar
Tali Moreshet
View author publications
You can also search for this author in PubMed Google Scholar
Uzi Vishkin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Scilytics, Koellnerhofgasse 3/15A, 1010, Vienna, Austria
Michael Alexander
ICAR-CNR, Via P. Castellino, 111, 80131, Napoli, Italy
Pasqua D’Ambra
University of Amsterdam, 1090, Amsterdam, Netherlands
Adam Belloum
Innovative Computing Laboratory, The University of Tennessee, USA
George Bosilca
Department of Experimental Medicine and Clinic, University Magna Græcia, 88100, Catanzaro, Italy
Mario Cannataro
Computer Science Department, University of Pisa, Italy
Marco Danelutto
Second University of Naples, Italy
Beniamino Di Martino
TU München, Boltzmannstr. 3, 85748, Garching, Germany
Michael Gerndt
Equipe Runtime, INRIA Bordeaux Sud-Ouest, 33405, Talence Cedex, France
Emmanuel Jeannot & Raymond Namyst &
Equipe HIEPACS, INRIA Bordeaux Sud-Ouest, 33405, Talence Cedex, France
Jean Roman
Oak Ridge National Laboratory, Computer Science and Mathematics Division, 37831-6164, Oak Ridge, TN, USA
Stephen L. Scott
Department of Scientific Computing, University of Vienna, Nordbergstr. 15/3C, 1090, Vienna, Austrial
Jesper Larsson Traff
Computer Science and Mathematics Division, Oak Ridge National Laboratory, 37831, Oak Ridge, TN, USA
Geoffroy Vallée
Technische Universität München, Germany
Josef Weidendorfer

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Keceli, F., Moreshet, T., Vishkin, U. (2012). Thermal Management of a Many-Core Processor under Fine-Grained Parallelism. In: Alexander, M., et al. Euro-Par 2011: Parallel Processing Workshops. Euro-Par 2011. Lecture Notes in Computer Science, vol 7155. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29737-3_29

Download citation

DOI: https://doi.org/10.1007/978-3-642-29737-3_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29736-6
Online ISBN: 978-3-642-29737-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics