Automatic CUDA Code Synthesis Framework for Multicore CPU and GPU Architectures

Hanwoong Jung,
Youngmin Yi &
Soonhoi Ha

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7203)

Included in the following conference series:

International Conference on Parallel Processing and Applied Mathematics


Abstract

General-purpose GPU (GPGPU) programming has spread rapidly since CUDA was introduced as a way to write parallel programs for NVIDIA GPUs in a high-level language. While a GPU exploits data parallelism very effectively, task-level parallelism is exploited by running a multithreaded program on a multicore CPU. For a heterogeneous platform that consists of a multicore CPU and a GPU, we propose an automatic code synthesis framework that takes a process network model specification as input and generates multithreaded CUDA code. With the model-based specification, one can explicitly specify both function-level and loop-level parallelism in an application and explore the wide design space of mapping function blocks and selecting the communication methods between CPU and GPU. The proposed technique is complementary to other high-level methods of CUDA programming.
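The two kinds of parallelism named in the abstract can be illustrated with a small sketch. It is not the authors' framework or its generated code: it is a hypothetical three-block process network written in plain Python, where each function block runs as its own thread (function-level parallelism) and blocks communicate only through blocking FIFO channels. The per-element loop inside `scale` stands in for the loop-level parallel part that a synthesis tool could map to a GPU kernel. All names (`source`, `scale`, `sink`, `run_network`) and the bounded channel size are assumptions made for illustration.

```python
import queue
import threading


def source(out_ch, n_frames, frame_len):
    """Producer block: emits n_frames frames, then an end-of-stream marker."""
    for f in range(n_frames):
        out_ch.put([f * frame_len + i for i in range(frame_len)])
    out_ch.put(None)  # None marks end of stream (an assumption of this sketch)


def scale(in_ch, out_ch, factor):
    """Worker block: the per-element loop is the loop-level parallel part."""
    while True:
        frame = in_ch.get()  # blocking read, as in a process network
        if frame is None:
            out_ch.put(None)
            return
        # Every iteration is independent, so a GPU mapping could assign
        # one thread per element; here it runs sequentially on the CPU.
        out_ch.put([factor * x for x in frame])


def sink(in_ch, results):
    """Consumer block: reduces each frame to its sum."""
    while True:
        frame = in_ch.get()
        if frame is None:
            return
        results.append(sum(frame))


def run_network(n_frames=3, frame_len=4, factor=2):
    """Wire the blocks with FIFO channels and run each block as a thread."""
    a = queue.Queue(maxsize=2)  # channel: source -> scale
    b = queue.Queue(maxsize=2)  # channel: scale -> sink
    results = []
    blocks = [
        threading.Thread(target=source, args=(a, n_frames, frame_len)),
        threading.Thread(target=scale, args=(a, b, factor)),
        threading.Thread(target=sink, args=(b, results)),
    ]
    for t in blocks:
        t.start()
    for t in blocks:
        t.join()
    return results
```

Because each channel has a single writer and a single reader and reads block until data arrives, the result does not depend on thread scheduling. In this sketch, choosing where `scale` runs (CPU thread or GPU kernel) and how its channels move data between host and device corresponds to the mapping and communication choices the abstract describes as the design space.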



Author information

Authors and Affiliations

School of EECS, Seoul National University, Seoul, Korea
Hanwoong Jung & Soonhoi Ha
School of ECE, University of Seoul, Seoul, Korea
Youngmin Yi


Editor information

Editors and Affiliations

Institute of Computer and Information Science, Czestochowa University of Technology, Dabrowskiego 69, 42-201, Czestochowa, Poland
Roman Wyrzykowski & Konrad Karczewski
Electrical Engineering and Computer Science Department, University of Tennessee, 1122 Volunteer Blvd, 37996-3450, Knoxville, TN, USA
Jack Dongarra
Department of Informatics and Mathematical Modeling, Technical University of Denmark, Richard Petersens Plads, Building 321, 2800, Kongens Lyngby, Denmark
Jerzy Waśniewski

Cite this paper

Jung, H., Yi, Y., Ha, S. (2012). Automatic CUDA Code Synthesis Framework for Multicore CPU and GPU Architectures. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2011. Lecture Notes in Computer Science, vol 7203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31464-3_59

DOI: https://doi.org/10.1007/978-3-642-31464-3_59
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31463-6
Online ISBN: 978-3-642-31464-3
eBook Packages: Computer Science, Computer Science (R0)
