CDA Loop Transformations

Dattatraya Kulkarni & Michael Stumm

Abstract

In this paper we present a new loop transformation technique called Computation Decomposition and Alignment (CDA). Computation Decomposition first decomposes the iteration space into finer computation spaces. Computation Alignment then applies a linear transformation to each computation space independently. CDA is a general framework in that linear transformations and their recent extensions are special cases of CDA. CDA’s fine-grained loop restructuring can incur considerable computational effort, but it can exploit optimization opportunities that earlier frameworks cannot. We present four optimization contexts in which CDA can be useful. Our initial experiments demonstrate that CDA adds a new dimension to performance optimization.
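
The two steps named in the abstract can be illustrated with a small sketch. It is not taken from the chapter, whose body is not available in this preview. It assumes the simplest possible case: the decomposition is at the granularity of whole statements (S1 and S2 each get their own computation space), and the alignment is a constant offset applied to S2 only. The function names, arrays, and loop body are hypothetical and chosen only for illustration.

```python
def original_loop(a0, b):
    """Reference loop: S1 and S2 share a single iteration space."""
    n = len(b)
    a, c = list(a0), [0] * n
    for i in range(1, n):
        a[i] = b[i] + 1        # S1(i)
        c[i] = a[i - 1] * 2    # S2(i) reads the value written by S1(i-1)
    return a, c


def aligned_loop(a0, b):
    """Same computation after shifting S2's computation space by one.

    Iteration i now executes S1(i) and S2(i+1), so the S1 -> S2 flow
    dependence stays inside one iteration instead of crossing iterations.
    The guards keep each statement inside its original bounds.
    """
    n = len(b)
    a, c = list(a0), [0] * n
    for i in range(0, n):
        if 1 <= i <= n - 1:
            a[i] = b[i] + 1        # S1(i), unchanged
        if i + 1 <= n - 1:
            c[i + 1] = a[i] * 2    # S2(i+1), aligned with its producer
    return a, c


if __name__ == "__main__":
    a0, b = [5, 0, 0, 0], [1, 2, 3, 4]
    assert original_loop(a0, b) == aligned_loop(a0, b)
```

In the aligned version no iteration reads a value produced by another iteration, so under these assumptions the iterations could be executed in any order. The price is the guard conditions and a slightly larger iteration range, which is a small instance of the extra restructuring effort the abstract mentions.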

Author information

Authors and Affiliations

Department of Computer Science and Department of Electrical and Computer Engineering, University of Toronto, Toronto, M5S 1A4, Canada
Dattatraya Kulkarni & Michael Stumm

Editor information

Editors and Affiliations

Rensselaer Polytechnic Institute, Troy, NY, USA
Boleslaw K. Szymanski
IBM Corporation, Poughkeepsie, NY, USA
Balaram Sinharoy

About this chapter

Cite this chapter

Kulkarni, D., Stumm, M. (1996). CDA Loop Transformations. In: Szymanski, B.K., Sinharoy, B. (eds) Languages, Compilers and Run-Time Systems for Scalable Computers. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-2315-4_3

DOI: https://doi.org/10.1007/978-1-4615-2315-4_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-5979-1
Online ISBN: 978-1-4615-2315-4
eBook Packages: Springer Book Archive
