19th CGO 2021: Seoul, South Korea
- Jae W. Lee, Mary Lou Soffa, Ayal Zaks:
IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2021, Seoul, South Korea, February 27 - March 3, 2021. IEEE 2021, ISBN 978-1-7281-8613-9
Frontmatter
- Jae W. Lee:
Message from the General Chair. iii-iv
- Mary Lou Soffa, Ayal Zaks:
Message from the Program Chairs. v
- Jubi Taneja, Michel Steuwer:
Report from the Artifact Evaluation Committee. x-xi
Keynote
- Mary W. Hall:
Data Layout and Data Representation Optimizations to Reduce Data Movement Keynote. 1
Compiler Infrastructure
- Chris Lattner, Mehdi Amini, Uday Bondhugula, Albert Cohen, Andy Davis, Jacques A. Pienaar, River Riddle, Tatiana Shpeisman, Nicolas Vasilache, Oleksandr Zinenko:
MLIR: Scaling Compiler Infrastructure for Domain Specific Computation. 2-14
- Lorenzo Chelini, Andi Drebes, Oleksandr Zinenko, Albert Cohen, Nicolas Vasilache, Tobias Grosser, Henk Corporaal:
Progressive Raising in Multi-level IR. 15-26
- Thomas Koehler, Michel Steuwer:
Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUs. 27-38
- Ajay Brahmakshatriya, Saman P. Amarasinghe:
BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++. 39-51
Dealing with Precision
- Joao Rivera, Franz Franchetti, Markus Püschel:
An Interval Compiler for Sound Floating-Point Computations. 52-64
- Tiago Trevisan Jost, Yves Durand, Christian Fabre, Albert Cohen, Frédéric Pétrot:
Seamless Compiler Integration of Variable Precision Floating-Point Arithmetic. 65-76
- Jian Weng, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki:
UNIT: Unifying Tensorized Instruction Compilation. 77-89
- Guangli Li, Jingling Xue, Lei Liu, Xueying Wang, Xiu Ma, Xiao Dong, Jiansong Li, Xiaobing Feng:
Unleashing the Low-Precision Computation Potential of Tensor Cores on GPUs. 90-102
Binary Profiling, Tracing, Sampling
- Mahwish Arif, Ruoyu Zhou, Hsi-Ming Ho, Timothy M. Jones:
Cinnamon: A Domain-Specific Language for Binary Profiling and Monitoring. 103-114
- Keren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey:
GPA: A GPU Performance Advisor Based on Instruction Sampling. 115-125
- Harish Patil, Alexander Isaev, Wim Heirman, Alen Sabu, Ali Hajiabadi, Trevor E. Carlson:
ELFies: Executable Region Checkpoints for Performance Analysis and Simulation. 126-136
- David Pankratz, Tyler Nowicki, Ahmed Eltantawy, José Nelson Amaral:
Vulkan Vision: Ray Tracing Workload Characterization using Automatic Graphics Instrumentation. 137-149
Parallelism - Optimizing, Modeling, Testing
- Christos Vasiladiotis, Roberto Castañeda Lozano, Murray Cole, Björn Franke:
Loop Parallelization using Dynamic Commutativity Analysis. 150-161
- Seungbin Song, Heelim Choi, Hanjun Kim:
Fine-Grained Pipeline Parallelization for Network Function Programs. 162-173
- Christie L. Alappat, Johannes Seiferth, Georg Hager, Matthias Korch, Thomas Rauber, Gerhard Wellein:
YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures. 174-186
- Ting Yuan, Guangwei Li, Jie Lu, Chen Liu, Lian Li, Jingling Xue:
GoBench: A Benchmark Suite of Real-World Go Concurrency Bugs. 187-199
Memory Optimization and Safeness
- Luigi Soares, Fernando Magno Quintão Pereira:
Memory-Safe Elimination of Side Channels. 200-210
- Naveen Namashivavam, Sanyam Mehta, Pen-Chung Yew:
Variable-Sized Blocks for Locality-Aware SpMV. 211-221
- Mohamad Barbar, Yulei Sui, Shiping Chen:
Object Versioning for Flow-Sensitive Pointer Analysis. 222-235
- Haofeng Li, Haining Meng, Hengjie Zheng, Liqing Cao, Jie Lu, Lian Li, Lin Gao:
Scaling Up the IFDS Algorithm with Efficient Disk-Assisted Computing. 236-247
Compiling Graph Algorithms, Compiling for GPUs
- Ajay Brahmakshatriya, Yunming Zhang, Changwan Hong, Shoaib Kamil, Julian Shun, Saman P. Amarasinghe:
Compiling Graph Applications for GPUs with GraphIt. 248-261
- Ruohuang Zheng, Sreepathi Pai:
Efficient Execution of Graph Algorithms on CPU with SIMD Extensions. 262-276
- Alexander Krolik, Clark Verbrugge, Laurie J. Hendren:
r3d3: Optimized Query Compilation on GPUs. 277-288
- Guei-Yuan Lueh, Kaiyu Chen, Gang Chen, Joel Fuentes, Wei-Yu Chen, Fangwen Fu, Hong Jiang, Hongzheng Li, Daniel Rhee:
C-for-Metal: High Performance SIMD Programming on Intel GPUs. 289-300
Compiling for Spatial, Quantum, and Embedded Devices
- Ji Liu, Luciano Bello, Huiyang Zhou:
Relaxed Peephole Optimization: A Novel Compiler Optimization for Quantum Circuits. 301-314
- Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler:
StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems. 315-326
- Changsu Kim, Shinnung Jeong, Sungjun Cho, Yongwoo Lee, William Song, Youngsok Kim, Hanjun Kim:
Thread-Aware Area-Efficient High-Level Synthesis Compiler for Embedded Devices. 327-339
JIT and Binary Translation
- Guilherme Ottoni, Bin Liu:
HHVM Jump-Start: Boosting Both Warmup and Steady-State Performance at Scale. 340-350
- Ziyi Zhao, Zhang Jiang, Ying Chen, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew:
Enhancing Atomic Instruction Emulation for Cross-ISA Dynamic Binary Translation. 351-362
- Milind Chabbi, Jin Lin, Raj Barik:
An Experience with Code-Size Optimization for Production iOS Mobile Applications. 363-377
- Anderson Faustino da Silva, Bruno Conde Kind, José Wesley de Souza Magalhães, Jerônimo Nunes Rocha, Breno Campos Ferreira Guimarães, Fernando Magno Quintão Pereira:
AnghaBench: A Suite with One Million Compilable C Benchmarks for Code-Size Reduction. 378-390