


19th CGO 2021: Seoul, South Korea
- Jae W. Lee, Mary Lou Soffa, Ayal Zaks:
IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2021, Seoul, South Korea, February 27 - March 3, 2021. IEEE 2021, ISBN 978-1-7281-8613-9
Frontmatter
- Jae W. Lee:
Message from the General Chair. iii-iv
- Mary Lou Soffa, Ayal Zaks:
Message from the Program Chairs. v
- Jubi Taneja, Michel Steuwer:
Report from the Artifact Evaluation Committee. x-xi
Keynote
- Mary W. Hall:
Data Layout and Data Representation Optimizations to Reduce Data Movement Keynote. 1
Compiler Infrastructure
- Chris Lattner, Mehdi Amini, Uday Bondhugula, Albert Cohen, Andy Davis, Jacques A. Pienaar, River Riddle, Tatiana Shpeisman, Nicolas Vasilache, Oleksandr Zinenko:
MLIR: Scaling Compiler Infrastructure for Domain Specific Computation. 2-14
- Lorenzo Chelini, Andi Drebes, Oleksandr Zinenko, Albert Cohen, Nicolas Vasilache, Tobias Grosser, Henk Corporaal:
Progressive Raising in Multi-level IR. 15-26
- Thomas Koehler, Michel Steuwer:
Towards a Domain-Extensible Compiler: Optimizing an Image Processing Pipeline on Mobile CPUs. 27-38
- Ajay Brahmakshatriya, Saman P. Amarasinghe:
BuildIt: A Type-Based Multi-stage Programming Framework for Code Generation in C++. 39-51
Dealing with Precision
- Joao Rivera, Franz Franchetti, Markus Püschel:
An Interval Compiler for Sound Floating-Point Computations. 52-64
- Tiago Trevisan Jost, Yves Durand, Christian Fabre, Albert Cohen, Frédéric Pétrot:
Seamless Compiler Integration of Variable Precision Floating-Point Arithmetic. 65-76
- Jian Weng, Animesh Jain, Jie Wang, Leyuan Wang, Yida Wang, Tony Nowatzki:
UNIT: Unifying Tensorized Instruction Compilation. 77-89
- Guangli Li, Jingling Xue, Lei Liu, Xueying Wang, Xiu Ma, Xiao Dong, Jiansong Li, Xiaobing Feng:
Unleashing the Low-Precision Computation Potential of Tensor Cores on GPUs. 90-102
Binary Profiling, Tracing, Sampling
- Mahwish Arif, Ruoyu Zhou, Hsi-Ming Ho, Timothy M. Jones:
Cinnamon: A Domain-Specific Language for Binary Profiling and Monitoring. 103-114
- Keren Zhou, Xiaozhu Meng, Ryuichi Sai, John M. Mellor-Crummey:
GPA: A GPU Performance Advisor Based on Instruction Sampling. 115-125
- Harish Patil, Alexander Isaev, Wim Heirman, Alen Sabu, Ali Hajiabadi, Trevor E. Carlson:
ELFies: Executable Region Checkpoints for Performance Analysis and Simulation. 126-136
- David Pankratz, Tyler Nowicki, Ahmed Eltantawy, José Nelson Amaral:
Vulkan Vision: Ray Tracing Workload Characterization using Automatic Graphics Instrumentation. 137-149
Parallelism - Optimizing, Modeling, Testing
- Christos Vasiladiotis, Roberto Castañeda Lozano, Murray Cole, Björn Franke:
Loop Parallelization using Dynamic Commutativity Analysis. 150-161
- Seungbin Song, Heelim Choi, Hanjun Kim:
Fine-Grained Pipeline Parallelization for Network Function Programs. 162-173
- Christie L. Alappat, Johannes Seiferth, Georg Hager, Matthias Korch, Thomas Rauber, Gerhard Wellein:
YaskSite: Stencil Optimization Techniques Applied to Explicit ODE Methods on Modern Architectures. 174-186
- Ting Yuan, Guangwei Li, Jie Lu, Chen Liu, Lian Li, Jingling Xue:
GoBench: A Benchmark Suite of Real-World Go Concurrency Bugs. 187-199
Memory Optimization and Safeness
- Luigi Soares, Fernando Magno Quintão Pereira:
Memory-Safe Elimination of Side Channels. 200-210
- Naveen Namashivavam, Sanyam Mehta, Pen-Chung Yew:
Variable-Sized Blocks for Locality-Aware SpMV. 211-221
- Mohamad Barbar, Yulei Sui, Shiping Chen:
Object Versioning for Flow-Sensitive Pointer Analysis. 222-235
- Haofeng Li, Haining Meng, Hengjie Zheng, Liqing Cao, Jie Lu, Lian Li, Lin Gao:
Scaling Up the IFDS Algorithm with Efficient Disk-Assisted Computing. 236-247
Compiling Graph Algorithms, Compiling for GPUs
- Ajay Brahmakshatriya, Yunming Zhang, Changwan Hong, Shoaib Kamil, Julian Shun, Saman P. Amarasinghe:
Compiling Graph Applications for GPUs with GraphIt. 248-261
- Ruohuang Zheng, Sreepathi Pai:
Efficient Execution of Graph Algorithms on CPU with SIMD Extensions. 262-276
- Alexander Krolik, Clark Verbrugge, Laurie J. Hendren:
r3d3: Optimized Query Compilation on GPUs. 277-288
- Guei-Yuan Lueh, Kaiyu Chen, Gang Chen, Joel Fuentes, Wei-Yu Chen, Fangwen Fu, Hong Jiang, Hongzheng Li, Daniel Rhee:
C-for-Metal: High Performance SIMD Programming on Intel GPUs. 289-300
Compiling for Spatial, Quantum, and Embedded Devices
- Ji Liu, Luciano Bello, Huiyang Zhou:
Relaxed Peephole Optimization: A Novel Compiler Optimization for Quantum Circuits. 301-314
- Johannes de Fine Licht, Andreas Kuster, Tiziano De Matteis, Tal Ben-Nun, Dominic Hofer, Torsten Hoefler:
StencilFlow: Mapping Large Stencil Programs to Distributed Spatial Computing Systems. 315-326
- Changsu Kim, Shinnung Jeong, Sungjun Cho, Yongwoo Lee, William Song, Youngsok Kim, Hanjun Kim:
Thread-Aware Area-Efficient High-Level Synthesis Compiler for Embedded Devices. 327-339
JIT and Binary Translation
- Guilherme Ottoni, Bin Liu:
HHVM Jump-Start: Boosting Both Warmup and Steady-State Performance at Scale. 340-350
- Ziyi Zhao, Zhang Jiang, Ying Chen, Xiaoli Gong, Wenwen Wang, Pen-Chung Yew:
Enhancing Atomic Instruction Emulation for Cross-ISA Dynamic Binary Translation. 351-362
- Milind Chabbi, Jin Lin, Raj Barik:
An Experience with Code-Size Optimization for Production iOS Mobile Applications. 363-377
- Anderson Faustino da Silva, Bruno Conde Kind, José Wesley de Souza Magalhães, Jerônimo Nunes Rocha, Breno Campos Ferreira Guimarães, Fernando Magno Quintão Pereira:
ANGHABENCH: A Suite with One Million Compilable C Benchmarks for Code-Size Reduction. 378-390
