Accepted Papers

A Code Generator for High-Performance Tensor Contractions on GPUs
Jinsung Kim, Aravind Sukumaran-Rajam, Vineeth Thumma (The Ohio State University), Sriram Krishnamoorthy, Ajay Panyala (Pacific Northwest National Laboratory), Louis-Noel Pouchet (Colorado State University), Atanas Rountev, P. Sadayappan (The Ohio State University)

An Optimization-Driven Incremental Inline Substitution Algorithm for Just-In-Time Compilers
Aleksandar Prokopec, Gilles Duboscq (Oracle Labs), David Leopoldseder (Johannes Kepler University Linz), Thomas Wuerthinger (Oracle Labs)

Automatic Equivalence Checking for Assembly Implementations of Cryptography Libraries
Jay P. Lim, Santosh Nagarakatte (Rutgers University)

Automatic Generation of Warp-Level Primitives and Atomic Operations for Fast-Portable GPU Reductions
Simon Garcia De Gonzalo, Sitao Huang (University of Illinois at Urbana–Champaign), Juan Gomez-Luna (Swiss Federal Institute of Technology(ETH) Zurich), Simon Hammond (Sandia National Laboratories), Onur Mutlu (Swiss Federal Institute of Technology (ETH) Zurich), Wen-mei Hwu (University of Illinois at Urbana–Champaign), Juan Gomez-Luna (Swiss Federal Institute of Technology(ETH) Zurich)

BOLT: A Practical Binary Optimizer for Data Centers and Beyond
Maksim Panchenko, Rafael Auler, Guilherme Ottoni (Facebook), Bill Nell

Causer: Context-Sensitive Overflow Detection
Hongyu Liu, Sam Silvestro, Xiaoyin Wang, Lide Duan, Tongping Liu (the University of Texas at San Antonio)

Decoding CUDA Binary
Ari B. Hayes, Fei Hua, Jin Huang, Yanhao Chen, Eddy Z. Zhang, Ari Hayes (Rutgers University)

From Loop Fusion to Kernel Fusion: A Domain-specific Approach to Locality Optimization
Bo Qiao, Oliver Reiche, Frank Hannig, Jürgen Teich (Department of Computer Science, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU))

Function Merging by Sequence Alignment
Rodrigo C. O. Rocha, Pavlos Petoumenos (University of Edinburgh, UK), Zheng Wang (Lancaster University, UK), Murray Cole, Hugh Leather (University of Edinburgh, UK)

IGC: The Open Source Intel Graphics Compiler
Weiyu Chen, Po-yu Chen, Guei-Yuan Lueh, Peng Guo, Wei Pan, Thomas F Raoux, Pankaj Mistry, Gang Y Chen, Shruthi Hebbur Prasanna Kumar, Junjie Gu, Konrad Trifunovic, Anupama Chandrasekhar (Intel)

Janus: Statically-Driven and Profile-Guided Automatic Dynamic Binary Parallelization
Ruoyu Zhou, Timothy M. Jones (University of Cambridge)

Locus: a System and a Language for Program Optimization
Thiago S F X Teixeira (University of Illinois at Urbana-Champaign), Corinne Ancourt (Mines ParisTech), David Padua, William Gropp (University of Illinois at Urbana-Champaign)

Quantifying and Reducing Execution Variance in STM via Model Driven Commit Optimization
Girish Mururu, Ada Gavrilovska, Santosh Pande (Georgia Institute of Technology)

Reasoning about the Node.js Event Loop Using Async Graphs
Haiyang Sun (Università della Svizzera italiana), Daniele Bonetta (Oracle Labs), Filippo Schiavio, Walter Binder (Università della Svizzera italiana)

Safe Performance Test in Memory-Unsafe Languages
Marcus Rodrigues, Breno Guimaraes, Fernando Quintao (UFMG)

Smokestack: Thwarting DOP Attacks with Runtime Stack Layout Randomization
Misiker Tadesse Aga, Todd Austin (University of Michigan)

Sparse Tensor Algebra Optimization with Workspaces
Fredrik Kjolstad, Peter Ahrens, Shoaib Kamil, Saman Amarasinghe (MIT)

Super-Node SLP: Optimized Vectorization for Code Sequences Containing Operators and their Inverse Elements
Vasileios Porpodas (Intel Corporation), Rodrigo C. O. Rocha (University of Edinburgh), Evgueni Brevnov (Intel Corporation), Luis F. W. Goes (PUC Minas), Timothy Mattson (Intel Corporation)

Tiramisu: A Polyhedral Compiler with A Scheduling Language for Targeting High Performance Systems
Riyadh Baghdadi, Jessica Ray, Malek Ben Romdhane (MIT), Emanuele Del Sozzo (Politecnico di Milano), Abdurrahman Akkas, Yunming Zhang (MIT), Patricia Suriana (Google), Shoaib Kamil (Adobe), Saman Amarasinghe (MIT)

Transforming Query Sequences for High-Throughput B+ Tree Processing on Many-core Processors
Ruiqin Tian (College of William and Mary), Junqiao Qiu, Zhijia Zhao (University of California, Riverside), Xu Liu, Bin Ren (College of William and Mary)

White-Box Program Tuning
Wen-Chuan Lee, Yingqi Liu, Peng Liu, Shiqing Ma, Hongjun Choi, Xiangyu Zhang (Purdue University), Rajiv Gupta (UC Riverside)