Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization

CGO '08: Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization

April 2008

2008 Proceeding

General Chair:
Mary Lou Soffa
University of Virginia, USA
,
Program Chair:
Evelyn Duesterwald
IBM Research, USA

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

CGO '08: 6th Annual IEEE / ACM International Symposium on Code Generation and Optimization Boston MA USA April 5 - 9, 2008

ISBN:

978-1-59593-978-4

Published:

06 April 2008

Sponsors:

SIGPLAN, ACM, SIGMICRO

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Get Alerts for this ConferenceAlerts Save to BinderBinder

Save to Binder

Create a New Binder

Name

Export CitationCitation

Share on

Reflects downloads up to 22 Sep 2024Bibliometrics

Citation count

804

Downloads (6 weeks)

Downloads (12 months)

307

Downloads (cumulative)

15,561

Sections

CGO '08: Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization

2008

Previous Next

Abstract

No abstract available.

Proceeding Downloads

PDF(title page, copyright, welcome, contents, organization)

PDF(author index)

Skip Table Of Content Section

Select All

Export Citations Save to Binder

SESSION: JIT optimizations

research-article

Perfdiff: a framework for performance difference analysis in a virtual machine environment

Xiaotong Zhuang,
Suhyun Kim,
Mauri io Serrano,
Jong-Deok Choi

Pages 4–13https://doi.org/10.1145/1356058.1356060

Although applications running on virtual machines, such as Java, can achieve platform independence, performance evaluation and analysis becomes difficult due to extra intermediate layers and the dynamic nature of virtual execution environment.

We ...

- 17
- 514
Metrics
Total Citations17
Total Downloads514
Last 12 Months5
Last 6 weeks1

Abstract
Get Access

research-article

Automatic array inlining in java virtual machines

Christian Wimmer,
Hanspeter Mössenböck

Pages 14–23https://doi.org/10.1145/1356058.1356061

Array inlining expands the concepts of object inlining to arrays. Groups of objects and arrays that reference each other are placed consecutively in memory so that their relative offsets are fixed, i.e. they are colocated. This allows memory loads to be ...

- 28
- 425
Metrics
Total Citations28
Total Downloads425
Last 12 Months10
Last 6 weeks1

Abstract
Get Access

research-article

Phase-based adaptive recompilation in a JVM

Dayong Gu,
Clark Verbrugge

Pages 24–34https://doi.org/10.1145/1356058.1356062

Modern JIT compilers often employ multi-level recompilation strategies as a means of ensuring the most used code is also the most highly optimized, balancing optimization costs and expected future performance. Accurate selection of code to compile and ...

- 21
- 476
Metrics
Total Citations21
Total Downloads476
Last 12 Months6
Last 6 weeks0

Abstract
Get Access

SESSION: Static program analysis

research-article

Fast liveness checking for ssa-form programs

Benoit Boissinot,
Sebastian Hack,
Daniel Grund,
Benoît Dupont de Dine hin,
Fabri e Rastello

Pages 35–44https://doi.org/10.1145/1356058.1356064

Liveness analysis is an important analysis in optimizing compilers. Liveness information is used in several optimizations and is mandatory during the code-generation phase. Two drawbacks of conventional liveness analyses are that their computations are ...

- 23
- 775
Metrics
Total Citations23
Total Downloads775
Last 12 Months13
Last 6 weeks1

Abstract
Get Access

research-article

Near-optimal instruction selection on dags

David Ryan Koes,
Seth Copen Goldstein

Pages 45–54https://doi.org/10.1145/1356058.1356065

Instruction selection is a key component of code generation. High quality instruction selection is of particular importance in the embedded space where complex instruction sets are common and code size is a prime concern. Although instruction selection ...

- 10
- 588
Metrics
Total Citations10
Total Downloads588
Last 12 Months10
Last 6 weeks2

Abstract
Get Access

research-article

Comprehensive path-sensitive data-flow analysis

Aditya Thakur,
R. Govindarajan

Pages 55–63https://doi.org/10.1145/1356058.1356066

Data-flow analysis is an integral part of any aggressive optimizing compiler. We propose a framework for improving the precision of data-flow analysis in the presence of complex control-flow. We initially perform data-flow analysis to determine those ...

- 13
- 617
Metrics
Total Citations13
Total Downloads617
Last 12 Months16
Last 6 weeks6

Abstract
Get Access

SESSION: Profiling and tracing

research-article

Accurate critical path prediction via random trace construction

Pierre Salverda,
Charles Tu ker,
Craig Zilles

Pages 64–73https://doi.org/10.1145/1356058.1356068

We present a new approach to performing program analysis through profile-guided random generation of instruction traces. Using hardware support available in commercial processors, we profile the behavior of individual instructions. Then, in conjunction ...

- 6
- 387
Metrics
Total Citations6
Total Downloads387
Last 12 Months13
Last 6 weeks1

Abstract
Get Access

research-article

Efficient fine-grained binary instrumentationwith applications to taint-tracking

Prateek Saxena,
R Sekar,
Varun Puranik

Pages 74–83https://doi.org/10.1145/1356058.1356069

Fine-grained binary instrumentations, such as those for taint-tracking, have become very popular in computer security due to their applications in exploit detection, sandboxing, malware analysis, etc. However, practical application of taint-tracking has ...

- 51
- 716
Metrics
Total Citations51
Total Downloads716
Last 12 Months21
Last 6 weeks1

Abstract
Get Access

research-article

Branch-on-random

Edward Lee,
Craig Zilles

Pages 84–93https://doi.org/10.1145/1356058.1356070

We propose a new instruction, branch-on-random, that is like a standard conditional branch, except rather than specifying the condition on which the branch should be taken, it specifies a frequency at which the branch should be taken. We show that ...

- 5
- 334
Metrics
Total Citations5
Total Downloads334
Last 12 Months0
Last 6 weeks0

Abstract
Get Access

research-article

Prediction and trace compression of data access addresses through nested loop recognition

Alain Ketterlin,
Philippe Clauss

Pages 94–103https://doi.org/10.1145/1356058.1356071

This paper describes an algorithm that takes a trace (i.e., a sequence of numbers or vectors of numbers) as input, and from that produces a sequence of loop nests that, when run, produces exactly the original sequence. The input format is suitable for ...

- 28
- 427
Metrics
Total Citations28
Total Downloads427
Last 12 Months20
Last 6 weeks4

Abstract
Get Access

SESSION: Software pipelining

research-article

Latency-tolerant software pipelining in a production compiler

Sebastian Winkel,
Rakesh Krishnaiyer,
Robyn Sampson

Pages 104–113https://doi.org/10.1145/1356058.1356073

In this paper we investigate the benefit of scheduling non-critical loads for a higher latency during software pipelining. "Non-critical" denotes those loads that have sufficient slack in the cyclic data dependence graph so that increasing the ...

- 2
- 434
Metrics
Total Citations2
Total Downloads434
Last 12 Months5
Last 6 weeks0

Abstract
Get Access

research-article

Parallel-stage decoupled software pipelining

Easwaran Raman,
Guilherme Ottoni,
Arun Raman,
Matthew J. Bridges,
David I. August

Pages 114–123https://doi.org/10.1145/1356058.1356074

In recent years, the microprocessor industry has embraced chip multiprocessors (CMPs), also known as multi-core architectures, as the dominant design paradigm. For existing and new applications to make effective use of CMPs, it is desirable that ...

- 99
- 1,124
Metrics
Total Citations99
Total Downloads1,124
Last 12 Months33
Last 6 weeks1

Abstract
Get Access

research-article

Modulo scheduling for highly customized datapaths to increase hardware reusability

Kevin Fan,
Hyun hul Park,
Manjunath Kudlur,
S ott Mahlke

Pages 124–133https://doi.org/10.1145/1356058.1356075

In the embedded domain, custom hardware in the form of ASICs is often used to implement critical parts of applications when performance and energy efficiency goals cannot be met with software implementations on a general purpose processor or DSP. The ...

- 18
- 375
Metrics
Total Citations18
Total Downloads375
Last 12 Months0
Last 6 weeks0

Abstract
Get Access

SESSION: Compiler optimization

research-article

Removing redundancy via exception check motion

Vijay Sundaresan,
Mark Stoodley,
Pramod Ramarao

Pages 134–143https://doi.org/10.1145/1356058.1356077

Partial redundancy elimination aims to reduce the number of times an expression is computed more than once. The traditional Lazy Code Motion (LCM) algorithm formulated by Knoop, Ruthing and Steffen, through its reliance on unordered bit vectors, is ...

- 2
- 318
Metrics
Total Citations2
Total Downloads318
Last 12 Months0
Last 6 weeks0

Abstract
Get Access

research-article

Fault-safe code motion for type-safe languages

Brian R. Murphy,
Vijay Menon,
Florian T. Schneider,
Tatiana Shpeisman,
Ali-Reza Adl-Tabatabai

Pages 144–154https://doi.org/10.1145/1356058.1356078

Compilers for Java and other type-safe languages have historically worked to overcome overheads and constraints imposed by runtime safety checks and precise exception semantics. We instead exploit these safety properties to perform code motion ...

- 2
- 385
Metrics
Total Citations2
Total Downloads385
Last 12 Months2
Last 6 weeks0

Abstract
Get Access

research-article

Prefetching irregular references for software cache on cell

Tong Chen,
Tao Zhang,
Zehra Sura,
Mar Gonzales Tallada

Pages 155–164https://doi.org/10.1145/1356058.1356079

The IBM Single Source Research Compiler for the Cell processor (the SSC Research Compiler) was developed to manage the complexity of programming the heterogeneous multicore Cell processor. The compiler accepts conventional source programs as input, and ...

- 47
- 777
Metrics
Total Citations47
Total Downloads777
Last 12 Months9
Last 6 weeks0

Abstract
Get Access

research-article

Cole: compiler optimization level exploration

Kenneth Hoste,
Lieven Eeckhout

Pages 165–174https://doi.org/10.1145/1356058.1356080

Modern compilers implement a large number of optimizations which all interact in complex ways, and which all have a different impact on code quality, compilation time, code size, energy consumption, etc. For this reason, compilers typically provide a ...

- 113
- 1,350
Metrics
Total Citations113
Total Downloads1,350
Last 12 Months86
Last 6 weeks7

Abstract
Get Access

SESSION: Compiling for multicore and multithreading

research-article

Spice: speculative parallel iteration chunk execution

Easwaran Raman,
Neil Va hharajani,
Ram Rangan,
David I. August

Pages 175–184https://doi.org/10.1145/1356058.1356082

The recent trend in the processor industry of packing multiple processor cores in a chip has increased the importance of automatic techniques for extracting thread level parallelism. A promising approach for extracting thread level parallelism in ...

- 46
- 419
Metrics
Total Citations46
Total Downloads419
Last 12 Months3
Last 6 weeks0

Abstract
Get Access

research-article

Pipa: pipelined profiling and analysis on multi-core systems

Qin Zhao,
Ioana Cutcutache,
Weng-Fai Wong

Pages 185–194https://doi.org/10.1145/1356058.1356083

Dynamic instrumentation systems are gaining popularity as means of constructing customized program profiling and analysis tools. However, dynamic instrumentation based analysis tools still suffer from performance problems. The overhead of such systems ...

- 34
- 479
Metrics
Total Citations34
Total Downloads479
Last 12 Months0
Last 6 weeks0

Abstract
Get Access

research-article

Program optimization space pruning for a multithreaded gpu

Shane Ryoo,
Christopher I. Rodrigues,
Sam S. Stone,
Sara S. Baghsorkhi,
Sain-Zee Ueng,
John A. Stratton,
Wen-mei W. Hwu

Pages 195–204https://doi.org/10.1145/1356058.1356084

Program optimization for highly-parallel systems has historically been considered an art, with experts doing much of the performance tuning by hand. With the introduction of inexpensive, single-chip, massively parallel platforms, more developers will be ...

- 218
- 2,023
Metrics
Total Citations218
Total Downloads2,023
Last 12 Months46
Last 6 weeks6

Abstract
Get Access

research-article

Compiling for vector-thread architectures

Mark Hampton,
Krste Asanovic

Pages 205–215https://doi.org/10.1145/1356058.1356085

Vector-thread (VT) architectures exploit multiple forms of parallelism simultaneously. This paper describes a compiler for the Scale VT architecture, which takes advantage of the VT features. We focus on compiling loops, and show how the compiler can ...

- 12
- 534
Metrics
Total Citations12
Total Downloads534
Last 12 Months2
Last 6 weeks0

Abstract
Get Access

SESSION: Keynote addresses

keynote

Code optimization of parallel programs: evolutionary vs. revolutionary approaches

Vivek Sarkar

Page 1https://doi.org/10.1145/1356058.1356087

Code optimization has a rich history that dates back over half a century. Over the years, it has contributed deep innovations to address challenges posed by new computer system and programming language features. Examples of the former include ...

- 0
- 887
Metrics
Total Citations0
Total Downloads887
Last 12 Months5
Last 6 weeks0

Abstract
Get Access

keynote

Issues and challenges in compiling for graphics processors

Norm Rubin

Page 2https://doi.org/10.1145/1356058.1356088

Graphics has been one of the best success stories of parallel processing. Using a unique combination of specialized hardware and aspecialized programming model, game developers routinely write high performance code using millions of threads. Each ...

- 3
- 512
Metrics
Total Citations3
Total Downloads512
Last 12 Months2
Last 6 weeks1

Abstract
Get Access

keynote

Parallelism by design: data analysis with sawzall

Robert Griesemer

Page 3https://doi.org/10.1145/1356058.1356089

Very large data sets - telephone call records, network logs, high-resolution satellite images, or web document repositories - are not easily analyzed using traditional database techniques. They may be simply too large, grow too fast, or may not fit well ...

- 4
- 626
Metrics
Total Citations4
Total Downloads626
Last 12 Months0
Last 6 weeks0

Abstract
Get Access

Cited By

Tran K, Jimborean A, Carlson T, Koukos K, Själander M and Kaxiras S SWOOP: software-hardware co-design for non-speculative, execute-ahead, in-order cores Proceedings of the 39th ACM SIGPLAN Conference on Programming Language Design and Implementation, (328-343)
TANASE C and GAITAN V (2013). Threads Pipelining on the CellBE Systems, Advances in Electrical and Computer Engineering, 10.4316/AECE.2013.03019, 13:3, (121-126),

Save to Binder

Create a New Binder

Name

Contributors

Mary Lou Soffa
University of Virginia
- Publication Years1977 - 2024
- Publication counts172
- Citation count7,014
- Available for Download138
- Downloads (cumulative)106,256
- Downloads (12 months)7,476
- Downloads (6 weeks)1,417
- Average Downloads per Article770
- Average Citation per Article41
View Full Profile
Evelyn Duesterwald
IBM Research
- Publication Years1991 - 2022
- Publication counts33
- Citation count3,002
- Available for Download26
- Downloads (cumulative)26,696
- Downloads (12 months)3,500
- Downloads (6 weeks)611
- Average Downloads per Article1,027
- Average Citation per Article91
View Full Profile

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

CGO '10: Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization
NOCS '11: Proceedings of the Fifth ACM/IEEE International Symposium on Networks-on-Chip
HRI '15: Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction

Acceptance Rates

CGO '08 Paper Acceptance Rate 21 of 66 submissions, 32%;

Overall Acceptance Rate 312 of 1,061 submissions, 29%

Year	Submitted	Accepted	Rate
CGO '17	116	26	22%
CGO '16	108	25	23%
CGO '15	88	24	27%
CGO '14	100	29	29%
CGO '12	90	26	29%
CGO '11	105	28	27%
CGO '09	70	26	37%
CGO '08	66	21	32%
CGO '07	84	27	32%
CGO '06	80	29	36%
CGO '05	75	26	35%
CGO '04	79	25	32%
Overall	1,061	312	29%

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation

Save to Binder

Sections

Proceeding Downloads

Cited By

Save to Binder

Recommendations

CGO '10: Proceedings of the 8th annual IEEE/ACM international symposium on Code generation and optimization

NOCS '11: Proceedings of the Fifth ACM/IEEE International Symposium on Networks-on-Chip

HRI '15: Proceedings of the Tenth Annual ACM/IEEE International Conference on Human-Robot Interaction

Acceptance Rates