Less is more: Exploiting the standard compiler optimization levels for better performance and energy consumption
K Georgiou, C Blackmore, S Xavier-de-Souza… - Proceedings of the 21st …, 2018 - dl.acm.org
Proceedings of the 21st International Workshop on Software and Compilers for …, 2018•dl.acm.org
This paper presents the interesting observation that by performing fewer of the optimizations
available in a standard compiler optimization level such as-02, while preserving their
original ordering, significant savings can be achieved in both execution time and energy
consumption. This observation has been validated on two embedded processors, namely
the ARM Cortex-M0 and the ARM Cortex-M3, using two different versions of the LLVM
compilation framework; v3. 8 and v5. 0. Experimental evaluation with 71 embedded …
available in a standard compiler optimization level such as-02, while preserving their
original ordering, significant savings can be achieved in both execution time and energy
consumption. This observation has been validated on two embedded processors, namely
the ARM Cortex-M0 and the ARM Cortex-M3, using two different versions of the LLVM
compilation framework; v3. 8 and v5. 0. Experimental evaluation with 71 embedded …
This paper presents the interesting observation that by performing fewer of the optimizations available in a standard compiler optimization level such as -02, while preserving their original ordering, significant savings can be achieved in both execution time and energy consumption. This observation has been validated on two embedded processors, namely the ARM Cortex-M0 and the ARM Cortex-M3, using two different versions of the LLVM compilation framework; v3.8 and v5.0. Experimental evaluation with 71 embedded benchmarks demonstrated performance gains for at least half of the benchmarks for both processors. An average execution time reduction of 2.4% and 5.3% was achieved across all the benchmarks for the Cortex-M0 and Cortex-M3 processors, respectively, with execution time improvements ranging from 1% up to 90% over the -02. The savings that can be achieved are in the same range as what can be achieved by the state-of-the-art compilation approaches that use iterative compilation or machine learning to select flags or to determine phase orderings that result in more efficient code. In contrast to these time consuming and expensive to apply techniques, our approach only needs to test a limited number of optimization configurations, less than 64, to obtain similar or even better savings. Furthermore, our approach can support multi-criteria optimization as it targets execution time, energy consumption and code size at the same time.
ACM Digital Library