Performant portable openMP

G Ozen, M Wolfe - Proceedings of the 31st ACM SIGPLAN International …, 2022 - dl.acm.org
… This can make OpenMP programming … OpenMP model with compiler based automatic
parallelism mapping to increase all three of performance, portability, and productivity of OpenMP

A case study for performance portability using OpenMP 4.5

R Gayatri, C Yang, T Kurth, J Deslippe - … Dallas, TX, USA, November 11-17 …, 2019 - Springer
… using OpenMP 3.0, OpenMP 4.5, … performance of the OpenMP 4.5 implementation with
that of the more architecture-specific implementations, examine the performance of the OpenMP

Evaluating the impact of proposed openmp 5.0 features on performance, portability and productivity

SJ Pennycook, JD Sewall… - … Performance, Portability …, 2018 - ieeexplore.ieee.org
… posed for OpenMP 5.0 – specifically, the metadirective and declare variant directives – may
… an OpenMP 4.5 implementation of miniMD that achieves a performance portability of 59.35% …

OpenUH: An optimizing, portable OpenMP compiler

C Liao, O Hernandez, B Chapman… - Concurrency and …, 2007 - Wiley Online Library
… enhancements to OpenMP and to study its performance on new … focus on performance tuning
both the OpenMP translation … optimizations to improve OpenMP performance on new chip …

[PDF][PDF] Early experiences writing performance portable OpenMP 4 codes

VV Larrea, W Joubert, MG Lopez… - Proc. Cray User Group …, 2016 - cug.org
… and related work, with an overview of the core OpenMP … on the performance portability of
the various ways to use OpenMP, … improve OpenMP for performance portable programming in …

On the performance portability of OpenACC, OpenMP, Kokkos and RAJA

A Marowka - … Conference on High Performance Computing in Asia …, 2022 - dl.acm.org
… the performance portability of high-level parallel programming models. Using the new metric,
the performance portability of OpenACC, OpenMP, … , and high-performance compilers. The …

Breaking the vendor lock: performance portable programming through OpenMP as target independent runtime layer

J Doerfert, M Jasper, J Huber, K Abdelaal… - Proceedings of the …, 2022 - dl.acm.org
… programming of GPUs by generating portable code from the original, vendor-… performance
portable CUDA by leveraging the existing LLVM/OpenMP offloading infrastructure for portable

Pragmatic performance portability with OpenMP 4. x

M Martineau, J Price, S McIntosh-Smith… - OpenMP: Memory …, 2016 - Springer
… model onto their target architectures, and conduct performance testing with a … of performance
portability and propose some straightforward guidelines for writing performance portable

The productivity, portability and performance of OpenMP 4.5 for scientific applications targeting Intel CPUs, IBM CPUs, and NVIDIA GPUs

M Martineau, S McIntosh-Smith - … for Exascale Performance and Portability …, 2017 - Springer
… This research considers the productivity, portability, and performance offered by the OpenMP
OpenMP, including straightforward code mechanisms to improve productivity and portability

Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors

JC Saez, F Castro, M Prieto-Matias - Proceedings of the 49th …, 2020 - dl.acm.org
performance evaluation on multicore systems. We observed that conventional loop-scheduling
OpenMP … that naturally stems from the different performance delivered by big and small …