default search action
11th VECPAR 2014: Eugene, OR, USA
- Michel J. Daydé, Osni Marques, Kengo Nakajima:
High Performance Computing for Computational Science - VECPAR 2014 - 11th International Conference, Eugene, OR, USA, June 30 - July 3, 2014, Revised Selected Papers. Lecture Notes in Computer Science 8969, Springer 2015, ISBN 978-3-319-17352-8
Algorithms for GPU and Manycores
- Langshi Chen, Serge G. Petiton, Leroy Anthony Drummond, Maxime R. Hugues:
A Communication Optimization Scheme for Basis Computation of Krylov Subspace Methods on Multi-GPUs. 3-16 - Ichitaro Yamazaki, Stanimire Tomov, Tingxing Dong, Jack J. Dongarra:
Mixed-Precision Orthogonalization Scheme and Adaptive Step Size for Improving the Stability and Performance of CA-GMRES on GPUs. 17-30 - Azzam Haidar, Piotr Luszczek, Stanimire Tomov
, Jack J. Dongarra:
Heterogenous Acceleration for Linear Algebra in Multi-coprocessor Environments. 31-42 - Fan Ye, Christophe Calvin, Serge G. Petiton:
A Study of SpMV Implementation Using MPI and OpenMP on Intel Many-Core Architecture. 43-56 - Masatoshi Kawai, Takeshi Iwashita, Hiroshi Nakashima:
SIMD Implementation of a Multiplicative Schwarz Smoother for a Multigrid Poisson Solver on an Intel Xeon Phi Coprocessor. 57-65 - Futoshi Mori, Masaharu Matsumoto, Takashi Furumura:
Performance Optimization of the 3D FDM Simulation of Seismic Wave Propagation on the Intel Xeon Phi Coprocessor Using the ppOpen-APPL/FDM Library. 66-76
Large-Scale Applications
- Prasanna Balaprakash
, Yuri Alexeev, Sheri A. Mickelson, Sven Leyffer
, Robert L. Jacob, Anthony P. Craig:
Machine-Learning-Based Load Balancing for Community Ice Code Component in CESM. 79-91 - Timothy B. Costa, David Foster, Malgorzata Peszynska
:
Domain Decomposition for Heterojunction Problems in Semiconductors. 92-101 - Heidi K. Thornquist, Sivasankaran Rajamanickam:
A Hybrid Approach for Parallel Transistor-Level Full-Chip Circuit Simulation. 102-111
Numerical Algorithms
- Hartwig Anzt
, Dimitar Lukarski, Stanimire Tomov
, Jack J. Dongarra:
Self-adaptive Multiprecision Preconditioners on Multicore and Manycore Architectures. 115-123 - Ziming Zheng, Andrew A. Chien, Keita Teranishi:
Fault Tolerance in an Inner-Outer Solver: A GVR-Enabled Case Study. 124-132
Direct/Hybrid Methods for Solving Sparse Matrices
- Marc Baboulin, Xiaoye S. Li, François-Henry Rouet:
Using Random Butterfly Transformations to Avoid Pivoting in Sparse Direct Methods. 135-144 - Joshua Dennis Booth, Padma Raghavan:
Hybrid Sparse Linear Solutions with Substituted Factorization. 145-155 - Patrick Amestoy, Jean-Yves L'Excellent, François-Henry Rouet, Wissam M. Sid-Lakhdar:
Modeling 1D Distributed-Memory Dense Kernels for an Asynchronous Multifrontal Sparse Solver. 156-169
Performance Tuning
- Steven H. Langer, Ian Karlin, Michael M. Marinak:
Performance Characteristics of HYDRA - A Multi-physics Simulation Code from LLNL. 173-181 - Mark Gates
, Azzam Haidar, Jack J. Dongarra:
Accelerating Computation of Eigenvectors in the Dense Nonsymmetric Eigenvalue Problem. 182-191 - Kenji Ono, Shuichi Chiba, Shunsuke Inoue, Kazuo Minami:
Low Byte/Flop Implementation of Iterative Solver for Sparse Matrices Derived from Stencil Computations. 192-205
The Ninth International Workshop on Automatic Performance Tuning
- Yu Lin, Franjo Ivancic, Pallavi Joshi, Gogul Balakrishnan, Malay K. Ganai, Aarti Gupta:
Environment-Sensitive Performance Tuning for Distributed Service Orchestration. 209-223 - Shahzeb Siddiqui
, Fatemah AlZayer
, Saber Feki
:
Historic Learning Approach for Auto-tuning OpenACC Accelerated Scientific Applications. 224-235 - Richard Veras, Franz Franchetti:
Capturing the Expert: Generating Fast Matrix-Multiply Kernels with Spiral. 236-244 - Elmar Peise, Paolo Bientinesi:
A Study on the Influence of Caching: Sequences of Dense Linear Algebra Kernels. 245-258 - France Boillod-Cerneux, Serge G. Petiton, Christophe Calvin, Leroy Anthony Drummond:
Toward Restarting Strategies Tuning for a Krylov Eigenvalue Solver. 259-268 - Takeshi Fukaya, Toshiyuki Imamura, Yusaku Yamamoto:
Performance Analysis of the Householder-Type Parallel Tall-Skinny QR Factorizations Toward Automatic Algorithm Selection. 269-283 - Takeshi Minami, Motoharu Hibino, Tasuku Hiraishi, Takeshi Iwashita, Hiroshi Nakashima:
Automatic Parameter Tuning of Three-Dimensional Tiled FDTD Kernel. 284-297 - Alfian Amrizal, Shoichi Hirasawa, Hiroyuki Takizawa
, Hiroaki Kobayashi:
Automatic Parameter Tuning of Hierarchical Incremental Checkpointing. 298-309
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.