default search action
Parallel Computing, Volume 40
Volume 40, Number 1, January 2014
- Leonid Yavits, Amir Morad, Ran Ginosar:
The effect of communication and synchronization on Amdahl's law in multicore systems. 1-16 - Lois Curfman McInnes, Barry Smith, Hong Zhang, Richard Tran Mills:
Hierarchical Krylov and nested Krylov methods for extreme-scale computing. 17-31
Volume 40, Number 2, February 2014
- Pavan Balaji, Zhiyi Huang:
Special issue on programming models and applications for multicores and manycores - Guest Editors' Introduction. 33-34 - Mark Utting, Min-Hsien Weng, John G. Cleary:
The JStar language philosophy. 35-50 - Weihua Sheng, Stefan Schürmans, Maximilian Odendahl, Mark Bertsch, Vitaliy Volevach, Rainer Leupers, Gerd Ascheid:
A compiler infrastructure for embedded heterogeneous MPSoCs. 51-68 - Vikas, Nasser Giacaman, Oliver Sinnen:
Multiprocessing with GUI-awareness using OpenMP-like directives in Java. 69-89 - Oded Green, Yitzhak Birk:
Scheduling directives: Accelerating shared-memory many-core processor execution. 90-106 - Zhenning Wang, Long Zheng, Quan Chen, Minyi Guo:
CPU + GPU scheduling with asymptotic profiling. 107-115 - Yu Liu, Kento Emoto, Zhenjiang Hu:
A Generate-Test-Aggregate parallel programming library for systematic parallel programming. 116-135 - Zhijun Hao, Chenning Xie, Haibo Chen, Binyu Zang:
X10-FT: Transparent fault tolerance for APGAS language and runtime. 136-156
Volume 40, Numbers 3-4, March 2014
- Mohammad Reza Selim, Mohammed Ziaur Rahman:
Carrying on the legacy of imperative languages in the future parallel computing era. 1-33 - Jean-Yves L'Excellent, Wissam M. Sid-Lakhdar:
A study of shared-memory parallelism in a multifrontal solver. 34-46
Volume 40, Numbers 5-6, May 2014
- Urban Borstnik, Joost VandeVondele, Valéry Weber, Jürg Hutter:
Sparse matrix multiplication: The distributed block-compressed sparse row library. 47-58 - Yuki Sugimoto, Fumihiko Ino, Kenichi Hagihara:
Improving cache locality for GPU-based volume rendering. 59-69 - Ray-Bing Chen, Yaohung M. Tsai, Weichung Wang:
Adaptive block size for dense QR factorization in hybrid CPU-GPU systems via statistical modeling. 70-85 - Michael J. Hallock, John E. Stone, Elijah Roberts, Corey Fry, Zaida Luthey-Schulten:
Simulation of reaction diffusion processes over biologically relevant size and time scales using multi-GPU workstations. 86-99 - Ivan Teixido, Francesc Sebé, Josep Conde, Francesc Solsona:
MPI-based implementation of an enhanced algorithm to solve the LPN problem in a memory-constrained environment. 100-112 - Alberto F. Martín, Ruymán Reyes, Rosa M. Badia, Enrique S. Quintana-Ortí:
Leveraging task-parallelism in message-passing dense matrix factorizations using SMPSs. 113-128 - Jose Antonio Pascual, José Miguel-Alonso, José Antonio Lozano:
Application-aware metrics for partition selection in cube-shaped topologies. 129-139 - Robert Hallberg, Alistair Adcroft:
An order-invariant real-to-integer conversion sum. 140-143 - Oscar Peredo, Julián M. Ortiz, José R. Herrero, Cristóbal Samaniego:
Tuning and hybrid parallelization of a genetic-based multi-point statistics simulation code. 144-158
Volume 40, Number 7, July 2014
- Costas Bekas, Ananth Grama, Yousef Saad, Olaf Schenk:
Parallel matrix algorithms. 159-160 - Robert Andrew, Nicholas J. Dingle:
Implementing QR factorization updating algorithms on GPUs. 161-172 - Yiannis Cotronis, Elias Konstantinidis, Maria A. Louka, Nikolaos M. Missirlis:
A comparison of CPU and GPU implementations for solving the Convection Diffusion equation using the local Modified SOR method. 173-185 - Thomas Auckenthaler, Thomas Huckle, Roland Wittmann:
A blocked QR-decomposition for the parallel symmetric eigenvalue problem. 186-194 - Hasan Metin Aktulga, Lin Lin, Christopher Haine, Esmond G. Ng, Chao Yang:
Parallel eigenvalue calculation based on multiple shift-invert Lanczos and contour integral based spectral projection method. 195-212 - Marc Baboulin, Dulceneia Becker, George Bosilca, Anthony Danalis, Jack J. Dongarra:
An efficient distributed randomized algorithm for solving large dense symmetric indefinite linear systems. 213-223 - Pieter Ghysels, Wim Vanroose:
Hiding global synchronization latency in the preconditioned Conjugate Gradient algorithm. 224-238 - Erhan Turan, Peter Arbenz:
Large scale micro finite element analysis of 3D bone poroelasticity. 239-250 - Michele Martone:
Efficient multithreaded untransposed, transposed or symmetric sparse matrix-vector multiplication with the Recursive Sparse Blocks format. 251-270 - Lars Karlsson, Bo Kågström, Eddie Wadbro:
Fine-grained bulge-chasing kernels for strongly scalable parallel QR algorithms. 271-288 - Johannes Langguth, Ariful Azad, Mahantesh Halappanavar, Fredrik Manne:
On parallel push-relabel based algorithms for bipartite maximum matching. 289-308 - Jesús Cámara, Javier Cuenca, Luis-Pedro García, Domingo Giménez:
Auto-tuned nested parallelism: A way to reduce the execution time of scientific software in NUMA systems. 309-327 - Emanuel H. Rubensson, Elias Rudberg:
Chunks and Tasks: A programming model for parallelization of dynamic algorithms. 328-343
Volume 40, Number 8, August 2014
- María Botón-Fernández, Miguel A. Vega-Rodríguez, Francisco Prieto Castrillo:
Self-adaptivity for grid applications. An Efficient Resources Selection model based on evolutionary computation algorithms. 345-361 - Chihiro Kodama, Masaaki Terai, Akira T. Noda, Yohei Yamada, Masaki Satoh, Tatsuya Seiki, Shin-ichi Iga, Hisashi Yashiro, Hirofumi Tomita, Kazuo Minami:
Scalable rank-mapping algorithm for an icosahedral grid system on the massive parallel computer with a 3-D torus network. 362-373 - Julio Sánchez-Curto, Pedro Chamorro-Posada, Graham S. McDonald:
Efficient parallel implementation of the nonparaxial beam propagation method. 394-407 - Jie Chen, Tom L. H. Li, Mihai Anitescu:
A parallel linear solver for multilevel Toeplitz systems with possibly several right-hand sides. 408-424 - Roman Wyrzykowski, Lukasz Szustak, Krzysztof Rojek:
Parallelization of 2D MPDATA EULAG algorithm on hybrid architectures with GPU accelerators. 425-447
Volume 40, Number 9, October 2014
- João Andrade, Gabriel Falcão Paiva Fernandes, Vítor Manuel Mendes da Silva:
Optimized Fast Walsh-Hadamard Transform on GPUs for non-binary LDPC decoding. 449-453
- Ehsan Totoni, Michael T. Heath, Laxmikant V. Kalé:
Structure-adaptive parallel solution of sparse triangular linear systems. 454-470 - Diego Arroyuelo, Carolina Bonacic, Veronica Gil-Costa, Mauricio Marín, Gonzalo Navarro:
Distributed text search using suffix arrays. 471-495 - Yingchong Situ, Chandra S. Martha, Matthew E. Louis, Zhiyuan Li, Ahmed H. Sameh, Gregory A. Blaisdell, Anastasios S. Lyrintzis:
Petascale large eddy simulation of jet engine noise based on the truncated SPIKE algorithm. 496-511
- Lucas Mello Schnorr, Philippe Olivier Alexandre Navaux:
Best of SBAC-PAD 2012. 512-513 - Luiz E. Ramos, Ricardo Bianchini:
Robust performance in hybrid-memory cooperative caches. 514-525 - Joefon Jann, R. Sarma Burugula, Ching-Farn Eric Wu, Kaoutar El Maghraoui:
Towards an immortal operating system in virtual environments. 526-535 - Esteban Meneses, Osman Sarood, Laxmikant V. Kalé:
Energy profile of rollback-recovery strategies in high performance computing. 536-547 - Teo Milanez, Caroline Collange, Fernando Magno Quintão Pereira, Wagner Meira Jr., Renato Ferreira:
Thread scheduling and memory coalescing for dynamic vectorization of SPMD workloads. 548-558
Volume 40, Number 10, December 2014
- Li Tan, Shashank Kothapalli, Longxiang Chen, Omar Hussaini, Ryan Bissiri, Zizhong Chen:
A survey of power and energy efficient techniques for high performance numerical linear algebra operations. 559-573
- Antonio J. Peña, Carlos Reaño, Federico Silla, Rafael Mayo, Enrique S. Quintana-Ortí, José Duato:
A complete and efficient CUDA-sharing solution for HPC clusters. 574-588 - George Teodoro, Tony Pan, Tahsin M. Kurç, Jun Kong, Lee A. D. Cooper, Scott Klasky, Joel H. Saltz:
Region templates: Data representation and management for high-throughput image analysis. 589-610 - Yizhuo Wang, Yang Zhang, Yan Su, Xiaojun Wang, Xu Chen, Weixing Ji, Feng Shi:
An adaptive and hierarchical task scheduling scheme for multi-core clusters. 611-627 - Andrew White, Soo-Young Lee:
Derivation of optimal input parameters for minimizing execution time of matrix-based computations on a GPU. 628-645 - Nicholas Horelik, Andrew R. Siegel, Benoit Forget, Kord Smith:
Monte Carlo domain decomposition for robust nuclear reactor analysis. 646-660 - Leandro A. J. Marzulo, Tiago A. O. Alves, Felipe M. G. França, Vítor Santos Costa:
Couillard: Parallel programming via coarse-grained Data-flow Compilation. 661-680
- Philip C. Roth, Yong Chen:
Guest Editors' introduction to the special issue on "DISCS-2013". 681 - Jesse Weaver, Vito Giovanni Castellana, Alessandro Morari, Antonino Tumeo, Sumit Purohit, Alan R. Chappell, David Haglin, Oreste Villa, Sutanay Choudhury, Karen Schuchardt, John Feo:
Toward a data scalable solution for facilitating discovery of science resources. 682-696 - Jiangling Yin, Junyao Zhang, Jun Wang, Wu-chun Feng:
SDAFT: A novel scalable data access framework for parallel BLAST. 697-709 - Yong Li, Dan Feng, Zhan Shi:
Heterogeneous-aware cache partitioning: Improving the fairness of shared storage cache. 710-721 - Joong-Yeon Cho, Hyun-Wook Jin, Min Lee, Karsten Schwan:
Dynamic core affinity for high-performance file upload on Hadoop Distributed File System. 722-737 - Peter Coetzee, Matthew Leeke, Stephen A. Jarvis:
Towards unified secure on- and off-line analytics at scale. 738-753 - Dominique LaSalle, George Karypis:
MPI for Big Data: New tricks for an old dog. 754-767 - Lan Vu, Gita Alaghband:
Novel parallel method for association rule mining on multi-core shared memory systems. 768-785
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.