default search action
46th ICPP 2017: Bristol, UK
- 46th International Conference on Parallel Processing, ICPP 2017, Bristol, United Kingdom, August 14-17, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-1042-8
Highlighted Papers (S1-T1)
- Ivy Bo Peng, Roberto Gioiosa, Gokcen Kestor, Erwin Laure, Stefano Markidis:
Preparing HPC Applications for the Exascale Era: A Decoupling Strategy. 1-10 - Guojing Cong, Onkar Bhardwaj, Minwei Feng:
An Efficient, Distributed Stochastic Gradient Descent Algorithm for Deep-Learning Applications. 11-20 - Yingrui Wang, Leisheng Li, Rong Tian:
Large-Scale Parallelization of Smoothed Particle Hydrodynamics Method on Heterogeneous Cluster. 21-30
Graph Analytics and ML (S2-T1)
- Erik Vermij, Leandro Fiorin, Christoph Hagleitner, Koen Bertels:
Boosting the Efficiency of HPCG and Graph500 with Near-Data Processing. 31-40 - Han Dong, Tao Li, Jiabing Leng, Lingyan Kong, Gang Bai:
GCN: GPU-Based Cube CNN Framework for Hyperspectral Image Classification. 41-49 - Mallipeddi Hardhik, Dip Sankar Banerjee, Kiran Raj Ramamoorthy, Kishore Kothapalli, Kannan Srinathan:
Nearly Balanced Work Partitioning for Heterogeneous Algorithms. 50-59
Enhancing Programming Runtime Systems (S2-T2)
- Adrián Castelló, Sangmin Seo, Rafael Mayo, Pavan Balaji, Enrique S. Quintana-Ortí, Antonio J. Peña:
GLTO: On the Adequacy of Lightweight Thread Approaches for OpenMP Implementations. 60-69 - Jordyn Maglalang, Sriram Krishnamoorthy, Kunal Agrawal:
Locality-Aware Dynamic Task Graph Scheduling. 70-80 - Tingzhe Zhou, Pantea Zardoshti, Michael F. Spear:
Practical Experience with Transactional Lock Elision. 81-90
Linear Algebra Algorithms (S2-T3)
- Hartwig Anzt, Jack J. Dongarra, Goran Flegar, Enrique S. Quintana-Ortí:
Variable-Size Batched LU for Small Matrices and Its Integration into Block-Jacobi Preconditioning. 91-100 - Yusuke Nagasaka, Akira Nukada, Satoshi Matsuoka:
High-Performance and Memory-Saving Sparse General Matrix-Matrix Multiplication for NVIDIA Pascal GPU. 101-110 - Shaden Smith, Alec Beri, George Karypis:
Constrained Tensor Factorization with Accelerated AO-ADMM. 111-120
Data and Networks (S3-T1)
- Victor Garcia-Flores, Eduard Ayguadé, Antonio J. Peña:
Efficient Data Sharing on Heterogeneous Systems. 121-130 - Vikram K. Narayana, Shuai Sun, Armin Mehrabian, Volker J. Sorger, Tarek A. El-Ghazawi:
HyPPI NoC: Bringing Hybrid Plasmonics to an Opto-Electronic Network-on-Chip. 131-140 - Xiaokang Hu, Wang Zhang, Jian Li, Ruhui Ma, Feng Wu, Haibing Guan:
ES2: Aiming at an Optimal Virtual I/O Event Path. 141-150
GPU & Runtime Systems (S3-T2)
- Akshay Venkatesh, Khaled Hamidouche, Sreeram Potluri, Davide Rossetti, Ching-Hsiang Chu, Dhabaleswar K. Panda:
MPI-GDS: High Performance MPI Designs with GPUDirect-aSync for CPU-GPU Control Flow Decoupling. 151-160 - Ching-Hsiang Chu, Xiaoyi Lu, Ammar Ahmad Awan, Hari Subramoni, Jahanzeb Maqbool Hashmi, Bracy Elton, Dhabaleswar K. Panda:
Efficient and Scalable Multi-Source Streaming Broadcast on GPU Clusters for Deep Learning. 161-170 - Burak Bastem, Didem Unat, Weiqun Zhang, Ann S. Almgren, John Shalf:
Overlapping Data Transfers with Computation on GPU with Tiles. 171-180
Graphs and Networks (S3-T3)
- Jiawen Sun, Hans Vandierendonck, Dimitrios S. Nikolopoulos:
Accelerating Graph Analytics by Utilising the Memory Locality of Graph Partitioning. 181-190 - Hari Sundar, Parmeshwar Khurd:
Parallel Algorithms for the Computation of Cycles in Relative Neighborhood Graphs. 191-200 - Minho Bae, Junho Eum, Donghoon Kim, Sangyoon Oh:
High Performance Query Processing for Web Scale RDF Data using BSP Style Communication and Balanced Distribution. 201-210
Storage (S4-T1)
- Xiaoyang Qu, Jiguang Wan, Fengguang Song, Xiaozhao Zhuang, Fei Wu, Changsheng Xie:
OptiMatch: Enabling an Optimal Match between Green Power and Various Workloads for Renewable-Energy Powered Storage Systems. 211-220 - Luyu Li, Houxiang Ji, Chentao Wu, Jie Li, Minyi Guo:
Favorable Block First: A Comprehensive Cache Scheme to Accelerate Partial Stripe Recovery of Triple Disk Failure Tolerant Arrays. 221-230 - Yanwen Xie, Dan Feng, Fang Wang:
Non-Sequential Striping for Distributed Storage Systems with Different Redundancy Schemes. 231-240
IO & Cloud (S4-T2)
- Yi Su, Dan Feng, Yu Hua, Zhan Shi:
Predicting Response Latency Percentiles for Cloud Object Storage Systems. 241-250 - Mehmet Fatih Aktas, Javier Diaz Montes, Ivan Rodero, Manish Parashar:
WA-Dataspaces: Exploring the Data Staging Abstractions for Wide-Area Distributed Scientific Workflows. 251-260 - Matthew Curtis-Maury, Ram Kesavan, Mrinal K. Bhattacharjee:
Scalable Write Allocation in the WAFL File System. 261-270
Numerical Applications (S4-T3)
- Minyoung Jung, Jinwoo Park, Johann Blieberger, Bernd Burgstaller:
Parallel Construction of Simultaneous Deterministic Finite Automata on Shared-Memory Multicores. 271-281 - Sudip K. Seal, Mark R. Cianciosa, Steven P. Hirshman, Andreas Wingen, Robert S. Wilcox, Ezekial A. Unterberg:
Parallel Reconstruction of Three Dimensional Magnetohydrodynamic Equilibria in Plasma Confinement Devices. 282-291 - Athena Elafrou, Georgios I. Goumas, Nectarios Koziris:
Performance Analysis and Optimization of Sparse Matrix-Vector Multiplication on Modern Multi- and Many-Core Processors. 292-301
Networks (S5-T1)
- Lei Yang, Jiannong Cao, Zhenyu Wang, Weigang Wu:
Network Aware Multi-User Computation Partitioning in Mobile Edge Clouds. 302-311 - Chenxi Qiu, Haiying Shen:
Fading-Resistant Link Scheduling in Wireless Networks. 312-321 - Ryota Yasudo, Michihiro Koibuchi, Koji Nakano, Hiroki Matsutani, Hideharu Amano:
Order/Radix Problem: Towards Low End-to-End Latency Interconnection Networks. 322-331
Cloud Scheduling (S5-T2)
- MohammadReza HoseinyFarahabady, Javid Taheri, Zahir Tari, Albert Y. Zomaya:
A Dynamic Resource Controller for a Lambda Architecture. 332-341 - Sunimal Rathnayake, Dumitrel Loghin, Yong Meng Teo:
CELIA: Cost-Time Performance of Elastic Applications on Cloud. 342-351 - Hervé Yviquel, Guido Araujo:
The Cloud as an OpenMP Offloading Device. 352-361
GPU Applications (S5-T3)
- Takumi Honda, Shinnosuke Yamamoto, Hiroaki Honda, Koji Nakano, Yasuaki Ito:
Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations. 362-371 - Kubilay Atasu, Thomas P. Parnell, Celestine Dünner, Michail Vlachos, Haralampos Pozidis:
High-Performance Recommender System Training Using Co-Clustering on CPU/GPU Clusters. 372-381 - Govert G. Brinkmann, Kristian F. D. Rietveld, Frank W. Takes:
Exploiting GPUs for Fast Force-Directed Visualization of Large-Scale Networks. 382-391
Data and IO (S6-T1)
- Long Cheng, Ying Wang, Yulong Pei, Dick H. J. Epema:
A Coflow-Based Co-Optimization Framework for High-Performance Data Analytics. 392-401 - Zhipeng Li, Yinlong Xu, Yongkun Li, Chengjin Tian, Youhui Bai:
PDS: An I/O-Efficient Scaling Scheme for Parity Declustered Data Layout. 402-411 - Yang Wang, Shuibing He, Xiaopeng Fan, Chengzhong Xu, Joseph C. Culberson, Joseph Horton:
Data Caching in Next Generation Mobile Cloud Services, Online vs. Off-Line. 412-421
Computation Optimization (S6-T2)
- Lijuan Jiang, Chao Yang, Yulong Ao, Wanwang Yin, Wenjing Ma, Qiao Sun, Fangfang Liu, Rongfen Lin, Peng Zhang:
Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor. 422-431 - James Lin, Zhigeng Xu, Akira Nukada, Naoya Maruyama, Satoshi Matsuoka:
Optimizations of Two Compute-Bound Scientific Kernels on the SW26010 Many-Core Processor. 432-441 - Shixiong Xu, David Gregg:
Bitslice Vectors: A Software Approach to Customizable Data Precision on Processors with SIMD Extensions. 442-451
Data Analytics (S6-T3)
- Yang You, James Demmel:
Runtime Data Layout Scheduling for Machine Learning Dataset. 452-461 - Kamesh Arumugam, Desh Ranjan, Mohammad Zubair, Balsa Terzic, Alexander N. Godunov, Tunazzina Islam:
A Machine Learning Approach for Efficient Parallel Simulation of Beam Dynamics on GPUs. 462-471 - Charalampos Stylianopoulos, Magnus Almgren, Olaf Landsiedel, Marina Papatriantafilou:
Multiple Pattern Matching for Network Security Applications: Acceleration through Vectorization. 472-482
Graph Algorithms (S7-T1)
- Erik Saule, Dinesh Panchananam, Alexander Hohl, Wenwu Tang, Eric Delmelle:
Parallel Space-Time Kernel Density Estimation. 483-492 - Peng Ni, Masatoshi Hanai, Wen Jun Tan, Chen Wang, Wentong Cai:
Parallel Algorithm for Single-Source Earliest-Arrival Problem in Temporal Graphs. 493-502 - Mustafa Kemal Tas, Kamer Kaya, Erik Saule:
Greed Is Good: Parallel Algorithms for Bipartite-Graph Partial Coloring on Multicore Architectures. 503-512
Performance & Power Tuning for Heterogeneous Platforms (S7-T2)
- Isuru Dilanka Fernando, Sanath Jayasena, Milinda Fernando, Hari Sundar:
A Scalable Hierarchical Semi-Separable Library for Heterogeneous Clusters. 513-522 - Robert V. Lim, Boyana Norris, Allen D. Malony:
Autotuning GPU Kernels via Static and Predictive Analysis. 523-532 - Aniket Chakrabarti, Srinivasan Parthasarathy, Christopher Stewart:
A Pareto Framework for Data Analytics on Heterogeneous Systems: Implications for Green Energy Usage and Performance. 533-542
Various Parallel Algorithms (S8-T1)
- Ayham Kassab, Jean-Marc Nicod, Laurent Philippe, Veronika Rehn-Sonigo:
Scheduling Independent Tasks in Parallel under Power Constraints. 543-552 - Eduardo Moscoso Rubino, Alberto Jose Alvares, Raúl Marín Prades, Pedro Sanz Valero:
A Novel Minimum Time Parallel 2-D Discrete Wavelet Transform Algorithm for General Purpose Processors. 553-562 - Harshvardhan Das, Subodh Kumar:
A Parallel TSP-Based Algorithm for Balanced Graph Partitioning. 563-570
Resilience & Power Aware Scheduling (S8-T2)
- Xunyun Liu, Aaron Harwood, Shanika Karunasekera, Benjamin I. P. Rubinstein, Rajkumar Buyya:
E-Storm: Replication-Based State Management in Distributed Stream Processing Systems. 571-580 - Aiman Fang, Aurélien Cavelan, Yves Robert, Andrew A. Chien:
Resilience for Stencil Computations with Latent Errors. 581-590 - Rong Ge, Pengfei Zou, Xizhou Feng:
Application-Aware Power Coordination on Power Bounded NUMA Multicore Systems. 591-600
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.