Proceedings of the 2015 IEEE 22nd International Conference on High Performance Computing (HiPC)

Article

[Front cover]

Page C4https://doi.org/10.1109/HiPC.2015.63

Presents the front cover or splash screen of the proceedings record.

Article

[Title page i]

Page ihttps://doi.org/10.1109/HiPC.2015.1

Presents the title page of the proceedings record.

Article

[Title page iii]

Page iiihttps://doi.org/10.1109/HiPC.2015.2

Presents the title page of the proceedings record.

Article

[Copyright notice]

Page ivhttps://doi.org/10.1109/HiPC.2015.3

Article

Message from the General Co-chairs and Vice-General Co-chairs

Pages x–xihttps://doi.org/10.1109/HiPC.2015.4

Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.

Article

Message from the Program Chair

Page xiihttps://doi.org/10.1109/HiPC.2015.5

Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.

Article

Message from the Steering Chair

Page xiiihttps://doi.org/10.1109/HiPC.2015.62

Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.

Article

HiPC 2015 Committees

Pages xiv–xviiihttps://doi.org/10.1109/HiPC.2015.6

Provides a listing of current committee members and society officers.

Article

HiPC 2015 Technical Program

Page xixhttps://doi.org/10.1109/HiPC.2015.7

Provides a schedule of conference events and a listing of which papers were presented in each session.

Article

Scale-out Beyond Map-Reduce

Raghu Ramakrishnan

Page 1https://doi.org/10.1109/HiPC.2015.59

Until recently, data was gathered for well-defined objectives such as auditing, forensics, reporting and line-ofbusiness operations; now, exploratory and predictive analysis is becoming ubiquitous, and the default increasingly is to capture and store ...

Article

Which Verification for Soft Error Detection?

Pages 2–11https://doi.org/10.1109/HiPC.2015.26

Many methods are available to detect silent errors in high-performance computing (HPC) applications. Each comes with a given cost and recall (fraction of all errors that are actually detected). The main contribution of this paper is to characterize the ...

Article

Throughput Regulation in Shared Memory Multicore Processors

Pages 12–20https://doi.org/10.1109/HiPC.2015.33

Performance scaling is now synonymous with scaling the number of cores. One of the consequences of this shift is the increasing difficulty of designing processors with predictable and controllable performance. To address this challenge this paper ...

Article

Application Taxonomy via Algorithmic Commonality for Domain-Specific Architecture Desgin

Pages 21–29https://doi.org/10.1109/HiPC.2015.35

In this paper, we propose an approach of application taxonomy from a perspective of algorithmic commonality. The taxonomy exploits algorithm-inherent characterization to imply a categorization of domain-specific architecture in the initial phase of ...

Article

FlexCore: A Reconfigurable Processor Supporting Flexible, Dynamic Morphing

Pages 30–39https://doi.org/10.1109/HiPC.2015.37

In the realm of desktop and server class processors, the prevailing trend is to use out-of-order superscalar cores that exploit the hidden instruction-level parallelism in a program. In superscalar designs, the performance (as measured by the IPC, ...

Article

High Efficiency Generalized Parallel Counters for Xilinx FPGAs

Pages 40–46https://doi.org/10.1109/HiPC.2015.41

Generalized Parallel Counters (GPCs) are frequently used in constructing high speed compressor trees. Prior work on GPC synthesis using FPGAs has focused on utilizing the fast carry chain and mapping the logic onto LUTs. This mapping is not optimal in ...

Article

2QW-Clock: An Efficient SSD Buffer Management Algorithm

Pages 47–53https://doi.org/10.1109/HiPC.2015.21

Modern solid state disk (SSD) has a buffer (SDRAM), which is used to store commonly used data and map in the near future. How to efficient management of this buffer is an important things of improving performance of SSD. Flash read and write speed have ...

Article

Task-Based Multifrontal QR Solver for GPU-Accelerated Multicore Architectures

Pages 54–63https://doi.org/10.1109/HiPC.2015.27

Recent studies have shown the potential of task-based programming paradigms for implementing robust, scalable sparse direct solvers for modern computing platforms. Yet, designing task flows that efficiently exploit heterogeneous architectures remains ...

Article

Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices

Pages 64–74https://doi.org/10.1109/HiPC.2015.55

Sparse matrix vector multiplication (SpMV) is an important linear algebra primitive. Recent research has focused on improving the performance of SpMV on GPUs when using compressed sparse row (CSR), the most frequently used matrix storage format on CPUs. ...

Article

On the Resilience of Parallel Sparse Hybrid Solvers

Pages 75–84https://doi.org/10.1109/HiPC.2015.9

As the computational power of high performance computing (HPC) systems continues to increase by using a huge number of CPU cores or specialized processing units, extreme-scale applications are increasingly prone to faults. Consequently, the HPC ...

Article

New Tridiagonal Systems Solvers on GPU Architectures

Pages 85–94https://doi.org/10.1109/HiPC.2015.17

Modern GPUs (Graphics Processing Units) offer very high computing power at relatively low cost. Nevertheless, designing efficient algorithms for the GPUs usually requires additional time and effort, even for experienced programmers. On the other hand, ...

Article

A Stable Parallel Algorithm for Diagonally Dominant Tridiagonal Linear Systems

Pages 95–104https://doi.org/10.1109/HiPC.2015.31

In this work, we present a stable parallel algorithm based on WZ factorization for solving diagonally dominant tridiagonal linear system of algebraic equations, using divide and conquer approach. Existence results are given and the backward error ...

Article

Optimizing Approximate Weighted Matching on Nvidia Kepler K40

Pages 105–114https://doi.org/10.1109/HiPC.2015.15

Matching is a fundamental graph problem with numerous applications in science and engineering. While algorithms for computing optimal matchings are difficult to parallelize, approximation algorithms on the other hand generally compute high quality ...

Article

Improving Communication Throughput by Multipath Load Balancing on Blue Gene/Q

Pages 115–124https://doi.org/10.1109/HiPC.2015.44

Achievable networking performance of applications in a supercomputer depends on the exact combination of the communication patterns of the applications and the routing algorithms used by the supercomputer. In order to achieve the highest networking ...

Article

Dynamic Adaptation for Elastic System Services Using Virtual Servers

Pages 125–134https://doi.org/10.1109/HiPC.2015.46

A vast majority of legacy runtime systems and middleware prevalent in cluster and supercomputing environments are static in nature. Due to the rising scale and complexity of high-performance computing systems, the static nature of systems software would ...

Article

Understanding the Performance Benefit of Asynchronous Data Transfers in OpenCL Programs Executing on Media Processors

Pages 135–144https://doi.org/10.1109/HiPC.2015.14

In this work, we study the performance benefits of using asynchronous data transfers in OpenCL programs executing on media processors. Asynchronous data transfers are typically implemented by use of Direct Memory Access (DMA) engines that can be ...

Article

Hardware-Transactional-Memory Based Speculative Parallel Discrete Event Simulation of Very Fine Grain Models

Pages 145–154https://doi.org/10.1109/HiPC.2015.45

This article presents an innovative runtime support for speculative parallel processing of discrete event simulation models on multi-core architectures, which exploits Hardware-Transactional-Memory (HTM) facilities for the purpose of state ...

Article

Towards Practical Page Placement for a Green Memory Manager

Pages 155–164https://doi.org/10.1109/HiPC.2015.42

Increased performance demand of modern applications has resulted in large memory modules and higher performance processors in computing systems. Power consumption becomes an important aspect when these resources go underutilized in a running system, ...

Article

Efficient Barrier Implementation on the POWER8 Processor

Pages 165–173https://doi.org/10.1109/HiPC.2015.51

POWER8 is a new generation of POWER processor capable of 8-way simultaneous multi-threading per core. High-performance computing capabilities, such as high amount of instruction-level and thread level parallelism, are integrated with a deep memory ...

Article

Compilers and the Furture of High Performance Computing

David Padua

Page 174https://doi.org/10.1109/HiPC.2015.60

Compiler technology has enabled the software advances of the last sixty years. It has given us machine-independent programming and improved productivity by automatically handling a number of issues, such as instruction selection and register allocation. ...

Article

On Accelerating Concurrent PCA Computations for Financial Risk Applications

Pages 175–184https://doi.org/10.1109/HiPC.2015.22

Principal component analysis (PCA) is a widely used mathematical technique for dimensionality reduction that works by identifying a smaller number of linearly uncorrelated variables (principal components) to explain the variation found in a data set. ...

Browse Proceedings

Sections

[Front cover]

[Title page i]

[Title page iii]

[Copyright notice]

Message from the General Co-chairs and Vice-General Co-chairs

Message from the Program Chair

Message from the Steering Chair

HiPC 2015 Committees

HiPC 2015 Technical Program

Scale-out Beyond Map-Reduce

Which Verification for Soft Error Detection?

Throughput Regulation in Shared Memory Multicore Processors

Application Taxonomy via Algorithmic Commonality for Domain-Specific Architecture Desgin

FlexCore: A Reconfigurable Processor Supporting Flexible, Dynamic Morphing

High Efficiency Generalized Parallel Counters for Xilinx FPGAs

2QW-Clock: An Efficient SSD Buffer Management Algorithm

Task-Based Multifrontal QR Solver for GPU-Accelerated Multicore Architectures

Structural Agnostic SpMV: Adapting CSR-Adaptive for Irregular Matrices

On the Resilience of Parallel Sparse Hybrid Solvers

New Tridiagonal Systems Solvers on GPU Architectures

A Stable Parallel Algorithm for Diagonally Dominant Tridiagonal Linear Systems

Optimizing Approximate Weighted Matching on Nvidia Kepler K40

Improving Communication Throughput by Multipath Load Balancing on Blue Gene/Q

Dynamic Adaptation for Elastic System Services Using Virtual Servers

Understanding the Performance Benefit of Asynchronous Data Transfers in OpenCL Programs Executing on Media Processors

Hardware-Transactional-Memory Based Speculative Parallel Discrete Event Simulation of Very Fine Grain Models

Towards Practical Page Placement for a Green Memory Manager

Efficient Barrier Implementation on the POWER8 Processor

Compilers and the Furture of High Performance Computing

On Accelerating Concurrent PCA Computations for Financial Risk Applications

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing

UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing

UbiMob '09: Proceedings of the 5th French-Speaking Conference on Mobility and Ubiquity Computing

Save to Binder

Sections

Save to Binder

Recommendations

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing

UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing

UbiMob '09: Proceedings of the 5th French-Speaking Conference on Mobility and Ubiquity Computing