Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3176364acmotherconferencesBook PagePublication PageshpcasiaConference Proceedingsconference-collections
HPCAsia '18 Workshops: Proceedings of Workshops of HPC Asia
ACM2018 Proceeding
Publisher:
  • Association for Computing Machinery
  • New York
  • NY
  • United States
Conference:
HPC Asia 2018 WS: Workshops of HPC Asia 2018 Chiyoda Tokyo 31 January 2018
ISBN:
978-1-4503-6347-1
Published:
31 January 2018
Sponsors:
IPSJ
In-Cooperation:

Reflects downloads up to 18 Nov 2024Bibliometrics
Skip Abstract Section
Abstract

High performance computing (HPC) is, today, a key technology to solve large problems in science, engineering, and business by utilizing tremendous computing power which has been evolving to the future. HPC Asia, which is an international conference series on HPC technologies in Asia Pacific region, and was held in the past several times in several countries in Asia regional sites to discuss the issues on high performance computing and to exchange information of research and development results in HPC.

Skip Table Of Content Section
WORKSHOP SESSION: Workshop on PGAS programming models: experiances and implementations
research-article
Open Access
Performance evaluation for a hydrodynamics application in XcalableACC PGAS language for accelerated clusters

Clusters equipped with accelerators such as GPUs and MICs are used widely. To use these clusters, programmers write programs for their applications by combining MPI with one of the accelerator programming models such as CUDA and OpenACC. The accelerator ...

research-article
Investigating the performance and productivity of DASH using the Cowichan problems

DASH is a new realization of the PGAS (Partitioned Global Address Space) programming model in the form of a C++ template library. Instead of using a custom compiler, DASH provides expressive programming constructs using C++ abstraction mechanisms and ...

research-article
Recent experiences in using MPI-3 RMA in the DASH PGAS runtime

The Partitioned Global Address Space (PGAS) programming model has become a viable alternative to traditional message passing using MPI. The DASH project provides a PGAS abstraction entirely based on C++11. The underlying DASH RunTime, DART, provides ...

research-article
Towards a parallel algebraic multigrid solver using PGAS

The Algebraic Multigrid (AMG) method has over the years developed into an efficient tool for solving unstructured linear systems. The need to solve large industrial problems discretized on unstructured meshes, has been a key motivation for devising a ...

research-article
Linkage of XcalableMP and Python languages for high productivity on HPC cluster system: application to graph order/degree problem

When developing applications on high-performance computing (HPC) cluster systems, Partitioned Global Address Space (PGAS) languages are used due to their high productivity and performance. However, in order to more efficiently develop such applications, ...

WORKSHOP SESSION: IXPUG workshop asia 2018
short-paper
Optimizing two-electron repulsion integral calculation on knights landing architecture

In this paper, we introduced some optimization methods we used to optimize two-electron repulsion integral calculation on Knights Landing architecture. We developed a schedule for parallelism and vectorization, and we compared two different methods for ...

research-article
Performance evaluation for omni XcalableMP compiler on many-core cluster system based on knights landing

To reduce the programming cost on a cluster system, Partitioned Global Address Space (PGAS) languages are used. We have designed an XcalableMP (XMP) PGAS language and developed the Omni XMP compiler (Omni compiler) for XMP. In the present study, we ...

short-paper
Open Access
Scaling collectives on large clusters using Intel(R) architecture processors and fabric

This paper provides results on scaling Barrier and Allreduce to 8192 nodes on a cluster of Intel® Xeon Phi™ processors installed at the University of Tokyo and the University of Tsukuba. We will describe the effects of OS and platform noise on the ...

short-paper
OpenMP-based parallel implementation of matrix-matrix multiplication on the intel knights landing

The second generation Intel Xeon Phi processor codenamed Knights Landing (KNL) have emerged with 2D tile mesh architecture. Implementing of the general matrix-matrix multiplication on a new architecture is an important practice. To date, there has not ...

short-paper
Multiple endpoints for improved MPI performance on a lattice QCD code

This paper provides results using multiple threads and a high-performance MPI implementation of MPI_THREAD_MULTIPLE applied to a Lattice QCD Code (CCS-QCD) and run on the Oakforest-PACS machine. Performance has improved from the baseline code by as much ...

short-paper
Optimizing a particle-in-cell code on Intel knights landing

The particle-in-cell (PIC) code is one of the mainstream algorithms in the laser plasma research area. However, the programming challenges to achieve high performance of PIC codes on the Intel Knights Landing (KNL) processor is widely concerned by global ...

Contributors
  • The University of Tokyo
Index terms have been assigned to the content through auto-classification.
Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Acceptance Rates

Overall Acceptance Rate 69 of 143 submissions, 48%
YearSubmittedAcceptedRate
HPCAsia '23341544%
HPCAsia '23 Workshops10990%
HPCAsia '19321547%
HPCAsia '18673045%
Overall1436948%