Proceedings of the 52nd International Conference on Parallel Processing Workshops

ICPP Workshops '23: Proceedings of the 52nd International Conference on Parallel Processing Workshops

August 2023

2023 Proceeding

Publisher:

Association for Computing Machinery
New York
NY
United States

Conference:

ICPP-W 2023: 52nd International Conference on Parallel Processing Workshops Salt Lake City UT USA August 7 - 10, 2023

ISBN:

979-8-4007-0842-8

Published:

07 September 2023

Recommend ACM DL

ALREADY A SUBSCRIBER?SIGN IN

Bibliometrics

Abstract

No abstract available.

Proceeding Downloads

PDFFront matter (Welcome Messages, Committees, Table of Contents)

Select All

Export Citations Save to Binder

SESSION: International Workshop on Applications of Wireless Ad hoc and Sensor Networks

research-article

Taillight Signal Recognition via Sequential Learning

Pages 1–7https://doi.org/10.1145/3605731.3605872

In autonomous driving, it is crucial to capture the driving intentions of other vehicles on the road, which can then be used for the autonomous driving vehicle to plan a safe route. This study proposes a system to identify the driving intention of other ...

research-article

Utility-Based Task Assignment for ON-based Mobile Crowdsourcing

Pages 8–14https://doi.org/10.1145/3605731.3605873

In this paper we are interested in opportunistic network-based (ON-based) mobile crowdsourcing (MCS), where a requester (called a server) assigns a set of tasks to a pool of workers, and the workers process the assigned tasks for payoff. The key to ...

research-article

Develop an AIoT Badminton Serving Machine

Pages 15–21https://doi.org/10.1145/3605731.3605882

An AIoT badminton serving machine refers to a shuttlecock serving machine with an IoT wireless interface that can support the integration with computer vision and intelligent computing to develop innovative applications to improve teaching and learning ...

research-article

Badminton Shot Event Detection and Feature Calculation from 3D Rally Video

Pages 22–29https://doi.org/10.1145/3605731.3605883

The technical performance of badminton players in games can be evaluated based on the performance indices of each shot. The most representative shot performance indices are ball speed, outgoing angle, and ball type usage. In this study, a stereoscopic ...

SESSION: The 3rd International Workshop on Deployment and Use of Accelerators

research-article

Public Access

Experience Deploying Graph Applications on GPUs with SYCL

Pages 30–39https://doi.org/10.1145/3605731.3605744

SYCL allows for deployment and use of accelerators across vendors’ platforms. In this work, we describe the experience of deploying graph analytics on vendors’ GPUs using SYCL. We contrast the CUDA and SYCL application programming interfaces by ...

research-article

Open Access

Evaluating Accelerators for a High-Throughput Hash-Based Security Protocol

Pages 40–49https://doi.org/10.1145/3605731.3605745

Security threats are rising due to widely available computational power and near-future quantum computers. New cryptographic protocols have been developed to address these challenges, but very few protocols take advantage of parallel computing. In this ...

research-article

Open Access

A Bucket-aware Asynchronous Single-Source Shortest Path Algorithm on GPU

Pages 50–60https://doi.org/10.1145/3605731.3605746

Single-Source Shortest Path (SSSP) algorithm is a common routine in graph processing and has been extensively studied on Graphics Processing Unit (GPU). Despite the powerful parallelism resources and high memory bandwidth provided by GPU, the ...

research-article

NFCache: Fine-grained and Flexible Offloading of Network Functions to Programmable Switches

Pages 61–70https://doi.org/10.1145/3605731.3605747

A Service Function Chain (SFC) consists of a sequence of Network Functions (NFs) in order, and plays an important role in network performance and security. In recent years, due to the low throughput and high latency of network functions in the context ...

SESSION: The 2023 International Workshop on Embedded Multicore Systems

research-article

Enhanced Memory Corruption Detection in C/C++ Programs

Pages 71–78https://doi.org/10.1145/3605731.3605903

Out-of-bound memory accesses, which often occur in programs written in unsafe languages such as C or C++, cause severe troubles. Though there are many useful tools aiming at this problem, we report a new tool, called mcds, for detecting spatial and ...

research-article

Enhancing LLVM Optimizations for Linear Recurrence Programs on RVV

Pages 79–87https://doi.org/10.1145/3605731.3605904

The RISC-V Vector Extension (RVV) has emerged as a promising vector architecture for high-performance computing. It enables parallel computing capability for RISC-V CPUs by introducing additional vector instructions and vector registers. To fully ...

research-article

Support of Sparse Tensor Computing for MLIR HLS

Pages 88–95https://doi.org/10.1145/3605731.3605908

Nowadays, sparse tensor computations are widely used in machine learning. Without the multiplications in zero values, sparse tensor computation can significantly reduce the latency and power consumption. Famous frameworks like TensorFlow, PyTorch, ...

research-article

Open Access

Pointer Analysis for Programs on Hybrid DRAM-PM Memory Systems

Pages 96–103https://doi.org/10.1145/3605731.3605906

With the development of Non-Volatile Memory (NVM) technology, the practicality of byte-addressable persistent memory (PM) has become increasingly mature. The hybrid DRAM-PM memory systems have added diversity to program design and execution. In this ...

research-article

Mapping-Free GPU Offloading in OpenMP Using Unified Memory

Pages 104–111https://doi.org/10.1145/3605731.3605907

With the increasing demand for heterogeneous computing, OpenMP has introduced an offloading feature that allows programmers to offload a task to a device (e.g., a GPU or an FPGA) by adding appropriate directives to the task since version 4.0. Compared ...

SESSION: The Third Workshop on LLVM in Parallel Processing

research-article

Maximizing Parallelism and GPU Utilization For Direct GPU Compilation Through Ensemble Execution

Pages 112–118https://doi.org/10.1145/3605731.3606016

GPUs are renowned for their exceptional computational acceleration capabilities achieved through massive parallelism. However, utilizing GPUs for computation requires manual identification of code regions suitable for offloading, data transfer ...

SESSION: The Sixteenth International Workshop on Parallel Programming Models and Systems Software for High-end Computing

research-article

Measuring Thread Timing to Assess the Feasibility of Early-bird Message Delivery

Pages 119–126https://doi.org/10.1145/3605731.3605884

Early-bird communication is a communication/computation overlap technique that combines fine-grained communication with partitioned communication to improve application run-time. Communication is divided among the compute threads such that each ...

research-article

Public Access

Codelet Pipe: Realization of Dataflow Software Pipelining for Extended Codelet Model

Pages 127–135https://doi.org/10.1145/3605731.3605885

Dataflow Software Pipelining for Codelet Model is a coarse-grained code-mapping scheme designed to exploit pipelined parallelism across Codelets executing on different cores. The extended operational semantics of the Codelet model exploit pipelined ...

research-article

Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation

Pages 136–144https://doi.org/10.1145/3605731.3605886

We evaluate AI-assisted generative capabilities on fundamental numerical kernels in high-performance computing (HPC), including AXPY, GEMV, GEMM, SpMV, Jacobi Stencil, and CG. We test the generated kernel codes for a variety of language-supported ...

research-article

Open Access

DFCPP Runtime Library for Dataflow Programming

Pages 145–152https://doi.org/10.1145/3605731.3605887

The Dataflow for C++(DFCPP) designed and implemented in this paper is a parallel programming library for dataflow computing on a general control flow hardware platform. Compared with existing dataflow programming libraries, DFCPP has an easy-to-use user ...

research-article

Design and Implementation of Data Flow Programming Language DFC2

Pages 153–161https://doi.org/10.1145/3605731.3605888

This article presents the design and development of a dataflow programming language called DFC2 (DataFlow C, version 2) based on the dataflow programming model. The DFC2 compiler is responsible for converting the DFC2 language into C++ language, which is ...

SESSION: The 3rd International Workshop on Parallel and Distributed Algorithms for Decision Sciences

research-article

Public Access

Combining a Parallel Branch-and-Bound Algorithm with a Strong Heuristic to Solve the Sequential Ordering Problem

Pages 162–166https://doi.org/10.1145/3605731.3608929

In this paper, we describe how to combine a parallel branch-and-bound (B&B) algorithm and a strong heuristic to solve the Sequential Ordering Problem (SOP), which is an NP-hard optimization problem. A parallel B&B algorithm is run in parallel with the ...

research-article

Public Access

A GPU-Accelerated Population Generation, Sorting, and Mutation Kernel for an Optimization-Based Causal Inference Model

Pages 167–171https://doi.org/10.1145/3605731.3608930

We develop a GPU-accelerated machine learning generative adversarial network model that can be used with observational data for the purpose of constructing causal inferences. The theoretical basis of our machine learning model is novel and is ...

research-article

Multiobjective Hyperparameter Optimization for Deep Learning Interatomic Potential Training Using NSGA-II

Pages 172–179https://doi.org/10.1145/3605731.3608931

Deep neural network (DNN) potentials are an emerging tool for simulation of dynamical atomistic systems, with the promise of quantum mechanical accuracy at speedups of 10000 ×. As with other DNN methods, hyperparameters used during training can make a ...

research-article

Polar Representation of 2D Image Using Complex Exponential Spiking Neuron Network

Lei Zhang

Pages 180–186https://doi.org/10.1145/3605731.3608932

The paper introduces an innovative hybrid encoding method for images. It proposes a conversion process where the image is transformed from the conventional Cartesian coordinates representation (x and y) to a polar coordinates representation using ...

research-article

A new paradigm for forest fire spread prediction: Faster decisions at high resolution

Pages 187–191https://doi.org/10.1145/3605731.3608933

Climate change has led to a significant increase in the number of wildfire events and their severity. To mitigate their impact, it is necessary to be able to make quick decisions according to the fire behavior. In order to assist on these decisions, we ...

SESSION: The First International Workshop on Scalable Algorithms for Network Dynamics

short-paper

Open Access

Coordinated Botnet Detection in Social Networks via Clustering Analysis

Pages 192–196https://doi.org/10.1145/3605731.3608959

Graphs are a widely used tool in modeling social interaction networks. In a network that consists of authors and pages with time-stamped interactions between one page and one author, we can model the network as a bipartite temporal graph. These graphs ...

Index Terms

Proceedings of the 52nd International Conference on Parallel Processing Workshops

Index terms have been assigned to the content through auto-classification.

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Recommendations

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Year	Submitted	Accepted	Rate
ICPP '18	313	91	29%
Overall	313	91	29%

ICPP

Sections

Proceeding Downloads

Taillight Signal Recognition via Sequential Learning

Utility-Based Task Assignment for ON-based Mobile Crowdsourcing

Develop an AIoT Badminton Serving Machine

Badminton Shot Event Detection and Feature Calculation from 3D Rally Video

Experience Deploying Graph Applications on GPUs with SYCL

Evaluating Accelerators for a High-Throughput Hash-Based Security Protocol

A Bucket-aware Asynchronous Single-Source Shortest Path Algorithm on GPU

NFCache: Fine-grained and Flexible Offloading of Network Functions to Programmable Switches

Enhanced Memory Corruption Detection in C/C++ Programs

Enhancing LLVM Optimizations for Linear Recurrence Programs on RVV

Support of Sparse Tensor Computing for MLIR HLS

Pointer Analysis for Programs on Hybrid DRAM-PM Memory Systems

Mapping-Free GPU Offloading in OpenMP Using Unified Memory

Maximizing Parallelism and GPU Utilization For Direct GPU Compilation Through Ensemble Execution

Measuring Thread Timing to Assess the Feasibility of Early-bird Message Delivery

Codelet Pipe: Realization of Dataflow Software Pipelining for Extended Codelet Model

Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation

DFCPP Runtime Library for Dataflow Programming

Design and Implementation of Data Flow Programming Language DFC2

Combining a Parallel Branch-and-Bound Algorithm with a Strong Heuristic to Solve the Sequential Ordering Problem

A GPU-Accelerated Population Generation, Sorting, and Mutation Kernel for an Optimization-Based Causal Inference Model

Multiobjective Hyperparameter Optimization for Deep Learning Interatomic Potential Training Using NSGA-II

Polar Representation of 2D Image Using Complex Exponential Spiking Neuron Network

A new paradigm for forest fire spread prediction: Faster decisions at high resolution

Coordinated Botnet Detection in Social Networks via Clustering Analysis

Index Terms

UbiMob '05: Proceedings of the 2nd French-speaking conference on Mobility and ubiquity computing

UbiMob '08: Proceedings of the 4th French-speaking conference on Mobility and ubiquity computing

IHM '09: Proceedings of the 21st International Conference on Association Francophone d'Interaction Homme-Machine

Acceptance Rates