Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleSeptember 2024
An Evaluation Framework for Dynamic Thermal Management Strategies in 3D MultiProcessor System-on-Chip Co-Design
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 11Pages 2161–2176https://doi.org/10.1109/TPDS.2024.3459414Dynamic thermal management (DTM) has been widely adopted to improve the energy efficiency, reliability, and performance of modern Multi-Processor SoCs (MPSoCs). However, the evolving industry trends and heterogeneous architecture designs have introduced ...
- research-articleJuly 2024
Long-Range MD Electrostatics Force Computation on FPGAs
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 10Pages 1690–1707https://doi.org/10.1109/TPDS.2024.3434347Strong scaling of long-range electrostatic force computation, which is a central concern of long timescale molecular dynamics simulations, is challenging for CPUs and GPUs due to its complex communication structure and global communication requirements. ...
- research-articleNovember 2023
A High-Performance Genomic Accelerator for Accurate Sequence-to-Graph Alignment Using Dynamic Programming Algorithm
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 2Pages 237–249https://doi.org/10.1109/TPDS.2023.3325137The rapid mutation of viruses, such as SARS-CoV-2, highlights the urgent need for fast and precise genomic sequencing. The traditional sequencing technique maps the DNA fragments collected from an individual to a known linear reference genome sequence. ...
- research-articleOctober 2023
Redesign and Accelerate the AIREBO Bond-Order Potential on the New Sunway Supercomputer
- Ping Gao,
- Xiaohui Duan,
- Bertil Schmidt,
- Wubing Wan,
- Jiaxu Guo,
- Wusheng Zhang,
- Lin Gan,
- Haohuan Fu,
- Wei Xue,
- Weiguo Liu,
- Guangwen Yang
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 34, Issue 12Pages 3117–3132https://doi.org/10.1109/TPDS.2023.3321927Molecular dynamics (MD) is one of the most crucial computer simulation methods for understanding real-world processes at the atomic level. Reactive potentials based on the bond order concept have the ability to model dynamic bond breaking and formation ...
- research-articleDecember 2022
Increasing the Efficiency of Massively Parallel Sparse Matrix-Matrix Multiplication in First-Principles Calculation on the New-Generation Sunway Supercomputer
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 12Pages 4752–4766https://doi.org/10.1109/TPDS.2022.3202518The first-principles approach based on density-functional theory (DFT)/density-functional perturbation theory (DFPT) is widely used in calculations of the systems’ ground state energy, response properties (e.g., polarizability, phonon dispersions) ...
-
- research-articleDecember 2022
Redesigning and Optimizing UCSF DOCK3.7 on Sunway TaihuLight
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 12Pages 4458–4471https://doi.org/10.1109/TPDS.2022.3194916Molecular docking is the process of posing, scoring, and ranking small molecules at the binding sites of proteins to prioritize compounds for experimental testing. It is a widely-used computational method in the drug discovery process. However, it is a ...
- research-articleDecember 2022
Enabling Large Scale Simulations for Particle Accelerators
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 12Pages 4425–4439https://doi.org/10.1109/TPDS.2022.3192707International high-energy particle physics research centers, like CERN and Fermilab, require excessive studies and simulations to plan for the upcoming upgrades of the world's largest particle accelerators, and the design of future machines given ...
- research-articleAugust 2022
A GPU-Oriented Application Programming Interface for Digital Audio Workstations
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 8Pages 1924–1938https://doi.org/10.1109/TPDS.2021.3131659A Digital Audio Workstation (DAW) is a hardware and/or software device aiming to ease those operations required for music production, such as arranging, recording, editing, mixing, and, more in general, modifying sounds creatively. A peculiarity of a DAW ...
- research-articleJuly 2022
Hamiltonian Paths of <inline-formula><tex-math notation="LaTeX">$k$</tex-math><alternatives><mml:math><mml:mi>k</mml:mi></mml:math><inline-graphic xlink:href="yang-ieq1-3126254.gif"/></alternatives></inline-formula>-ary <inline-formula><tex-math notation="LaTeX">$n$</tex-math><alternatives><mml:math><mml:mi>n</mml:mi></mml:math><inline-graphic xlink:href="yang-ieq2-3126254.gif"/></alternatives></inline-formula>-cubes Avoiding Faulty Links and Passing Through Prescribed Linear Forests
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 7Pages 1752–1760https://doi.org/10.1109/TPDS.2021.3126254The <inline-formula><tex-math notation="LaTeX">$k$</tex-math><alternatives><mml:math><mml:mi>k</mml:mi></mml:math><inline-graphic xlink:href="yang-ieq3-3126254.gif"/></alternatives></inline-formula>-ary <inline-formula><tex-math notation="LaTeX">$n$</tex-...
- research-articleMay 2022
Addictive Incentive Mechanism in Crowdsensing From the Perspective of Behavioral Economics
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 5Pages 1109–1127https://doi.org/10.1109/TPDS.2021.3104247In mobile crowdsensing, many mobile devices are collectively used to complete complex sensing tasks. Most tasks require users to consume resources to ensure continuous performance over multiple periods of time. Therefore, it is important to incentivize ...
- research-articleFebruary 2022
Protein Structured Reservoir Computing for Spike-Based Pattern Recognition
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 2Pages 322–331https://doi.org/10.1109/TPDS.2021.3068826Nowadays we witness a miniaturisation trend in the semiconductor industry backed up by groundbreaking discoveries and designs in nanoscale characterisation and fabrication. To facilitate the trend and produce ever smaller, faster and cheaper computing ...
- research-articleFebruary 2022
Inferring the Dynamics of the State Evolution During Quantum Annealing
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 33, Issue 2Pages 310–321https://doi.org/10.1109/TPDS.2020.3044846To solve an optimization problem using a commercial quantum annealer, one has to represent the problem of interest as an Ising or a quadratic unconstrained binary optimization (QUBO) problem and submit its coefficients to the annealer, which then returns ...
- research-articleMay 2021
PaKman: A Scalable Algorithm for Generating Genomic Contigs on Distributed Memory Machines
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 32, Issue 5Pages 1191–1209https://doi.org/10.1109/TPDS.2020.3043241<italic>De novo</italic> genome assembly is a fundamental problem in the field of bioinformatics, that aims to assemble the DNA sequence of an unknown genome from numerous short DNA fragments (aka reads) obtained from it. With the advent of high-...
- research-articleDecember 2020
Congestion-Balanced and Welfare-Maximized Charging Strategies for Electric Vehicles
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 31, Issue 12Pages 2882–2895https://doi.org/10.1109/TPDS.2020.3003270With the increase of the number of electric vehicles (EVs), it is of vital importance to develop the efficient and effective charging scheduling schemes for all the EVs. In this article, we aim to maximize the social welfare of all the EVs, charging ...
- research-articleNovember 2020
Correlation of Performance Optimizations and Energy Consumption for Stencil-Based Application on Intel Xeon Scalable Processors
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 31, Issue 11Pages 2582–2593https://doi.org/10.1109/TPDS.2020.2996314This article provides a comprehensive study of the impact of performance optimizations on the energy efficiency of a real-world CFD application called MPDATA, as well as an insightful analysis of performance-energy interaction of these optimizations with ...
- research-articleJanuary 2020
Quantum Game Analysis on Extrinsic Incentive Mechanisms for P2P Services
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 31, Issue 1Pages 159–170https://doi.org/10.1109/TPDS.2019.2933416Peer-to-peer (P2P) services such as mobile P2P transmissions and resource sharing, provide efficient methods to deliver data without the deployment of any central server. Nevertheless, the <italic>free-riding</italic> phenomenon inherit in such services ...
- research-articleNovember 2019
Accelerating Atmospheric Chemical Kinetics for Climate Simulations
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 30, Issue 11Pages 2396–2407https://doi.org/10.1109/TPDS.2019.2918798The study of atmospheric chemistry-climate interactions is one of today's great computational challenges. Advances in the architecture of Graphics Processing Units (GPUs) in both raw computational power and memory bandwidth sparked the interest for ...
- research-articleSeptember 2019
A Performance Model for GPU Architectures that Considers On-Chip Resources: Application to Medical Image Registration
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 30, Issue 9Pages 1947–1961https://doi.org/10.1109/TPDS.2019.2905213Graphics processing units (GPUs) have become extremely important devices for accelerating computing performance in many applications. However, there have been few accurate models to estimate the performance of such applications running on modern GPUs. In ...
- research-articleNovember 2018
mSNP: A Massively Parallel Algorithm for Large-Scale SNP Detection
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 29, Issue 11Pages 2557–2567https://doi.org/10.1109/TPDS.2018.2839578Single Nucleotide Polymorphism (SNP) detection is a fundamental procedure of whole genome analysis. SOAPsnp, a classic tool for detection, would take more than one week to analyze one typical human genome, which limits the efficiency of downstream ...