Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleJuly 2024
Parallel implementation of discrete cosine transform and its inverse for image compression applications
The Journal of Supercomputing (JSCO), Volume 80, Issue 16Pages 23712–23735https://doi.org/10.1007/s11227-024-06343-yAbstractThis paper presents the graphics processing unit (GPU) implementation of two-dimensional discrete cosine transform (2D DCT) and inverse discrete cosine transform (2D IDCT) for image compression applications. Based on the trigonometric properties, ...
- research-articleJune 2024
A high-performance, parallel, and hierarchically distributed model for coastal run-up events simulation and forecasting
- Diana Di Luccio,
- Ciro Giuseppe De Vita,
- Aniello Florio,
- Gennaro Mellone,
- Catherine Alessandra Torres Charles,
- Guido Benassai,
- Raffaele Montella
The Journal of Supercomputing (JSCO), Volume 80, Issue 15Pages 22748–22769https://doi.org/10.1007/s11227-024-06188-5AbstractThe request for quickly available forecasts of intense weather and marine events impacting coastal areas is gradually increasing. High-performance computing (HPC) and artificial intelligence techniques are crucial in this application. Risk ...
- research-articleJune 2024
Enhancing self-adaptation for efficient decision-making at run-time in streaming applications on multicores
The Journal of Supercomputing (JSCO), Volume 80, Issue 15Pages 22213–22244https://doi.org/10.1007/s11227-024-06191-wAbstractParallel computing is very important to accelerate the performance of computing applications. Moreover, parallel applications are expected to continue executing in more dynamic environments and react to changing conditions. In this context, ...
-
- research-articleJune 2024
A CUDA-based parallel optimization method for SM3 hash algorithm
The Journal of Supercomputing (JSCO), Volume 80, Issue 14Pages 21431–21446https://doi.org/10.1007/s11227-024-06141-6AbstractHash algorithms are among the most crucial algorithms in cryptography. The SM3 algorithm is a hash cryptographic standard of China. Because of the strong collision resistance and irreversibility of hash algorithms, they are widely used as a basic ...
- research-articleJune 2024
OpenMP offload toward the exascale using Intel® GPU Max 1550: evaluation of STREAmS compressible solver
The Journal of Supercomputing (JSCO), Volume 80, Issue 14Pages 21094–21127https://doi.org/10.1007/s11227-024-06254-yAbstractNearly 20 years after the birth of general-purpose GPU computing, the HPC landscape is now dominated by GPUs. After years of undisputed dominance by NVIDIA, new players have entered the arena in a convincing manner, namely AMD and more recently ...
- research-articleJune 2024
Design and performance evaluation of UCX for the Tofu Interconnect D on Fugaku towards efficient multithreaded communication
The Journal of Supercomputing (JSCO), Volume 80, Issue 14Pages 20715–20742https://doi.org/10.1007/s11227-024-06201-xAbstractThe increasing trend of manycore processors makes multithreaded communication more important to avoid costly global synchronization among cores. One of the representative approaches that require multithreaded communication is the global task-based ...
- research-articleMay 2024
Optimizing sparse general matrix–matrix multiplication for DCUs
- Hengliang Guo,
- Haolei Wang,
- Wanting Chen,
- Congxiang Zhang,
- Yubo Han,
- Shengguang Zhu,
- Dujuan Zhang,
- Yang Guo,
- Jiandong Shang,
- Tao Wan,
- Qingyang Li,
- Gang Wu
The Journal of Supercomputing (JSCO), Volume 80, Issue 14Pages 20176–20200https://doi.org/10.1007/s11227-024-06234-2AbstractSparse general matrix–matrix multiplication (SpGEMM) is a crucial and complex computational task in many practical applications. Improving the performance of SpGEMM on SIMT processors like modern GPUs is challenging due to the unpredictable ...
- research-articleJune 2023
Parallel program testing based on critical communication and branch transformation
The Journal of Supercomputing (JSCO), Volume 80, Issue 1Pages 519–548https://doi.org/10.1007/s11227-023-05460-4AbstractSoftware testing is an efficient way to guarantee the reliability and accuracy of parallel programs. Communication plays a substantial role in parallel program testing. The huge scale of communication within parallel programs poses a great ...
- research-articleJune 2023
Parallel algorithm design and optimization of geodynamic numerical simulation application on the Tianhe new-generation high-performance computer
The Journal of Supercomputing (JSCO), Volume 80, Issue 1Pages 331–362https://doi.org/10.1007/s11227-023-05469-9AbstractCitcomCu is a numerical simulation software for mantle convection in the field of geodynamics, which can simulate thermo-chemical convection in a three-dimensional domain. Due to the increasing demand for high-precision simulations and larger ...
- research-articleNovember 2022
- research-articleNovember 2022
Graph partitioning strategies: one size does not fit all
The Journal of Supercomputing (JSCO), Volume 78, Issue 17Pages 19272–19295https://doi.org/10.1007/s11227-022-04620-2AbstractAs an important part of distributed graph computing, graph partitioning has been widely studied. However, the majority of the existing approaches to distributed graph partitioning barely take into consideration the relationship between the ...
- research-articleNovember 2022
Scalable performance analysis method for SPMD applications
The Journal of Supercomputing (JSCO), Volume 78, Issue 17Pages 19346–19371https://doi.org/10.1007/s11227-022-04588-zAbstractThe analysis of parallel scientific applications allows us to understand their computational and communication behavior. One way of obtaining performance information is through performance tools. One such tool is parallel application signatures ...
- research-articleSeptember 2022
Fiuncho: a program for any-order epistasis detection in CPU clusters
The Journal of Supercomputing (JSCO), Volume 78, Issue 13Pages 15338–15357https://doi.org/10.1007/s11227-022-04477-5AbstractEpistasis can be defined as the statistical interaction of genes during the expression of a phenotype. It is believed that it plays a fundamental role in gene expression, as individual genetic variants have reported a very small increase in ...
- research-articleSeptember 2022
Image Sobel edge extraction algorithm accelerated by OpenCL
The Journal of Supercomputing (JSCO), Volume 78, Issue 14Pages 16236–16265https://doi.org/10.1007/s11227-022-04404-8AbstractAiming at the low processing speed of the Sobel edge detection algorithm and the equipment limitations of Compute Unified Device Architecture (CUDA) implementation algorithm acceleration, a Sobel edge detection parallel algorithm based on Open ...
- research-articleAugust 2022
Parallel SHA-256 on SW26010 many-core processor for hashing of multiple messages
The Journal of Supercomputing (JSCO), Volume 79, Issue 2Pages 2332–2355https://doi.org/10.1007/s11227-022-04750-7AbstractTo explore whether new parallelism techniques can provide additional performance improvements in cryptographic hash functions, we conducted our study with the SW26010, which is a special-architecture processor on Sunway TaihuLight, one of the ...