Minin et al., 2021

Benchmarks of cuda-based GMRES solver for Toeplitz and Hankel matrices and applications to topology optimization of photonic components

Document ID: 2457509314512813622
Author: Minin I; Matveev S; Fedorov M; Zacharov I; Rykovanov S
Publication year: 2021
Publication venue: Computational Mathematics and Modeling

Snippet

The Generalized Minimal Residual Method (GMRES) has been benchmarked on many types of GPUs for solving linear systems with dense and sparse matrices. However, there are still no benchmarks of GMRES implementations on the Tesla V100 compared with those on the GTX 1080 Ti, or even …

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/14—Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
- G06F17/141—Discrete Fourier transforms
- G06F17/142—Fast Fourier transforms, e.g. using a Cooley-Tukey type algorithm
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G06F17/12—Simultaneous equations, e.g. systems of linear equations
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/45—Exploiting coarse grain parallelism in compilation, i.e. parallelism between groups of instructions
- G06F8/456—Parallelism detection
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/45—Exploiting coarse grain parallelism in compilation, i.e. parallelism between groups of instructions
- G06F8/451—Code distribution
- G06F8/452—Loops
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources

Similar Documents

Publication	Publication Date	Title
CN103631761B (en)	2018-02-27	Parallel processing architecture for matrix operations and method for rigorous coupled-wave analysis
Bertaccini et al.	2018	Iterative methods and preconditioning for large and sparse linear systems with applications
Breiten et al.	2016	Low-rank solvers for fractional differential equations
Godwin et al.	2012	High-performance sparse matrix-vector multiplication on GPUs for structured grid computations
Konyaev et al.	2011	Computer simulation of optical wave propagation with the use of parallel programming
Bernaschi et al.	2016	A factored sparse approximate inverse preconditioned conjugate gradient solver on graphics processing units
Pikle et al.	2018	GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review
Ibeid et al.	2018	Fast multipole preconditioners for sparse matrices arising from elliptic equations
Lu et al.	2023	Tilesptrsv: a tiled algorithm for parallel sparse triangular solve on gpus
Herholz et al.	2022	Sparsity-specific code optimization using expression trees
Chakkour	2024	Parallel computation to bidimensional heat equation using MPI/CUDA and FFTW package
AlOnazi et al.	2017	Asynchronous task-based parallelization of algebraic multigrid
Hidayetoglu et al.	2018	A fast and massively-parallel inverse solver for multiple-scattering tomographic image reconstruction
Ashari et al.	2015	A model-driven blocking strategy for load balanced sparse matrix–vector multiplication on GPUs
Luo et al.	2015	A fine-grained block ILU scheme on regular structures for GPGPUs
Chen et al.	2023	A matrix-free parallel solution method for the three-dimensional heterogeneous Helmholtz equation
Chen et al.	2014	HPCG: preliminary evaluation and optimization on Tianhe-2 CPU-only nodes
Ljungkvist et al.	2017	Multigrid for matrix-free finite element computations on graphics processors
Magee et al.	2018	Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition
Li et al.	2023	A parallel structured banded DC algorithm for symmetric eigenvalue problems
JP2023544290A (en)	2023-10-23	Determination and use of spectral embedding of large systems by substructuring
Kochurov et al.	2015	GPU implementation of Jacobi Method and Gauss-Seidel Method for Data Arrays that Exceed GPU-dedicated Memory Size
Genovese et al.	2016	Wavelet‐Based Density Functional Theory on Massively Parallel Hybrid Architectures
Mahfoudhi et al.	2016	Parallel triangular matrix system solving on CPU-GPU system