Benchmarks of cuda-based GMRES solver for Toeplitz and Hankel matrices and applications to topology optimization of photonic components
Minin et al., 2021
- Document ID
- 2457509314512813622
- Author
- Minin I
- Matveev S
- Fedorov M
- Zacharov I
- Rykovanov S
- Publication year
- 2021
- Publication venue
- Computational Mathematics and Modeling
Snippet
The Generalized Minimal Residual method (GMRES) has been benchmarked on many types of GPUs for solving linear systems with dense and sparse matrices. However, there are still no benchmarks comparing GMRES implementations on the Tesla V100 with those on the GTX 1080 Ti, or even …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/14—Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
- G06F17/141—Discrete Fourier transforms
- G06F17/142—Fast Fourier transforms, e.g. using a Cooley-Tukey type algorithm
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/11—Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
- G06F17/12—Simultaneous equations, e.g. systems of linear equations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/16—Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/45—Exploiting coarse grain parallelism in compilation, i.e. parallelism between groups of instructions
- G06F8/456—Parallelism detection
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
- G06F8/45—Exploiting coarse grain parallelism in compilation, i.e. parallelism between groups of instructions
- G06F8/451—Code distribution
- G06F8/452—Loops
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN103631761B (en) | Method for performing matrix operations on a parallel processing architecture, for rigorous coupled-wave analysis | |
Bertaccini et al. | Iterative methods and preconditioning for large and sparse linear systems with applications | |
Breiten et al. | Low-rank solvers for fractional differential equations | |
Godwin et al. | High-performance sparse matrix-vector multiplication on GPUs for structured grid computations | |
Konyaev et al. | Computer simulation of optical wave propagation with the use of parallel programming | |
Bernaschi et al. | A factored sparse approximate inverse preconditioned conjugate gradient solver on graphics processing units | |
Pikle et al. | GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review | |
Minin et al. | Benchmarks of cuda-based GMRES solver for Toeplitz and Hankel matrices and applications to topology optimization of photonic components | |
Ibeid et al. | Fast multipole preconditioners for sparse matrices arising from elliptic equations | |
Lu et al. | Tilesptrsv: a tiled algorithm for parallel sparse triangular solve on gpus | |
Herholz et al. | Sparsity-specific code optimization using expression trees | |
Chakkour | Parallel computation to bidimensional heat equation using MPI/CUDA and FFTW package | |
AlOnazi et al. | Asynchronous task-based parallelization of algebraic multigrid | |
Hidayetoglu et al. | A fast and massively-parallel inverse solver for multiple-scattering tomographic image reconstruction | |
Ashari et al. | A model-driven blocking strategy for load balanced sparse matrix–vector multiplication on GPUs | |
Luo et al. | A fine-grained block ILU scheme on regular structures for GPGPUs | |
Chen et al. | A matrix-free parallel solution method for the three-dimensional heterogeneous Helmholtz equation | |
Chen et al. | HPCG: preliminary evaluation and optimization on Tianhe-2 CPU-only nodes | |
Ljungkvist et al. | Multigrid for matrix-free finite element computations on graphics processors | |
Magee et al. | Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition | |
Li et al. | A parallel structured banded DC algorithm for symmetric eigenvalue problems | |
JP2023544290A (en) | Determination and use of spectral embedding of large systems by substructuring | |
Kochurov et al. | GPU implementation of Jacobi Method and Gauss-Seidel Method for Data Arrays that Exceed GPU-dedicated Memory Size | |
Genovese et al. | Wavelet‐Based Density Functional Theory on Massively Parallel Hybrid Architectures | |
Mahfoudhi et al. | Parallel triangular matrix system solving on CPU-GPU system |