Minin et al., 2021 - Google Patents

Benchmarks of cuda-based GMRES solver for Toeplitz and Hankel matrices and applications to topology optimization of photonic components

Document ID
2457509314512813622
Author
Minin I
Matveev S
Fedorov M
Zacharov I
Rykovanov S
Publication year
2021
Publication venue
Computational Mathematics and Modeling

Snippet

The Generalized Minimal Residual Method (GMRES) has been benchmarked on many types of GPUs for solving linear systems with dense and sparse matrices. However, there are still no benchmarks comparing GMRES implementations on the Tesla V100 with those on the GTX 1080 Ti or even …

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/14 Fourier, Walsh or analogous domain transformations, e.g. Laplace, Hilbert, Karhunen-Loeve, transforms
    • G06F17/141 Discrete Fourier transforms
    • G06F17/142 Fast Fourier transforms, e.g. using a Cooley-Tukey type algorithm
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/11 Complex mathematical operations for solving equations, e.g. nonlinear equations, general mathematical optimization problems
    • G06F17/12 Simultaneous equations, e.g. systems of linear equations
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/50 Computer-aided design
    • G06F17/5009 Computer-aided design using simulation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F17/00 Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10 Complex mathematical operations
    • G06F17/16 Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00 Arrangements for software engineering
    • G06F8/40 Transformations of program code
    • G06F8/41 Compilation
    • G06F8/45 Exploiting coarse grain parallelism in compilation, i.e. parallelism between groups of instructions
    • G06F8/456 Parallelism detection
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F8/00 Arrangements for software engineering
    • G06F8/40 Transformations of program code
    • G06F8/41 Compilation
    • G06F8/45 Exploiting coarse grain parallelism in compilation, i.e. parallelism between groups of instructions
    • G06F8/451 Code distribution
    • G06F8/452 Loops
    • G PHYSICS
    • G06 COMPUTING; CALCULATING; COUNTING
    • G06F ELECTRICAL DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for programme control, e.g. control unit
    • G06F9/06 Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
    • G06F9/46 Multiprogramming arrangements
    • G06F9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061 Partitioning or combining of resources

Similar Documents

Publication Publication Date Title
CN103631761B (en) Parallel processing architecture carries out matrix operation and for the method for strict ripple coupling analysis
Bertaccini et al. Iterative methods and preconditioning for large and sparse linear systems with applications
Breiten et al. Low-rank solvers for fractional differential equations
Godwin et al. High-performance sparse matrix-vector multiplication on GPUs for structured grid computations
Konyaev et al. Computer simulation of optical wave propagation with the use of parallel programming
Bernaschi et al. A factored sparse approximate inverse preconditioned conjugate gradient solver on graphics processing units
Pikle et al. GPGPU-based parallel computing applied in the FEM using the conjugate gradient algorithm: a review
Minin et al. Benchmarks of cuda-based GMRES solver for Toeplitz and Hankel matrices and applications to topology optimization of photonic components
Ibeid et al. Fast multipole preconditioners for sparse matrices arising from elliptic equations
Lu et al. Tilesptrsv: a tiled algorithm for parallel sparse triangular solve on gpus
Herholz et al. Sparsity-specific code optimization using expression trees
Chakkour Parallel computation to bidimensional heat equation using MPI/CUDA and FFTW package
AlOnazi et al. Asynchronous task-based parallelization of algebraic multigrid
Hidayetoglu et al. A fast and massively-parallel inverse solver for multiple-scattering tomographic image reconstruction
Ashari et al. A model-driven blocking strategy for load balanced sparse matrix–vector multiplication on GPUs
Luo et al. A fine-grained block ILU scheme on regular structures for GPGPUs
Chen et al. A matrix-free parallel solution method for the three-dimensional heterogeneous Helmholtz equation
Chen et al. HPCG: preliminary evaluation and optimization on Tianhe-2 CPU-only nodes
Ljungkvist et al. Multigrid for matrix-free finite element computations on graphics processors
Magee et al. Accelerating solutions of one-dimensional unsteady PDEs with GPU-based swept time–space decomposition
Li et al. A parallel structured banded DC algorithm for symmetric eigenvalue problems
JP2023544290A (en) Determination and use of spectral embedding of large systems by substructuring
Kochurov et al. GPU implementation of Jacobi Method and Gauss-Seidel Method for Data Arrays that Exceed GPU-dedicated Memory Size
Genovese et al. Wavelet‐Based Density Functional Theory on Massively Parallel Hybrid Architectures
Mahfoudhi et al. Parallel triangular matrix system solving on CPU-GPU system