Cited By
View all- Kudo SImamura T(2019)Cache-efficient implementation and batching of tridiagonalization on manycore CPUsProceedings of the International Conference on High Performance Computing in Asia-Pacific Region10.1145/3293320.3293329(71-80)Online publication date: 14-Jan-2019
- Rodriguez-Gutiez EMoreton-Fernandez AGonzalez-Escribano ALlanos D(2019)Toward a BLAS library truly portable across different accelerator typesThe Journal of Supercomputing10.1007/s11227-019-02925-3Online publication date: 10-Jun-2019
- Dongarra JGates MHaidar AKurzak JLuszczek PTomov SYamazaki I(2018)The Singular Value Decomposition: Anatomy of Optimizing an Algorithm for Extreme ScaleSIAM Review10.1137/17M111773260:4(808-865)Online publication date: 8-Nov-2018
- Show More Cited By