Algorithm 1029: Encapsulated Error, a Direct Approach to Evaluate Floating-Point Accuracy
ACM Transactions on Mathematical Software (TOMS), Volume 48, Issue 4, Article No.: 47, Pages 1–16, https://doi.org/10.1145/3549205
Floating-point numbers represent only a subset of real numbers. As such, floating-point arithmetic introduces approximations that can compound and have a significant impact on numerical simulations. We introduce encapsulated error, a new way to estimate ...
- research-article, September 2022
Configurable Open-source Data Structure for Distributed Conforming Unstructured Homogeneous Meshes with GPU Support
ACM Transactions on Mathematical Software (TOMS), Volume 48, Issue 3, Article No.: 30, Pages 1–30, https://doi.org/10.1145/3536164
A general multi-purpose data structure for an efficient representation of conforming unstructured homogeneous meshes for scientific computations on CPU and GPU-based systems is presented. The data structure is provided as open-source software as part of ...
- research-article, April 2021
Replicated Computational Results (RCR) Report for “Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software”
ACM Transactions on Mathematical Software (TOMS), Volume 47, Issue 2, Article No.: 15, Pages 1–4, https://doi.org/10.1145/3446000
The article by Flegar et al. titled “Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software” presents a novel, practical implementation of an adaptive precision block-Jacobi preconditioner. Performance ...
- research-article, November 2020
Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks
ACM Transactions on Mathematical Software (TOMS), Volume 46, Issue 4, Article No.: 37, Pages 1–33, https://doi.org/10.1145/3412380
Popular deep neural networks (DNNs) spend the majority of their execution time computing convolutions. The Winograd family of algorithms can greatly reduce the number of arithmetic operations required and is used in many DNN software frameworks. However,...
- technical-note, December 2019
Replicated Computational Results (RCR) Report for “Code Generation for Generally Mapped Finite Elements”
ACM Transactions on Mathematical Software (TOMS), Volume 45, Issue 4, Article No.: 42, Pages 1–7, https://doi.org/10.1145/3360984
“Code Generation for Generally Mapped Finite Elements” includes performance results for the finite element methods discussed in that manuscript. The authors provided a Zenodo archive with the Firedrake components and dependencies used, as well as the ...
- research-article, February 2019
Algorithm 991: The 2D Tree Sliding Window Discrete Fourier Transform
ACM Transactions on Mathematical Software (TOMS), Volume 45, Issue 1, Article No.: 12, Pages 1–12, https://doi.org/10.1145/3264426
We present a new algorithm for the 2D sliding window discrete Fourier transform. Our algorithm avoids repeating calculations in overlapping windows by storing them in a tree data-structure based on the ideas of the Cooley-Tukey fast Fourier transform. ...
- research-article, June 2018
Practical Polytope Volume Approximation
ACM Transactions on Mathematical Software (TOMS), Volume 44, Issue 4, Article No.: 38, Pages 1–21, https://doi.org/10.1145/3194656
We experimentally study the fundamental problem of computing the volume of a convex polytope given as an intersection of linear halfspaces. We implement and evaluate randomized polynomial-time algorithms for accurately approximating the polytope’s ...
- research-article, March 2016
Replicated Computational Results (RCR) Report for A Sparse Symmetric Indefinite Direct Solver for GPU Architectures
ACM Transactions on Mathematical Software (TOMS), Volume 42, Issue 1, Article No.: 2, Pages 1–10, https://doi.org/10.1145/2851489
A Sparse Symmetric Indefinite Direct Solver for GPU Architectures includes performance results and comparisons of the developed GPU direct solver against a CPU direct solver. New performance data were gathered using software provided by the manuscript ...
- opinion, June 2015
Editorial: ACM TOMS Replicated Computational Results Initiative
ACM Transactions on Mathematical Software (TOMS), Volume 41, Issue 3, Article No.: 13, Pages 1–5, https://doi.org/10.1145/2743015
The scientific community relies on the peer review process for assuring the quality of published material, the goal of which is to build a body of work we can trust. Computational journals such as the ACM Transactions on Mathematical Software (TOMS) use ...
- research-article, December 2011
Exploiting parallelism in matrix-computation kernels for symmetric multiprocessor systems: Matrix-multiplication and matrix-addition algorithm optimizations by software pipelining and threads allocation
ACM Transactions on Mathematical Software (TOMS), Volume 38, Issue 1, Article No.: 2, Pages 1–30, https://doi.org/10.1145/2049662.2049664
We present a simple and efficient methodology for the development, tuning, and installation of matrix algorithms such as the hybrid Strassen's and Winograd's fast matrix multiply or their combination with the 3M algorithm for complex matrices (i.e., ...
- research-article, December 2011
The university of Florida sparse matrix collection
ACM Transactions on Mathematical Software (TOMS), Volume 38, Issue 1, Article No.: 1, Pages 1–25, https://doi.org/10.1145/2049662.2049663
We describe the University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications. The Collection is widely used by the numerical linear algebra community for the development and ...
- research-article, March 2009
Adaptive Winograd's matrix multiplications
ACM Transactions on Mathematical Software (TOMS), Volume 36, Issue 1, Article No.: 3, Pages 1–23, https://doi.org/10.1145/1486525.1486528
Modern architectures have complex memory hierarchies and increasing parallelism (e.g., multicores). These features make achieving and maintaining good performance across rapidly changing architectures increasingly difficult. Performance has become a ...
- article, June 2007
MPFR: A multiple-precision binary floating-point library with correct rounding
ACM Transactions on Mathematical Software (TOMS), Volume 33, Issue 2, Pages 13–es, https://doi.org/10.1145/1236463.1236468
This article presents a multiple-precision binary floating-point library, written in the ISO C language, and based on the GNU MP library. Its particularity is to extend to arbitrary-precision, ideas from the IEEE 754 standard, by providing correct ...
- article, March 2001
A precision- and range-independent tool for testing floating-point arithmetic II: conversions
ACM Transactions on Mathematical Software (TOMS), Volume 27, Issue 1, Pages 119–140, https://doi.org/10.1145/382043.382405
The IEEE 754 and 854 standards for floating-point arithmetic are essentially a specification of a programming environment, encompassing aspects from computer hardware, operating systems, and compilers to programming languages (see especially Section 8). ...
- article, March 2001
A precision- and range-independent tool for testing floating-point arithmetric I: basic operations, square root, and remainder
ACM Transactions on Mathematical Software (TOMS), Volume 27, Issue 1, Pages 92–118, https://doi.org/10.1145/382043.382404
This paper introduces a precision- and range-independent tool for testing the compliance of hardware or software implementations of (multiprecision) floating-point arithmetic with the principles of the IEEE standards 754 and 854. The tool consists of a ...
- article, September 1997
Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface
ACM Transactions on Mathematical Software (TOMS), Volume 23, Issue 3, Pages 379–401, https://doi.org/10.1145/275323.275327
This article proposes a set of Level 3 Basic Linear Algebra Subprograms and associated kernels for sparse matrices. A major goal is to design and develop a common framework to enable efficient, and portable, implementations of iterative algorithms for ...
- article, December 1996
Remark on “Fast floating-point processing in Common Lisp”
ACM Transactions on Mathematical Software (TOMS), Volume 22, Issue 4, Pages 496–497, https://doi.org/10.1145/235815.235824
We explain why we feel that the comparison between Common Lisp and Fortran in a recent article by Fateman et al. in this journal is not entirely fair.
Algorithm 711: BTN: software for parallel unconstrained optimization
ACM Transactions on Mathematical Software (TOMS), Volume 18, Issue 4, Pages 414–448, https://doi.org/10.1145/138351.138359
BTN is a collection of FORTRAN subroutines for solving unconstrained nonlinear optimization problems. It currently runs on both Intel hypercube computers (distributed memory) and Sequent computers (shared memory), and can take advantage of vector ...