General and reference

Applied Filters

People

Publications

Reproducibility Badges

Publication Date

Searched The ACM Guide to Computing Literature (3,797,140 records)|Limit your search to The ACM Full-Text Collection (767,997 records)

Showing 1 - 20of33 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
Open Access
March 2023
Artifacts Available / v1.1
Artifacts Evaluated & Reusable / v1.1
Algorithm 1029: Encapsulated Error, a Direct Approach to Evaluate Floating-Point Accuracy
ACM Transactions on Mathematical Software (TOMS), Volume 48, Issue 4Article No.: 47, Pages 1–16https://doi.org/10.1145/3549205
Floating-point numbers represent only a subset of real numbers. As such, floating-point arithmetic introduces approximations that can compound and have a significant impact on numerical simulations. We introduce encapsulated error, a new way to estimate ...
0
809
Metrics
Total Citations0
Total Downloads809
Last 12 Months505
Last 6 weeks84
1
Supplementary Material
3549205.pdf
View online with eReader
PDF
research-article
Open Access
September 2022
Configurable Open-source Data Structure for Distributed Conforming Unstructured Homogeneous Meshes with GPU Support
ACM Transactions on Mathematical Software (TOMS), Volume 48, Issue 3Article No.: 30, Pages 1–30https://doi.org/10.1145/3536164
A general multi-purpose data structure for an efficient representation of conforming unstructured homogeneous meshes for scientific computations on CPU and GPU-based systems is presented. The data structure is provided as open-source software as part of ...
0
1,421
Metrics
Total Citations0
Total Downloads1,421
Last 12 Months475
Last 6 weeks59
View online with eReader
PDF
research-article
Open Access
April 2021
Replicated Computational Results (RCR) Report for “Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software”
- Sarah Osborn
ACM Transactions on Mathematical Software (TOMS), Volume 47, Issue 2Article No.: 15, Pages 1–4https://doi.org/10.1145/3446000

The article by Flegar et al. titled “Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software” presents a novel, practical implementation of an adaptive precision block-Jacobi preconditioner. Performance ...
0
305
Metrics
Total Citations0
Total Downloads305
Last 12 Months132
Last 6 weeks16
View online with eReader
View this article in HTML format
PDF
research-article
Open Access
November 2020
Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks
ACM Transactions on Mathematical Software (TOMS), Volume 46, Issue 4Article No.: 37, Pages 1–33https://doi.org/10.1145/3412380

Popular deep neural networks (DNNs) spend the majority of their execution time computing convolutions. The Winograd family of algorithms can greatly reduce the number of arithmetic operations required and is used in many DNN software frameworks. However,...
21
686
Metrics
Total Citations21
Total Downloads686
Last 12 Months240
Last 6 weeks41
View online with eReader
View this article in HTML format
PDF
technical-note
December 2019
Replicated Computational Results (RCR) Report for “Code Generation for Generally Mapped Finite Elements”
- Neil Lindquist
ACM Transactions on Mathematical Software (TOMS), Volume 45, Issue 4Article No.: 42, Pages 1–7https://doi.org/10.1145/3360984

“Code Generation for Generally Mapped Finite Elements” includes performance results for the finite element methods discussed in that manuscript. The authors provided a Zenodo archive with the Firedrake components and dependencies used, as well as the ...
0
127
Metrics
Total Citations0
Total Downloads127
Last 12 Months6
Last 6 weeks2
Get Access
research-article
February 2019
Algorithm 991: The 2D Tree Sliding Window Discrete Fourier Transform
- Lee F. Richardson,
- William F. Eddy
ACM Transactions on Mathematical Software (TOMS), Volume 45, Issue 1Article No.: 12, Pages 1–12https://doi.org/10.1145/3264426

We present a new algorithm for the 2D sliding window discrete Fourier transform. Our algorithm avoids repeating calculations in overlapping windows by storing them in a tree data-structure based on the ideas of the Cooley-Tukey fast Fourier transform. ...
0
663
Metrics
Total Citations0
Total Downloads663
Last 12 Months8
Last 6 weeks0
1
Supplementary Material
991.zip
Get Access
research-article
June 2018
Practical Polytope Volume Approximation
- Ioannis Z. Emiris,
- Vissarion Fisikopoulos
ACM Transactions on Mathematical Software (TOMS), Volume 44, Issue 4Article No.: 38, Pages 1–21https://doi.org/10.1145/3194656

We experimentally study the fundamental problem of computing the volume of a convex polytope given as an intersection of linear halfspaces. We implement and evaluate randomized polynomial-time algorithms for accurately approximating the polytope’s ...
22
758
Metrics
Total Citations22
Total Downloads758
Last 12 Months93
Last 6 weeks13
Get Access
research-article
March 2016
Replicated Computational Results (RCR) Report for A Sparse Symmetric Indefinite Direct Solver for GPU Architectures
- Eric T. Bavier
ACM Transactions on Mathematical Software (TOMS), Volume 42, Issue 1Article No.: 2, Pages 1–10https://doi.org/10.1145/2851489

A Sparse Symmetric Indefinite Direct Solver for GPU Architectures includes performance results and comparisons of the developed GPU direct solver against a CPU direct solver. New performance data were gathered using software provided by the manuscript ...
0
197
Metrics
Total Citations0
Total Downloads197
Last 12 Months8
Last 6 weeks0
1
Supplementary Material
a2-bavier-apndx.zip
Get Access
opinion
Free
June 2015
Editorial: ACM TOMS Replicated Computational Results Initiative
- Michael A. Heroux
ACM Transactions on Mathematical Software (TOMS), Volume 41, Issue 3Article No.: 13, Pages 1–5https://doi.org/10.1145/2743015

The scientific community relies on the peer review process for assuring the quality of published material, the goal of which is to build a body of work we can trust. Computational journals such as the ACM Transactions on Mathematical Software (TOMS) use ...
22
618
Metrics
Total Citations22
Total Downloads618
Last 12 Months87
Last 6 weeks17
View online with eReader
PDF
research-article
December 2011
Exploiting parallelism in matrix-computation kernels for symmetric multiprocessor systems: Matrix-multiplication and matrix-addition algorithm optimizations by software pipelining and threads allocation
ACM Transactions on Mathematical Software (TOMS), Volume 38, Issue 1Article No.: 2, Pages 1–30https://doi.org/10.1145/2049662.2049664

We present a simple and efficient methodology for the development, tuning, and installation of matrix algorithms such as the hybrid Strassen's and Winograd's fast matrix multiply or their combination with the 3M algorithm for complex matrices (i.e., ...
19
1,067
Metrics
Total Citations19
Total Downloads1,067
Last 12 Months21
Last 6 weeks2
Get Access
research-article
December 2011
The university of Florida sparse matrix collection
- Timothy A. Davis,
- Yifan Hu
ACM Transactions on Mathematical Software (TOMS), Volume 38, Issue 1Article No.: 1, Pages 1–25https://doi.org/10.1145/2049662.2049663

We describe the University of Florida Sparse Matrix Collection, a large and actively growing set of sparse matrices that arise in real applications. The Collection is widely used by the numerical linear algebra community for the development and ...
1,935
4,651
Metrics
Total Citations1,935
Total Downloads4,651
Last 12 Months512
Last 6 weeks67
Get Access
research-article
March 2009
Adaptive Winograd's matrix multiplications
- Paolo D'Alberto,
- Alexandru Nicolau
ACM Transactions on Mathematical Software (TOMS), Volume 36, Issue 1Article No.: 3, Pages 1–23https://doi.org/10.1145/1486525.1486528

Modern architectures have complex memory hierarchies and increasing parallelism (e.g., multicores). These features make achieving and maintaining good performance across rapidly changing architectures increasingly difficult. Performance has become a ...
15
692
Metrics
Total Citations15
Total Downloads692
Last 12 Months10
Last 6 weeks1
Get Access
article
June 2007
MPFR: A multiple-precision binary floating-point library with correct rounding
ACM Transactions on Mathematical Software (TOMS), Volume 33, Issue 2Pages 13–eshttps://doi.org/10.1145/1236463.1236468

This article presents a multiple-precision binary floating-point library, written in the ISO C language, and based on the GNU MP library. Its particularity is to extend to arbitrary-precision, ideas from the IEEE 754 standard, by providing correct ...
652
2,161
Metrics
Total Citations652
Total Downloads2,161
Last 12 Months115
Last 6 weeks14
Get Access
article
March 2001
A precision- and range-independent tool for testing floating-point arithmetic II: conversions
ACM Transactions on Mathematical Software (TOMS), Volume 27, Issue 1Pages 119–140https://doi.org/10.1145/382043.382405

The IEEE 754 and 854 standards for floating-point arithmetic are essentially a specification of a programming environment, encompassing aspects from computer hardware, operating systems, and compilers to programming languages (see especially Section 8). ...
13
752
Metrics
Total Citations13
Total Downloads752
Last 12 Months5
Last 6 weeks0
Get Access
article
March 2001
A precision- and range-independent tool for testing floating-point arithmetric I: basic operations, square root, and remainder
ACM Transactions on Mathematical Software (TOMS), Volume 27, Issue 1Pages 92–118https://doi.org/10.1145/382043.382404

This paper introduces a precision- and range-independent tool for testing the compliance of hardware or software implementations of (multiprecision) floating-point arithmetic with the principles of the IEEE standards 754 and 854. The tool consists of a ...
12
811
Metrics
Total Citations12
Total Downloads811
Last 12 Months9
Last 6 weeks1
Get Access
article
Free
June 2000
John R. Rice: biographical and professional notes
ACM Transactions on Mathematical Software (TOMS), Volume 26, Issue 2Pages 225–226https://doi.org/10.1145/353474.354105
0
274
Metrics
Total Citations0
Total Downloads274
Last 12 Months31
Last 6 weeks10
View online with eReader
PDF
editorial
Free
June 2000
Editorial: special issue in honor of John Rice's 65th birthday
ACM Transactions on Mathematical Software (TOMS), Volume 26, Issue 2Page 223https://doi.org/10.1145/353474.354094
1
407
Metrics
Total Citations1
Total Downloads407
Last 12 Months35
Last 6 weeks8
View online with eReader
PDF
article
Free
September 1997
Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface
ACM Transactions on Mathematical Software (TOMS), Volume 23, Issue 3Pages 379–401https://doi.org/10.1145/275323.275327

This article proposes a set of Level 3 Basic Linear Algebra Subprograms and associated kernels for sparse matrices. A major goal is to design and develop a common framework to enable efficient, and portable, implementations of iterative algorithms for ...
53
819
Metrics
Total Citations53
Total Downloads819
Last 12 Months88
Last 6 weeks14
View online with eReader
PDF
article
Free
December 1996
Remark on “Fast floating-point processing in Common Lisp”
- J. K. Reid
ACM Transactions on Mathematical Software (TOMS), Volume 22, Issue 4Pages 496–497https://doi.org/10.1145/235815.235824

We explain why we feel that the comparison betwen Common Lisp and Fortran in a recent article by Fateman et al. in this journal is not entirely fair.
2
449
Metrics
Total Citations2
Total Downloads449
Last 12 Months54
Last 6 weeks11
View online with eReader
PDF
article
Free
December 1992
Artifacts Available
Artifacts Evaluated & Reusable
Algorithm 711: BTN: software for parallel unconstrained optimization
- Stephen G. Nash,
- Ariela Sofer
ACM Transactions on Mathematical Software (TOMS), Volume 18, Issue 4Pages 414–448https://doi.org/10.1145/138351.138359

BTN is a collection of FORTRAN subroutines for solving unconstrained nonlinear optimization problems. It currently runs on both Intel hypercube computers (distributed memory) and Sequent computers (shared memory), and can take advantage of vector ...
10
503
Metrics
Total Citations10
Total Downloads503
Last 12 Months67
Last 6 weeks9
1
Supplementary Material
BTN (711.gz)
View online with eReader
PDF

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Reproducibility Badges

Publication Date

Algorithm 1029: Encapsulated Error, a Direct Approach to Evaluate Floating-Point Accuracy

Configurable Open-source Data Structure for Distributed Conforming Unstructured Homogeneous Meshes with GPU Support

Replicated Computational Results (RCR) Report for “Adaptive Precision Block-Jacobi for High Performance Preconditioning in the Ginkgo Linear Algebra Software”

Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks

Replicated Computational Results (RCR) Report for “Code Generation for Generally Mapped Finite Elements”

Algorithm 991: The 2D Tree Sliding Window Discrete Fourier Transform

Practical Polytope Volume Approximation

Replicated Computational Results (RCR) Report for A Sparse Symmetric Indefinite Direct Solver for GPU Architectures

Editorial: ACM TOMS Replicated Computational Results Initiative

Exploiting parallelism in matrix-computation kernels for symmetric multiprocessor systems: Matrix-multiplication and matrix-addition algorithm optimizations by software pipelining and threads allocation

The university of Florida sparse matrix collection

Adaptive Winograd's matrix multiplications

MPFR: A multiple-precision binary floating-point library with correct rounding

A precision- and range-independent tool for testing floating-point arithmetic II: conversions

A precision- and range-independent tool for testing floating-point arithmetric I: basic operations, square root, and remainder

John R. Rice: biographical and professional notes

Editorial: special issue in honor of John Rice's 65th birthday

Level 3 basic linear algebra subprograms for sparse matrices: a user-level interface

Remark on “Fast floating-point processing in Common Lisp”

Algorithm 711: BTN: software for parallel unconstrained optimization