Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- extended-abstractJuly 2020
Communication Lower Bounds of Convolutions in CNNs
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 591–593https://doi.org/10.1145/3350755.3400267Convolution is the most time-consuming part in the computation of convolutional neural networks (CNNs). Due to the complex data dependency and the increase in the amount of model samples, the convolution suffers from high overhead on data movement. This ...
- extended-abstractJuly 2020
On the Limits of Parallelizing Convolutional Neural Networks on GPUs
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 567–569https://doi.org/10.1145/3350755.3400266GPUs are currently the platform of choice for training neural networks. However, training a deep neural network (DNN) is a time-consuming process even on GPUs because of the massive number of parameters that have to be learned. As a result, accelerating ...
- extended-abstractJuly 2020
Provable Neuromorphic Advantages for Computing Shortest Paths
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 497–499https://doi.org/10.1145/3350755.3400258Neuromorphic computing offers the potential of an unprecedented level of parallelism at a local scale. Although in their infancy, current first-generation neuromorphic processing units (NPUs) deliver as many as 128K artificial neurons in a package ...
- extended-abstractJuly 2020
ParlayLib - A Toolkit for Parallel Algorithms on Shared-Memory Multicore Machines
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 507–509https://doi.org/10.1145/3350755.3400254ParlayLib is a C++ library for developing efficient parallel algorithms and software on shared-memory multicore machines. It provides additional tools and primitives that go beyond what is available in the C++ standard library, and simplifies the task ...
- research-articleJuly 2020Honorable Mention
Optimal Parallel Algorithms in the Binary-Forking Model
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 89–102https://doi.org/10.1145/3350755.3400227In this paper we develop optimal algorithms in the binary-forking model for a variety of fundamental problems, including sorting, semisorting, list ranking, tree contraction, range minima, and ordered set union, intersection and difference. In the ...
- short-paperJuly 2020
Giving Future(s) to Transactional Memory
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 587–589https://doi.org/10.1145/3350755.3400220This paper extends the Transactional Memory (TM) paradigm by proposing a new powerful abstraction, the transactional future. Transactional futures, as the name suggests, combine TM with futures, by allowing programmers to exploit intra-transaction ...
- research-articleJuly 2020
Bandwidth Optimized Parallel Algorithms for Sparse Matrix-Matrix Multiplication using Propagation Blocking
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 293–303https://doi.org/10.1145/3350755.3400216Sparse matrix-matrix multiplication (SpGEMM) is a widely used kernel in various graph, scientific computing and machine learning algorithms. It is well known that SpGEMM is a memory-bound operation, and its peak performance is expected to be bound by ...
- research-articleJuly 2020
Memory Tagging: Minimalist Synchronization for Scalable Concurrent Data Structures
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and ArchitecturesPages 37–49https://doi.org/10.1145/3350755.3400213There has been a significant amount of research on hardware and software support for efficient concurrent data structures; yet, the question of how to build correct, simple, and scalable data structures has not yet been definitively settled. In this ...