Parallel computing methodologies

8 Results for: Book/Issue: SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures


extended-abstract
July 2020
Communication Lower Bounds of Convolutions in CNNs
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 591–593 | https://doi.org/10.1145/3350755.3400267

Convolution is the most time-consuming part in the computation of convolutional neural networks (CNNs). Due to the complex data dependency and the increase in the amount of model samples, the convolution suffers from high overhead on data movement. This ...
Total Citations: 1 | Total Downloads: 187 | Last 12 Months: 5 | Last 6 weeks: 1

extended-abstract
July 2020
On the Limits of Parallelizing Convolutional Neural Networks on GPUs
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 567–569 | https://doi.org/10.1145/3350755.3400266

GPUs are currently the platform of choice for training neural networks. However, training a deep neural network (DNN) is a time-consuming process even on GPUs because of the massive number of parameters that have to be learned. As a result, accelerating ...
Total Citations: 10 | Total Downloads: 269 | Last 12 Months: 26 | Last 6 weeks: 2

extended-abstract
July 2020
Provable Neuromorphic Advantages for Computing Shortest Paths
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 497–499 | https://doi.org/10.1145/3350755.3400258

Neuromorphic computing offers the potential of an unprecedented level of parallelism at a local scale. Although in their infancy, current first-generation neuromorphic processing units (NPUs) deliver as many as 128K artificial neurons in a package ...
Total Citations: 12 | Total Downloads: 299 | Last 12 Months: 46 | Last 6 weeks: 13

extended-abstract
Public Access
July 2020
ParlayLib - A Toolkit for Parallel Algorithms on Shared-Memory Multicore Machines
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 507–509 | https://doi.org/10.1145/3350755.3400254

ParlayLib is a C++ library for developing efficient parallel algorithms and software on shared-memory multicore machines. It provides additional tools and primitives that go beyond what is available in the C++ standard library, and simplifies the task ...
Total Citations: 35 | Total Downloads: 756 | Last 12 Months: 230 | Last 6 weeks: 31

research-article
Open Access
July 2020
Honorable Mention
Optimal Parallel Algorithms in the Binary-Forking Model
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 89–102 | https://doi.org/10.1145/3350755.3400227

In this paper we develop optimal algorithms in the binary-forking model for a variety of fundamental problems, including sorting, semisorting, list ranking, tree contraction, range minima, and ordered set union, intersection and difference. In the ...
Total Citations: 29 | Total Downloads: 1,130 | Last 12 Months: 266 | Last 6 weeks: 41

short-paper
July 2020
Giving Future(s) to Transactional Memory
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 587–589 | https://doi.org/10.1145/3350755.3400220

This paper extends the Transactional Memory (TM) paradigm by proposing a new powerful abstraction, the transactional future. Transactional futures, as the name suggests, combine TM with futures, by allowing programmers to exploit intra-transaction ...
Total Citations: 0 | Total Downloads: 95 | Last 12 Months: 6 | Last 6 weeks: 0

research-article
July 2020
Bandwidth Optimized Parallel Algorithms for Sparse Matrix-Matrix Multiplication using Propagation Blocking
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 293–303 | https://doi.org/10.1145/3350755.3400216

Sparse matrix-matrix multiplication (SpGEMM) is a widely used kernel in various graph, scientific computing and machine learning algorithms. It is well known that SpGEMM is a memory-bound operation, and its peak performance is expected to be bound by ...
Total Citations: 17 | Total Downloads: 387 | Last 12 Months: 73 | Last 6 weeks: 9

research-article
July 2020
Memory Tagging: Minimalist Synchronization for Scalable Concurrent Data Structures
SPAA '20: Proceedings of the 32nd ACM Symposium on Parallelism in Algorithms and Architectures | Pages 37–49 | https://doi.org/10.1145/3350755.3400213

There has been a significant amount of research on hardware and software support for efficient concurrent data structures; yet, the question of how to build correct, simple, and scalable data structures has not yet been definitively settled. In this ...
Total Citations: 1 | Total Downloads: 192 | Last 12 Months: 6 | Last 6 weeks: 0