Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- sectionMarch 2018
- research-articleMarch 2018
Slim NoC: A Low-Diameter On-Chip Network Topology for High Energy Efficiency and Scalability
- Maciej Besta,
- Syed Minhaj Hassan,
- Sudhakar Yalamanchili,
- Rachata Ausavarungnirun,
- Onur Mutlu,
- Torsten Hoefler
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 43–55https://doi.org/10.1145/3173162.3177158Emerging chips with hundreds and thousands of cores require networks with unprecedented energy/area efficiency and scalability. To address this, we propose Slim NoC (SN): a new on-chip network design that delivers significant improvements in efficiency ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
NEOFog: Nonvolatility-Exploiting Optimizations for Fog Computing
- Kaisheng Ma,
- Xueqing Li,
- Mahmut Taylan Kandemir,
- Jack Sampson,
- Vijaykrishnan Narayanan,
- Jinyang Li,
- Tongda Wu,
- Zhibo Wang,
- Yongpan Liu,
- Yuan Xie
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 782–796https://doi.org/10.1145/3173162.3177154Nonvolatile processors have emerged as one of the promising solutions for energy harvesting scenarios, among which Wireless Sensor Networks (WSN) provide some of the most important applications. In a typical distributed sensing system, due to difference ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
LTRF: Enabling High-Capacity Register Files for GPUs via Hardware/Software Cooperative Register Prefetching
- Mohammad Sadrosadati,
- Amirhossein Mirhosseini,
- Seyed Borna Ehsani,
- Hamid Sarbazi-Azad,
- Mario Drumond,
- Babak Falsafi,
- Rachata Ausavarungnirun,
- Onur Mutlu
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 489–502https://doi.org/10.1145/3173162.3173211Graphics Processing Units (GPUs) employ large register files to accommodate all active threads and accelerate context switching. Unfortunately, register files are a scalability bottleneck for future GPUs due to long access latency, high power ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
Wonderland: A Novel Abstraction-Based Out-Of-Core Graph Processing System
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 608–621https://doi.org/10.1145/3173162.3173208Many important graph applications are iterative algorithms that repeatedly process the input graph until convergence. For such algorithms, graph abstraction is an important technique: although much smaller than the original graph, it can bootstrap an ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
Espresso: Brewing Java For More Non-Volatility with Non-volatile Memory
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 70–83https://doi.org/10.1145/3173162.3173201Fast, byte-addressable non-volatile memory (NVM) embraces both near-DRAM latency and disk-like persistence, which has generated considerable interests to revolutionize system software stack and programming models. However, it is less understood how NVM ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
Devirtualizing Memory in Heterogeneous Systems
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 637–650https://doi.org/10.1145/3173162.3173194Accelerators are increasingly recognized as one of the major drivers of future computational growth. For accelerators, shared virtual memory (VM) promises to simplify programming and provide safe data sharing with CPUs. Unfortunately, the overheads of ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018Best Paper
Darwin: A Genomics Co-processor Provides up to 15,000X Acceleration on Long Read Assembly
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 199–213https://doi.org/10.1145/3173162.3173193Genomics is transforming medicine and our understanding of life in fundamental ways. Genomics data, however, is far outpacing Moore»s Law. Third-generation sequencing technologies produce 100X longer reads than second generation technologies and reveal ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
SmoothOperator: Reducing Power Fragmentation and Improving Power Utilization in Large-scale Datacenters
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 535–548https://doi.org/10.1145/3173162.3173190With the ever growing popularity of cloud computing and web services, Internet companies are in need of increased computing capacity to serve the demand. However, power has become a major limiting factor prohibiting the growth in industry: it is often ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
Exploiting Dynamic Thermal Energy Harvesting for Reusing in Smartphone with Mobile Applications
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 243–256https://doi.org/10.1145/3173162.3173188Recently, mobile applications have gradually become performance- and resource- intensive, which results in a massive battery power drain and high surface temperature, and further degrades the user experience. Thus, high power consumption and surface ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
CALOREE: Learning Control for Predictable Latency and Low Energy
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 184–198https://doi.org/10.1145/3173162.3173184Many modern computing systems must provide reliable latency with minimal energy. Two central challenges arise when allocating system resources to meet these conflicting goals: (1) complexity modern hardware exposes diverse resources with complicated ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
Time Dilation and Contraction for Programmable Analog Devices with Jaunt
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 229–242https://doi.org/10.1145/3173162.3173179Programmable analog devices are a powerful new computing substrate that are especially appropriate for performing computationally intensive simulations of neuromorphic and cytomorphic models. Current state of the art techniques for configuring analog ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
Google Workloads for Consumer Devices: Mitigating Data Movement Bottlenecks
- Amirali Boroumand,
- Saugata Ghose,
- Youngsok Kim,
- Rachata Ausavarungnirun,
- Eric Shiu,
- Rahul Thakur,
- Daehyun Kim,
- Aki Kuusela,
- Allan Knies,
- Parthasarathy Ranganathan,
- Onur Mutlu
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 316–331https://doi.org/10.1145/3173162.3173177We are experiencing an explosive growth in the number of consumer devices, including smartphones, tablets, web-based computers such as Chromebooks, and wearable devices. For this class of devices, energy efficiency is a first-class concern due to the ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
MAERI: Enabling Flexible Dataflow Mapping over DNN Accelerators via Reconfigurable Interconnects
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 461–475https://doi.org/10.1145/3173162.3173176Deep neural networks (DNN) have demonstrated highly promising results across computer vision and speech recognition, and are becoming foundational for ubiquitous AI. The computational complexity of these algorithms and a need for high energy-efficiency ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
In-Memory Data Parallel Processor
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 1–14https://doi.org/10.1145/3173162.3173171Recent developments in Non-Volatile Memories (NVMs) have opened up a new horizon for in-memory computing. Despite the significant performance gain offered by computational NVMs, previous works have relied on manual mapping of specialized kernels to the ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2 - research-articleMarch 2018
Liquid Silicon-Monona: A Reconfigurable Memory-Oriented Computing Fabric with Scalable Multi-Context Support
ASPLOS '18: Proceedings of the Twenty-Third International Conference on Architectural Support for Programming Languages and Operating SystemsPages 214–228https://doi.org/10.1145/3173162.3173167With the recent trend of promoting Field-Programmable Gate Arrays (FPGAs) to first-class citizens in accelerating compute-intensive applications in networking, cloud services and artificial intelligence, FPGAs face two major challenges in sustaining ...
Also Published in:
ACM SIGPLAN Notices: Volume 53 Issue 2