default search action
ASPLOS 2024: La Jolla, CA, USA
- Rajiv Gupta, Nael B. Abu-Ghazaleh, Madan Musuvathi, Dan Tsafrir:
Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, ASPLOS 2024, La Jolla, CA, USA, 27 April 2024- 1 May 2024. ACM 2024 - Junpyo Kim, Dongmoon Min, Jungmin Cho, Hyeonseong Jeong, Ilkwon Byun, Junhyuk Choi, Juwon Hong, Jangwoo Kim:
A Fault-Tolerant Million Qubit-Scale Distributed Quantum Computer. 1-19 - Michael Davies, Ian McDougall, Selvaraj Anandaraj, Deep Machchhar, Rithik Jain, Karthikeyan Sankaralingam:
A Journey of a 1, 000 Kernels Begins with a Single Step: A Retrospective of Deep Learning on GPUs. 20-36 - Reese Kuper, Ipoom Jeong, Yifan Yuan, Ren Wang, Narayan Ranganathan, Nikhil Rao, Jiayu Hu, Sanjay Kumar, Philip Lantz, Nam Sung Kim:
A Quantitative Analysis and Guidelines of Data Streaming Accelerator in Modern Intel Xeon Scalable Processors. 37-54 - Min Ye, Qiao Li, Yina Lv, Jie Zhang, Tianyu Ren, Daniel Wen, Tei-Wei Kuo, Chun Jason Xue:
Achieving Near-Zero Read Retry for 3D NAND Flash Memory. 55-70 - Yixun Wei, Bingzhe Li, David H. C. Du:
An Encoding Scheme to Enlarge Practical DNA Storage Capacity by Reducing Primer-Payload Collisions. 71-84 - Alberto Delmas Lascorz, Mostafa Mahmoud, Ali Hadi Zadeh, Milos Nikolic, Kareem Ibrahim, Christina Giannoula, Ameer Abdelhadi, Andreas Moshovos:
Atalanta: A Bit is Worth a "Thousand" Tensor Values. 85-102 - Jaehyun Park, Jaewan Choi, Kwanhee Kyung, Michael Jaemin Kim, Yongsuk Kwon, Nam Sung Kim, Jung Ho Ahn:
AttAcc! Unleashing the Power of PIM for Batched Transformer-based Generative Model Inference. 103-119 - Michael Flanders, Reshabh K. Sharma, Alexandra E. Michael, Dan Grossman, David Kohlbrenner:
Avoiding Instruction-Centric Microarchitectural Timing Channels Via Binary-Code Transformations. 120-136 - Nikola Samardzic, Daniel Sánchez:
BitPacker: Enabling High Arithmetic Efficiency in Fully Homomorphic Encryption Accelerators. 137-150 - Ziyuan Wen, Lingkun Kong, Alexis Le Glaunec, Konstantinos Mamouras, Kaiyuan Yang:
BVAP: Energy and Memory Efficient Automata Processing for Regular Expressions with Bounded Repetitions. 151-166 - Zhewen Pan, Joshua San Miguel, Di Wu:
Carat: Unlocking Value-Level Parallelism for Multiplier-Free GEMMs. 167-184 - Songyun Qu, Shixin Zhao, Bing Li, Yintao He, Xuyi Cai, Lei Zhang, Ying Wang:
CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators. 185-200 - Zhuoran Song, Chunyu Qi, Fangxin Liu, Naifeng Jing, Xiaoyao Liang:
CMC: Video Transformer Acceleration via CODEC Assisted Matrix Condensing. 201-215 - Sophia Fuhui Lin, Joshua Viszlai, Kaitlin N. Smith, Gokul Subramanian Ravi, Charles Yuan, Frederic T. Chong, Benjamin J. Brown:
Codesign of quantum error-correcting codes and modular chiplets in the presence of defects. 216-231 - Yian Su, Mike Rainey, Nick Wanninger, Nadharm Dhiantravan, Jasper Liang, Umut A. Acar, Peter A. Dinda, Simone Campanoni:
Compiling Loop-Based Nested Parallelism for Irregular Workloads. 232-250 - Nathaniel Wesley Filardo, Brett F. Gutstein, Jonathan Woodruff, Jessica Clarke, Peter Rugg, Brooks Davis, Mark Johnston, Robert M. Norton, David Chisnall, Simon W. Moore, Peter G. Neumann, Robert N. M. Watson:
Cornucopia Reloaded: Load Barriers for CHERI Heap Temporal Safety. 251-268 - Yu-Neng Wang, Glenn E. R. Cowan, Ulrich Rührmair, Sara Achour:
Design of Novel Analog Compute Paradigms with Ark. 269-286 - Jiyuan Zhang, Weiwei Jia, Siyuan Chai, Peizhe Liu, Jongyul Kim, Tianyin Xu:
Direct Memory Translation for Virtualized Clouds. 287-304 - Zhihong Luo, Sam Son, Dev Bali, Emmanuel Amaro, Amy Ousterhout, Sylvia Ratnasamy, Scott Shenker:
Efficient Microsecond-scale Blind Scheduling with Tiny Quanta. 305-319 - Yuhong Wen, Xiaogang Zhao, You Zhou, Tong Zhang, Shangjun Yang, Changsheng Xie, Fei Wu:
Eliminating Storage Management Overhead of Deduplication over SSD Arrays Through a Hardware/Software Co-Design. 320-335 - Sashwat Anagolum, Narges Alavisamani, Poulami Das, Moinuddin K. Qureshi, Yunong Shi:
Elivagar: Efficient Quantum Circuit Search for Classification. 336-353 - Rhys Gretsch, Peiyang Song, Advait Madhavan, Jeremy Lau, Timothy Sherwood:
Energy Efficient Convolutions with Temporal Arithmetic. 354-368 - Hyungjun Oh, Kihong Kim, Jaemin Kim, Sungkyun Kim, Junyeol Lee, Du-Seong Chang, Jiwon Seo:
ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference. 369-384 - Yushi Liu, Shixuan Sun, Zijun Li, Quan Chen, Sen Gao, Bingsheng He, Chao Li, Minyi Guo:
FaaSGraph: Enabling Scalable, Efficient, and Cost-Effective Graph Processing with Serverless Computing. 385-400 - Lieven Eeckhout:
FOCAL: A First-Order Carbon Model to Assess Processor Sustainability. 401-415 - Gus Henry Smith, Benjamin Kushigian, Vishal Canumalla, Andrew Cheung, Steven Lyubomirsky, Sorawee Porncharoenwase, René Just, Gilbert Louis Bernstein, Zachary Tatlock:
FPGA Technology Mapping Using Sketch-Guided Program Synthesis. 416-432 - Hao Ling, Heqing Huang, Chengpeng Wang, Yuandao Cai, Charles Zhang:
GIANTSAN: Efficient Memory Sanitization with Segment Folding. 433-449 - Cong Guo, Rui Zhang, Jiale Xu, Jingwen Leng, Zihan Liu, Ziyu Huang, Minyi Guo, Hao Wu, Shouren Zhao, Junping Zhao, Ke Zhang:
GMLake: Efficient and Transparent GPU Memory Defragmentation for Large-scale DNN Training with Virtual Memory Stitching. 450-466 - Tsun-Yu Yang, Cale England, Yi Li, Bingzhe Li, Ming-Chang Yang:
Grafu: Unleashing the Full Potential of Future Value Computation for Out-of-core Synchronous Graph Processing. 467-481 - Dylan Wolff, Zheng Shi, Gregory J. Duck, Umang Mathur, Abhik Roychoudhury:
Greybox Fuzzing for Concurrency Testing. 482-498 - Zizhao Mo, Huanle Xu, Chengzhong Xu:
Heet: Accelerating Elastic Training in Heterogeneous Deep Learning Clusters. 499-513 - Akash Kothari, Abdul Rafae Noor, Muchen Xu, Hassam Uddin, Dhruv Baronia, Stefanos Baziotis, Vikram S. Adve, Charith Mendis, Sudipta Sengupta:
Hydride: A Retargetable and Extensible Synthesis-based Compiler for Modern Hardware Architectures. 514-529 - Rohan Mahapatra, Soroush Ghodrati, Byung Hoon Ahn, Sean Kinzer, Shu-Ting Wang, Hanyang Xu, Lavanya Karthikeyan, Hardik Sharma, Amir Yazdanbakhsh, Mohammad Alian, Hadi Esmaeilzadeh:
In-Storage Domain-Specific Acceleration for Serverless Computing. 530-548 - Zihan Liu, Wentao Ni, Jingwen Leng, Yu Feng, Cong Guo, Quan Chen, Chao Li, Minyi Guo, Yuhao Zhu:
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping. 549-565 - Hochan Lee, Roshan Dathathri, Keshav Pingali:
Kimbap: A Node-Property Map System for Distributed Graph Analytics. 566-581 - Zirui Neil Zhao, Adam Morrison, Christopher W. Fletcher, Josep Torrellas:
Last-Level Cache Side-Channel Attacks Are Feasible in the Modern Public Cloud. 582-600 - Yuanyi Zhang, Heng Zhang, Wenbin Cao, Xing He, Daejun Park, Jinyoung Choi, SungJun Park:
LazyBarrier: Reconstructing Android IO Stack for Barrier-Enabled Flash Storage. 601-615 - Juntaek Lim, Youngeun Kwon, Ranggi Hwang, Kiwan Maeng, G. Edward Suh, Minsoo Rhu:
LazyDP: Co-Designing Algorithm-Software for Scalable Training of Differentially Private Recommendation Models. 616-630 - Adwait Godbole, Kevin Cheang, Yatin A. Manerkar, Sanjit A. Seshia:
Lifting Micro-Update Models from RTL for Formal Security Analysis. 631-648 - Zachary Yedidia:
Lightweight Fault Isolation: Practical, Efficient, and Secure Software Sandboxing. 649-665 - Eugene Sha, Andy Liu, Kareem Ibrahim, Mostafa Mahmoud, Christina Giannoula, Ameer Abdelhadi, Andreas Moshovos:
Marple: Scalable Spike Sorting for Untethered Brain-Machine Interfacing. 666-682 - Hongwu Peng, Xi Xie, Kaustubh Shivdikar, Md Amit Hasan, Jiahui Zhao, Shaoyi Huang, Omer Khan, David R. Kaeli, Caiwen Ding:
MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training. 683-698 - Hezi Zhang, Keyi Yin, Anbang Wu, Hassan Shapourian, Alireza Shabani, Yufei Ding:
MECH: Multi-Entry Communication Highway for Superconducting Quantum Chiplets. 699-714 - Anagha Molakalmur Anil Kumar, Aditya Prasanna, Jonathan Balkind, Arrvindh Shriraman:
METAL: Caching Multi-level Indexes in Domain-Specific Architectures. 715-729 - Nuntipat Narkthong, Shijin Duan, Shaolei Ren, Xiaolin Xu:
MicroVSA: An Ultra-Lightweight Vector Symbolic Architecture-based Classifier Library for Always-On Inference on Tiny Microcontrollers. 730-745 - Zishen Wan, Nandhini Chandramoorthy, Karthik Swaminathan, Pin-Yu Chen, Kshitij Bhardwaj, Vijay Janapa Reddi, Arijit Raychowdhury:
MulBERRY: Enabling Bit-Error Robustness for Energy-Efficient Multi-Agent Autonomous Systems. 746-762 - Jia-Ju Bai, Haoxuan Song, Shimin Hu:
Multi-Dimensional and Message-Guided Fuzzing for Robotic Programs in Robot Operating System. 763-778 - Jianxin Chen, Dawei Ding, Weiyuan Gong, Cupjin Huang, Qi Ye:
One Gate Scheme to Rule Them All: Introducing a Complex Yet Reduced Instruction Set for Quantum Computing. 779-796 - Feng Yu, Guangli Li, Jiacheng Zhao, Huimin Cui, Xiaobing Feng, Jingling Xue:
Optimizing Dynamic-Shape Neural Networks on Accelerators via On-the-Fly Micro-Kernel Polymerization. 797-812 - Yuhui Hao, Yiming Gan, Bo Yu, Qiang Liu, Yinhe Han, Zishen Wan, Shaoshan Liu:
ORIANNA: An Accelerator Generation Framework for Optimization-based Robotic Applications. 813-829 - Hongming Huang, Peng Wang, Qiang Su, Hong Xu, Chun Jason Xue, André Brinkmann:
Palantir: Hierarchical Similarity Detection for Post-Deduplication Delta Compression. 830-845 - Bhargav Reddy Godala, Sankara Prasad Ramesh, Gilles A. Pokam, Jared Stark, André Seznec, Dean M. Tullsen, David I. August:
PDIP: Priority Directed Instruction Prefetching. 846-861 - Colin Drewes, Olivia Weng, Andres Meza, Alric Althoff, David Kohlbrenner, Ryan Kastner, Dustin Richmond:
Pentimento: Data Remanence in Cloud FPGAs. 862-878 - Cong Li, Zhe Zhou, Yang Wang, Fan Yang, Ting Cao, Mao Yang, Yun Liang, Guangyu Sun:
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization. 879-896 - André Lopes, Daniel Castro, Paolo Romano:
PIM-STM: Software Transactional Memory for Processing-In-Memory Systems. 897-911 - Anshunkang Zhou, Chengfeng Ye, Heqing Huang, Yuandao Cai, Charles Zhang:
Plankton: Reconciling Binary Code and Debug Information. 912-928 - Jason Ansel, Edward Z. Yang, Horace He, Natalia Gimelshein, Animesh Jain, Michael Voznesensky, Bin Bao, Peter Bell, David Berard, Evgeni Burovski, Geeta Chauhan, Anjali Chourdia, Will Constable, Alban Desmaison, Zachary DeVito, Elias Ellison, Will Feng, Jiong Gong, Michael Gschwind, Brian Hirsh, Sherlock Huang, Kshiteej Kalambarkar, Laurent Kirsch, Michael Lazos, Mario Lezcano, Yanbo Liang, Jason Liang, Yinghai Lu, C. K. Luk, Bert Maher, Yunjie Pan, Christian Puhrsch, Matthias Reso, Mark Saroufim, Marcos Yukio Siraichi, Helen Suk, Shunting Zhang, Michael Suo, Phil Tillet, Xu Zhao, Eikan Wang, Keren Zhou, Richard Zou, Xiaodong Wang, Ajit Mathews, William Wen, Gregory Chanan, Peng Wu, Soumith Chintala:
PyTorch 2: Faster Machine Learning Through Dynamic Python Bytecode Transformation and Graph Compilation. 929-947 - Siwei Tan, Liqiang Lu, Hanyu Zhang, Jia Yu, Congliang Lang, Yongheng Shang, Xinkui Zhao, Mingshuai Chen, Yun Liang, Jianwei Yin:
QuFEM: Fast and Accurate Quantum Readout Calibration Using the Finite Element Method. 948-963 - Zheng Wang, Yuke Wang, Jiaqi Deng, Da Zheng, Ang Li, Yufei Ding:
RAP: Resource-aware Automated GPU Sharing for Multi-GPU Recommendation Model Training and Input Preprocessing. 964-979 - Meng Wang, Bo Fang, Ang Li, Prashant J. Nair:
Red-QAOA: Efficient Variational Optimization through Circuit Reduction. 980-998 - Yuxuan Zhang, Nathan Sobotka, Soyoon Park, Saba Jamilan, Tanvir Ahmed Khan, Baris Kasikci, Gilles A. Pokam, Heiner Litz, Joseph Devietti:
RPG2: Robust Profile-Guided Runtime Prefetch Generation. 999-1013 - Anish Saxena, Saurav Mathur, Moinuddin K. Qureshi:
Rubix: Reducing the Overhead of Secure Rowhammer Mitigations via Randomized Line-to-Row Mapping. 1014-1028 - Jianyi Cheng, Samuel Coward, Lorenzo Chelini, Rafael Barbalho, Theo Drane:
SEER: Super-Optimization Explorer for High-Level Synthesis using E-graph Rewriting. 1029-1044 - Benjamin Holmes, Jason Waterman, Dan Williams:
SEVeriFast: Minimizing the root of trust for fast startup of SEV microVMs. 1045-1060 - Erhu Feng, Dahu Feng, Dong Du, Yubin Xia, Wenbin Zheng, Siqi Zhao, Haibo Chen:
sIOPMP: Scalable and Efficient I/O Protection for TEEs. 1061-1076 - Shashank Anand, Michal Friedman, Michael Giardino, Gustavo Alonso:
Skip It: Take Control of Your Cache! 1077-1094 - Hongzheng Chen, Cody Hao Yu, Shuai Zheng, Zhen Zhang, Zhiru Zhang, Yida Wang:
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training. 1095-1111 - Xupeng Miao, Chunan Shi, Jiangfei Duan, Xiaoli Xi, Dahua Lin, Bin Cui, Zhihao Jia:
SpotServe: Serving Generative Large Language Models on Preemptible Instances. 1112-1127 - Jonas Juffinger, Stepan Kalinin, Daniel Gruss, Frank Mueller:
SUIT: Secure Undervolting with Instruction Traps. 1128-1145 - Suchita Pati, Shaizeen Aga, Mahzabeen Islam, Nuwan Jayasena, Matthew D. Sinclair:
T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives. 1146-1164 - Soroush Ghodrati, Sean Kinzer, Hanyang Xu, Rohan Mahapatra, Yoonsung Kim, Byung Hoon Ahn, Dong Kai Wang, Lavanya Karthikeyan, Amir Yazdanbakhsh, Jongse Park, Nam Sung Kim, Hadi Esmaeilzadeh:
Tandem Processor: Grappling with Emerging Operators in Neural Networks. 1165-1182 - Yufeng Wang, Charith Mendis:
TGLite: A Lightweight Programming Framework for Continuous-Time Temporal Graph Neural Networks. 1183-1199 - Charles Block, Gerasimos Gerogiannis, Charith Mendis, Ariful Azad, Josep Torrellas:
Two-Face: Combining Collective and One-Sided Communication for Efficient Distributed SpMM. 1200-1217 - Zhenyang Dai, Shuang Liu, Vilhelm Sjöberg, Xupeng Li, Yu Chen, Wenhao Wang, Yuekai Jia, Sean Noble Anderson, Laila Elbeheiry, Shubham Sondhi, Yu Zhang, Zhaozhong Ni, Shoumeng Yan, Ronghui Gu, Zhengyu He:
Verifying Rust Implementation of Page Tables in a Software Enclave Hypervisor. 1218-1232 - Hongliang Qu, Zhibin Yu:
WASP: Workload-Aware Self-Replicating Page-Tables for NUMA Servers. 1233-1249 - Fabian Parzefall, Chinmay Deshpande, Felicitas Hetzelt, Michael Franz:
What You Trace is What You Get: Dynamic Stack-Layout Recovery for Binary Recompilation. 1250-1263
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.