default search action
ASPLOS 2023: Vancouver, BC, Canada
- Tor M. Aamodt, Natalie D. Enright Jerger, Michael M. Swift:
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, ASPLOS 2023, Vancouver, BC, Canada, March 25-29, 2023. ACM 2023, ISBN 978-1-4503-9918-0
Keynotes
- Abhishek Bhattacharjee:
Direct Mind-Machine Teaming (Keynote). 1 - Bryan Catanzaro:
Language Models: The Most Important Compute Challenge of Our Time (Keynote). 2
Papers
- Boyu Tian, Qihang Chen, Mingyu Gao:
ABNDP: Co-optimizing Data Access and Load Balance in Near-Data Processing. 3-17 - Toluwanimi O. Odemuyiwa, Hadi Asghari Moghaddam, Michael Pellauer, Kartik Hegde, Po-An Tsai, Neal Clayton Crago, Aamer Jaleel, John D. Owens, Edgar Solomonik, Joel S. Emer, Christopher W. Fletcher:
Accelerating Sparse Data Orchestration via Dynamic Reflexive Tiling. 18-32 - Jackson Melchert, Kathleen Feng, Caleb Donovick, Ross Daly, Ritvik Sharma, Clark W. Barrett, Mark A. Horowitz, Pat Hanrahan, Priyanka Raina:
APEX: A Framework for Automated Processing Element Design Space Exploration using Frequent Subgraph Analysis. 33-45 - Lin Cheng, Max Ruttenberg, Dai Cheol Jung, Dustin Richmond, Michael B. Taylor, Mark Oskin, Christopher Batten:
Beyond Static Parallel Loops: Supporting Dynamic Task Parallelism on Manycore Architectures with Software-Managed Scratchpad Memories. 46-58 - Fei Hua, Yuwei Jin, Yan-Hao Chen, Suhas Vittal, Kevin Krsulich, Lev S. Bishop, John Lapeyre, Ali Javadi-Abhari, Eddy Z. Zhang:
CaQR: A Compiler-Assisted Approach for Qubit Reuse through Dynamic Circuit. 59-71 - Xiangyu Gao, Divya Raghunathan, Ruijie Fang, Tao Wang, Xiaotong Zhu, Anirudh Sivaraman, Srinivas Narayana, Aarti Gupta:
CaT: A Solver-Aided Compiler for Packet-Processing Pipelines. 72-88 - Karthik Garimella, Zahra Ghodsi, Nandan Kumar Jha, Siddharth Garg, Brandon Reagen:
Characterizing and Optimizing End-to-End Systems for Private Inference. 89-104 - Tianrui Wei, Nazerke Turtayeva, Marcelo Orenes-Vera, Omkar Lonkar, Jonathan Balkind:
Cohort: Software-Oriented Acceleration for Heterogeneous SoCs. 105-117 - Raghav Malik, Kabir Sheth, Milind Kulkarni:
Coyote: A Compiler for Vectorizing Encrypted Arithmetic Circuits. 118-133 - Edward Hanson, Mark Horton, Hai (Helen) Li, Yiran Chen:
DefT: Boosting Scalability of Deformable Convolution Operations on GPUs. 134-146 - Junyi Shu, Ruidong Zhu, Yun Ma, Gang Huang, Hong Mei, Xuanzhe Liu, Xin Jin:
Disaggregated RAID Storage in Modern Datacenters. 147-163 - Mao Lin, Keren Zhou, Pengfei Su:
DrGPUM: Guiding Memory Optimization for GPU-Accelerated Applications. 164-178 - Ashwini Raina, Jianan Lu, Asaf Cidon, Michael J. Freedman:
Efficient Compactions between Storage Tiers with PrismDB. 179-193 - Teng Ma, Shanpei Chen, Yihao Wu, Erwei Deng, Zhuo Song, Quan Chen, Minyi Guo:
Efficient Scheduler Live Update for Linux Kernel with Modularization. 194-207 - Alessandro Rivitti, Roberto Bifulco, Angelo Tulumello, Marco Bonola, Salvatore Pontarelli:
eHDL: Turning eBPF/XDP Programs into Hardware Designs for the NIC. 208-223 - Kenichi Yasukata, Hajime Tazaki, Pierre-Louis Aublin:
Exit-Less, Isolated, and Shared Access for Virtual Machines. 224-237 - Shaohua Li, Zhendong Su:
Finding Unstable Code via Compiler-Driven Differential Testing. 238-251 - Francisco Muñoz-Martínez, Raveesh Garg, Michael Pellauer, José L. Abellán, Manuel E. Acacio, Tushar Krishna:
Flexagon: A Multi-dataflow Sparse-Sparse Matrix Multiplication Accelerator for Efficient DNN Processing. 252-265 - Shravan Narayan, Tal Garfinkel, Mohammadkazem Taram, Joey Rudek, Daniel Moghimi, Evan Johnson, Chris Fallin, Anjo Vahldiek-Oberwagner, Michael LeMay, Ravi Sahita, Dean M. Tullsen, Deian Stefan:
Going beyond the Limits of SFI: Flexible and Secure Hardware-Assisted In-Process Isolation with HFI. 266-281 - Haojie Ye, Sanketh Vedula, Yuhan Chen, Yichen Yang, Alex M. Bronstein, Ronald G. Dreslinski, Trevor N. Mudge, Nishil Talati:
GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference. 282-301 - Bastian Hagedorn, Bin Fan, Hanfeng Chen, Cris Cecka, Michael Garland, Vinod Grover:
Graphene: An IR for Optimized Tensor Computations on GPUs. 302-313 - Jun Bi, Qi Guo, Xiaqing Li, Yongwei Zhao, Yuanbo Wen, Yuxuan Guo, Enshuai Zhou, Xing Hu, Zidong Du, Ling Li, Huaping Chen, Tianshi Chen:
Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators. 314-328 - Tushar Swamy, Annus Zulfiqar, Luigi Nardi, Muhammad Shahbaz, Kunle Olukotun:
Homunculus: Auto-Generating Efficient Data-Plane ML Pipelines for Datacenter Networks. 329-342 - Sheng Li, Garrett Andersen, Tao Chen, Liqun Cheng, Julian Grady, Da Huang, Quoc V. Le, Andrew Li, Xin Li, Yang Li, Chen Liang, Yifeng Lu, Yun Ni, Ruoming Pang, Mingxing Tan, Martin Wicke, Gang Wu, Shengqi Zhu, Parthasarathy Ranganathan, Norman P. Jouppi:
Hyperscale Hardware Optimized Neural Architecture Search. 343-358 - Zhengrong Wang, Christopher Liu, Aman Arora, Lizy Kurian John, Tony Nowatzki:
Infinity Stream: Portable and Programmer-Friendly In-/Near-Memory Fusion. 359-375 - Shuo Liu, Qiaoling Wang, Junyi Zhang, Wenfei Wu, Qinliang Lin, Yao Liu, Meng Xu, Marco Canini, Ray C. C. Cheung, Jianfei He:
In-Network Aggregation with Transport Transparency for Distributed Training. 376-391 - Bradley Denby, Krishna Chintalapudi, Ranveer Chandra, Brandon Lucia, Shadi A. Noghabi:
Kodan: Addressing the Computational Bottleneck in Space. 392-403 - Chong Zhang, Songfan Li, Yihang Song, Qianhe Meng, Minghua Chen, Yanxu Bai, Li Lu, Hongzi Zhu:
LEGO: Empowering Chip-Level Functionality Plug-and-Play for Next-Generation IoT Devices. 404-418 - Ouwen Jin, Qinghui Xing, Ying Li, Shuiguang Deng, Shuibing He, Gang Pan:
Mapping Very Large Scale Spiking Neuron Network to Neuromorphic Hardware. 419-432 - Krishnan Gosakan, Jaehyun Han, William Kuszmaul, Ibrahim N. Mubarek, Nirjhar Mukherjee, Karthik Sriram, Guido Tagliavini, Evan West, Michael A. Bender, Abhishek Bhattacharjee, Alex Conway, Martin Farach-Colton, Jayneel Gandhi, Rob Johnson, Sudarsun Kannan, Donald E. Porter:
Mosaic Pages: Big TLB Reach with Small Pages. 433-448 - Samuel Hsia, Udit Gupta, Bilge Acun, Newsha Ardalani, Pan Zhong, Gu-Yeon Wei, David Brooks, Carole-Jean Wu:
MP-Rec: Hardware-Software Co-design to Enable Multi-path Recommendation. 449-465 - Shuke Wang, Mingxing Zhang, Ke Yang, Kang Chen, Shaonan Ma, Jinlei Jiang, Yongwei Wu:
NosWalker: A Decoupled Architecture for Out-of-Core Random Walk Processing. 466-482 - Zhongcheng Zhang, Yan Ou, Ying Liu, Chenxi Wang, Yongbin Zhou, Xiaoyu Wang, Yuyang Zhang, Yucheng Ouyang, Jiahao Shan, Ying Wang, Jingling Xue, Huimin Cui, Xiaobing Feng:
Occamy: Elastically Sharing a SIMD Co-processor across Multiple CPU Cores. 483-497 - Chaoyi Ruan, Yingqiang Zhang, Chao Bi, Xiaosong Ma, Hao Chen, Feifei Li, Xinjun Yang, Cheng Li, Ashraf Aboulnaga, Yinlong Xu:
Persistent Memory Disaggregation for Cloud-Native Relational Databases. 498-512 - Chase Norman, Adwait Godbole, Yatin A. Manerkar:
PipeSynth: Automated Synthesis of Microarchitectural Axioms for Memory Consistency. 513-527 - Christopher Jelesnianski, Mohannad Ismail, Yeongjin Jang, Dan Williams, Changwoo Min:
Protect the System Call, Protect (Most of) the World with BASTION. 528-541 - Mohammadamin Ajdari, Pouria Peykani Sani, Amirhossein Moradi, Masoud Khanalizadeh Imani, Amir Hossein Bazkhanei, Hossein Asadi:
Re-architecting I/O Caches for Emerging Fast Storage Devices. 542-555 - Joshua Landgraf, Matthew Giordano, Esther Yoon, Christopher J. Rossbach:
Reconfigurable Virtual Memory for FPGA-Driven I/O. 556-571 - Haoyuan Wang, Scott Beamer:
RepCut: Superlinear Parallel RTL Simulation with Replication-Aided Partitioning. 572-585 - Moein Khazraee, Alex Forencich, George C. Papen, Alex C. Snoeren, Aaron Schulman:
Rosebud: Making FPGA-Accelerated Middlebox Development More Pleasant. 586-605 - Kevin Laeufer, Vighnesh Iyer, David Biancolin, Jonathan Bachrach, Borivoje Nikolic, Koushik Sen:
Simulator Independent Coverage for RTL Hardware Languages. 606-615 - Blaise Tine, Varun Saxena, Santosh Srivatsan, Joshua R. Simpson, Fadi Alzammar, Liam Cooper, Hyesoon Kim:
Skybox: Open-Source Graphic Rendering on Programmable RISC-V GPUs. 616-630 - Fangkai Yang, Lu Wang, Zhenyu Xu, Jue Zhang, Liqun Li, Bo Qiao, Camille Couturier, Chetan Bansal, Soumya Ram, Si Qin, Zhen Ma, Íñigo Goiri, Eli Cortez, Terry Yang, Victor Rühle, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang:
Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs. 631-643 - Jiesong Liu, Feng Zhang, Jiawei Guan, Hsin-Hsuan Sung, Xiaoguang Guo, Xiaoyong Du, Xipeng Shen:
Space-Efficient TREC for Enabling Deep Learning on Microcontrollers. 644-659 - Zihao Ye, Ruihang Lai, Junru Shao, Tianqi Chen, Luis Ceze:
SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning. 660-678 - Zujun Tan, Yebin Chon, Michael Kruse, Johannes Doerfert, Ziyang Xu, Brian Homerding, Simone Campanoni, David I. August:
SPLENDID: Supporting Parallel LLVM-IR Enhanced Natural Decompilation for Interactive Development. 679-693 - Iacovos G. Kolokasis, Giannos Evdorou, Shoaib Akram, Christos Kozanitis, Anastasios Papagiannis, Foivos S. Zakkak, Polyvios Pratikakis, Angelos Bilas:
TeraHeap: Reducing Memory Pressure in Managed Big Data Frameworks. 694-709 - Olivia Hsu, Maxwell Strange, Ritvik Sharma, Jaeyeon Won, Kunle Olukotun, Joel S. Emer, Mark A. Horowitz, Fredrik Kjølstad:
The Sparse Abstract Machine. 710-726 - Padmapriya Duraisamy, Wei Xu, Scott Hare, Ravi Rajwar, David E. Culler, Zhiyi Xu, Jianing Fan, Christopher Kennelly, Bill McCloskey, Danijela Mijailovic, Brian Morris, Chiranjit Mukherjee, Jingliang Ren, Greg Thelen, Paul Turner, Carlos Villavieja, Parthasarathy Ranganathan, Amin Vahdat:
Towards an Adaptable Systems Architecture for Memory Tiering at Warehouse-Scale. 727-741 - Hasan Al Maruf, Hao Wang, Abhishek Dhanotia, Johannes Weiner, Niket Agarwal, Pallab Bhattacharya, Chris Petersen, Mosharaf Chowdhury, Shobhit O. Kanaujia, Prakash Chauhan:
TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory. 742-755 - Zizhan Chen, Zili Shao:
Transparent Runtime Change Handling for Android Apps. 756-770 - Zirui Neil Zhao, Adam Morrison, Christopher W. Fletcher, Josep Torrellas:
Untangle: A Principled Framework to Design Low-Leakage, High-Performance Dynamic Partitioning Schemes. 771-788 - Yuan Feng, Yingte Xu:
Verification of Nondeterministic Quantum Programs. 789-805 - Gefei Zuo, Jiacheng Ma, Andrew Quinn, Baris Kasikci:
Vidi: Record Replay for Reconfigurable Hardware. 806-820
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.