53rd MICRO 2020: Athens, Greece
- 53rd Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2020, Athens, Greece, October 17-21, 2020. IEEE 2020, ISBN 978-1-7281-7383-2
Session 1A: Security and Privacy I
- Yeonhong Park, Woosuk Kwon, Eojin Lee, Tae Jun Ham, Jung Ho Ahn, Jae W. Lee:
Graphene: Strong yet Lightweight Row Hammer Protection. 1-13
- Alexander Freij, Shougang Yuan, Huiyang Zhou, Yan Solihin:
Persist Level Parallelism: Streamlining Integrity Tree Updates for Secure Persistent Memory. 14-27
- Zhi Zhang, Yueqiang Cheng, Dongxi Liu, Surya Nepal, Zhi Wang, Yuval Yarom:
PThammer: Cross-User-Kernel-Boundary Rowhammer through Implicit Accesses. 28-41
- Dimitrios Skarlatos, Qingrong Chen, Jianyan Chen, Tianyin Xu, Josep Torrellas:
Draco: Architectural and Operating System Support for System Call Security. 42-57
Session 1B: Machine Learning Accelerators with New Technologies
- Koki Ishida, Ilkwon Byun, Ikki Nagaoka, Kosuke Fukumitsu, Masamitsu Tanaka, Satoshi Kawakami, Teruo Tanimoto, Takatsugu Ono, Jangwoo Kim, Koji Inoue:
SuperNPU: An Extremely Fast Neural Processing Unit Using Superconducting Logic Devices. 58-72
- Muhammad Husnain Mubarik, Dennis D. Weller, Nathaniel Bleier, Matthew Tomei, Jasmin Aghassi-Hagmann, Mehdi B. Tahoori, Rakesh Kumar:
Printed Machine Learning Classifiers. 73-87
- Akshay Krishna Ramanathan, Gurpreet S. Kalsi, Srivatsa Srinivasa, Tarun Makesh Chandran, Kamlesh R. Pillai, Om Ji Omer, Vijaykrishnan Narayanan, Sreenivas Subramoney:
Look-Up Table based Energy Efficient Processing in Cache Support for Neural Network Acceleration. 88-101
- Ashutosh Dhar, Xiaohao Wang, Hubertus Franke, Jinjun Xiong, Jian Huang, Wen-Mei W. Hwu, Nam Sung Kim, Deming Chen:
FReaC Cache: Folded-logic Reconfigurable Computing in the Last Level Cache. 102-117
Session 1C: Microarchitecture I
- Siavash Zangeneh, Stephen Pruett, Sangkug Lym, Yale N. Patt:
BranchNet: A Convolutional Neural Network to Predict Hard-To-Predict Branches. 118-130
- Samira Mirbagher Ajorpaz, Elba Garza, Gilles Pokam, Daniel A. Jiménez:
CHiRP: Control-Flow History Reuse Prediction. 131-145
- Tanvir Ahmed Khan, Akshitha Sriraman, Joseph Devietti, Gilles Pokam, Heiner Litz, Baris Kasikci:
I-SPY: Context-Driven Conditional Instruction Prefetching with Coalescing. 146-159
- Jagadish B. Kotra, John Kalamatianos:
Improving the Utilization of Micro-operation Caches in x86 Processors. 160-172
Session 2A: Quantum Computing
- Casey Duckering, Jonathan M. Baker, David I. Schuster, Frederic T. Chong:
Virtualized Logical Qubits: A 2.5D Architecture for Error-Corrected Quantum Computing. 173-185
- Pranav Gokhale, Ali Javadi-Abhari, Nathan Earnest, Yunong Shi, Frederic T. Chong:
Optimized Quantum Compilation for Near-Term Algorithms with OpenPulse. 186-200
- Yongshan Ding, Pranav Gokhale, Sophia Fuhui Lin, Richard Rines, Thomas Propson, Frederic T. Chong:
Systematic Crosstalk Mitigation for Superconducting Qubits via Frequency-Aware Compilation. 201-214
- Mahabubul Alam, Abdullah Ash-Saki, Swaroop Ghosh:
Circuit Compilation Methodologies for Quantum Approximate Optimization Algorithm. 215-228
Session 2B: Robust Machine Learning
- Qiyu Wan, Xin Fu:
Fast-BCNN: Massive Neuron Skipping in Bayesian Convolutional Neural Networks. 229-240
- Yiming Gan, Yuxian Qiu, Jingwen Leng, Minyi Guo, Yuhao Zhu:
Ptolemy: Architecture Support for Robust Deep Learning. 241-255
- Gil Shomron, Uri C. Weiser:
Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural Networks. 256-269
- Yi He, Prasanna Balaprakash, Yanjing Li:
FIdelity: Efficient Resilience Analysis Framework for Deep Learning Accelerators. 270-281
Session 2C: Memory I
- Minesh Patel, Jeremie S. Kim, Taha Shahroodi, Hasan Hassan, Onur Mutlu:
Bit-Exact ECC Recovery (BEER): Determining DRAM On-Die ECC Functions by Exploiting DRAM Data Retention Characteristics. 282-297
- Lev Mukhanov, Dimitrios S. Nikolopoulos, Georgios Karakonstantis:
DStress: Automatic Synthesis of DRAM Reliability Stress Viruses using Genetic Algorithms. 298-312
- Yaohua Wang, Lois Orosa, Xiangjun Peng, Yang Guo, Saugata Ghose, Minesh Patel, Jeremie S. Kim, Juan Gómez-Luna, Mohammad Sadrosadati, Nika Mansouri-Ghiasi, Onur Mutlu:
FIGARO: Improving System Performance via Fine-Grained In-DRAM Data Relocation and Caching. 313-328
- Themis Melissaris, Markos Markakis, Kelly A. Shaw, Margaret Martonosi:
PerpLE: Improving the Speed and Effectiveness of Memory Consistency Testing. 329-341
Session 3A: Near/In-Memory Computing
- Dibei Chen, Zhaoshi Li, Tianzhu Xiong, Zhiwei Liu, Jun Yang, Shouyi Yin, Shaojun Wei, Leibo Liu:
CATCAM: Constant-time Alteration Ternary CAM with Scalable In-Memory Architecture. 342-355
- Mohsen Imani, Saikishan Pampana, Saransh Gupta, Minxuan Zhou, Yeseong Kim, Tajana Rosing:
DUAL: Acceleration of Clustering Algorithms using Digital-based Processing In-Memory. 356-371
- Mingxuan He, Choungki Song, Ilkon Kim, Chunseok Jeong, Seho Kim, Il Park, Mithuna Thottethodi, T. N. Vijaykumar:
Newton: A DRAM-maker's Accelerator-in-Memory (AiM) Architecture for Machine Learning. 372-385
- Shuotao Xu, Thomas Bourgeat, Tianhao Huang, Hojun Kim, Sungjin Lee, Arvind:
AQUOMAN: An Analytic-Query Offloading Machine. 386-399
- Salonik Resch, S. Karen Khatamifard, Zamshed I. Chowdhury, Masoud Zabihi, Zhengyang Zhao, M. Hüsrev Cilasun, Jianping Wang, Sachin S. Sapatnekar, Ulya R. Karpuzcu:
MOUSE: Inference In Non-volatile Memory for Energy Harvesting Applications. 400-414
Session 3B: Compilation, Modeling, and Simulation
- Jinhu Jiang, Rongchao Dong, Zhongjun Zhou, Changheng Song, Wenwen Wang, Pen-Chung Yew, Weihua Zhang:
More with Less - Deriving More Translation Rules with Less Training Data for DBTs Using Parameterization. 415-426
- Jie Zhao, Peng Di:
Optimizing the Memory Hierarchy by Compositing Automatic Transformations on Computations and Data. 427-441
- Alex Renda, Yishen Chen, Charith Mendis, Michael Carbin:
DiffTune: Optimizing CPU Simulator Parameters with Learned Differentiable Surrogates. 442-455
- Mohammad Agbarya, Idan Yaniv, Jayneel Gandhi, Dan Tsafrir:
Predicting Execution Times With Partial Simulations in Virtual Memory Research: Why and How. 456-470
- Samuel Rogers, Joshua Slycord, Mohammadreza Baharani, Hamed Tabkhi:
gem5-SALAM: A System Architecture for LLVM-based Accelerator Modeling. 471-482
Session 3C: Non-volatile Memories
- Qiao Li, Min Ye, Yufei Cui, Liang Shi, Xiaoqiang Li, Tei-Wei Kuo, Chun Jason Xue:
Shaving Retries with Sentinels for Fast Read over High-Density 3D Flash. 483-495
- Zixuan Wang, Xiao Liu, Jian Yang, Theodore Michailidis, Steven Swanson, Jishen Zhao:
Characterizing and Modeling Non-Volatile Memory Systems. 496-508
- Apostolos Kokolis, Thomas Shull, Jian Huang, Josep Torrellas:
P-INSPECT: Architectural Support for Programmable Non-Volatile Memory Frameworks. 509-524
- Jungi Jeong, Jaewan Hong, Seungryoul Maeng, Changhee Jung, Youngjin Kwon:
Unbounded Hardware Transactional Memory for a Hybrid DRAM/NVM Memory System. 525-538
- Sara Mahdizadeh-Shahri, Seyed Armin Vakil-Ghahani, Aasheesh Kolli:
(Almost) Fence-less Persist Ordering. 539-554
Session 4A: Microarchitecture II
- Alberto Ros, Stefanos Kaxiras:
Speculative Enforcement of Store Atomicity. 555-567
- Juan M. Cebrian, Stefanos Kaxiras, Alberto Ros:
Boosting Store Buffer Efficiency with Store-Prefetch Bursts. 568-580
- Minli Julie Liao, Jack Sampson:
D-SOAP: Dynamic Spatial Orientation Affinity Prediction for Caching in Multi-Orientation Memory Systems. 581-595
- Quan M. Nguyen, Daniel Sánchez:
Pipette: Improving Core Utilization on Irregular Applications through Intra-Core Pipeline Parallelism. 596-608
- Chao Zhang, Yuan Zeng, John Shalf, Xiaochen Guo:
RnR: A Software-Assisted Record-and-Replay Hardware Prefetcher. 609-621
Session 4B: Resource Management
- Sheng-Chun Kao, Geonhwa Jeong, Tushar Krishna:
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning. 622-636
- Liang Zhou, Laxmi N. Bhuyan, K. K. Ramakrishnan:
Gemini: Learning to Manage CPU Power for Latency-Critical Search Engines. 637-649
- Neeraj Kulkarni, Gonzalo Gonzalez-Pumariega, Amulya Khurana, Christine A. Shoemaker, Christina Delimitrou, David H. Albonesi:
CuttleSys: Data-Driven Resource Management for Interactive Services on Reconfigurable Multicores. 650-664
- Brian C. Schwedock, Nathan Beckmann:
Jumanji: The Case for Dynamic NUCA in the Datacenter. 665-680
- Soroush Ghodrati, Byung Hoon Ahn, Joon Kyung Kim, Sean Kinzer, Brahmendra Reddy Yatham, Navateja Alla, Hardik Sharma, Mohammad Alian, Eiman Ebrahimi, Nam Sung Kim, Cliff Young, Hadi Esmaeilzadeh:
Planaria: Dynamic Architecture Fission for Spatial Multi-Tenant Acceleration of Deep Neural Networks. 681-697
Session 4C: Machine Learning Accelerators I
- Zhuoran Song, Feiyang Wu, Xueyuan Liu, Jing Ke, Naifeng Jing, Xiaoyao Liang:
VR-DANN: Real-Time Video Recognition via Decoder-Assisted Neural Network Acceleration. 698-710
- Dingqing Yang, Amin Ghasemazar, Xiaowei Ren, Maximilian Golub, Guy Lemieux, Mieszko Lis:
Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training. 711-724
- Hyeonjin Kim, Sungwoo Ahn, Yunho Oh, Bogil Kim, Won Woo Ro, William J. Song:
Duplo: Lifting Redundant Memory Accesses of Deep Neural Networks for GPU Tensor Cores. 725-737
- Liu Liu, Zheng Qu, Lei Deng, Fengbin Tu, Shuangchen Li, Xing Hu, Zhenyu Gu, Yufei Ding, Yuan Xie:
DUET: Boosting Deep Neural Network Efficiency on Dual-Module Architecture. 738-750
Session 5A: Machine Learning Accelerators II
- Huiyu Mo, Leibo Liu, Wenjing Hu, Wenping Zhu, Qiang Li, Ang Li, Shouyi Yin, Jian Chen, Xiaowei Jiang, Shaojun Wei:
TFE: Energy-efficient Transferred Filter-based Engine to Compress and Accelerate Convolutional Neural Networks. 751-765
- Nitish Kumar Srivastava, Hanchen Jin, Jie Liu, David H. Albonesi, Zhiru Zhang:
MatRaptor: A Sparse-Sparse Matrix Multiplication Accelerator Based on Row-Wise Product. 766-780
- Mostafa Mahmoud, Isak Edo, Ali Hadi Zadeh, Omar Mohamed Awad, Gennady Pekhimenko, Jorge Albericio, Andreas Moshovos:
TensorDash: Exploiting Sparsity to Accelerate Deep Neural Network Training. 781-795
- Zhangxiaowen Gong, Houxiang Ji, Christopher W. Fletcher, Christopher J. Hughes, Sara S. Baghsorkhi, Josep Torrellas:
SAVE: Sparsity-Aware Vector Engine for Accelerating DNN Training and Inference on CPUs. 796-810
- Ali Hadi Zadeh, Isak Edo, Omar Mohamed Awad, Andreas Moshovos:
GOBO: Quantizing Attention-Based NLP Models for Low Latency and Energy Efficient Inference. 811-824
Session 5B: Cloud and Datacenter
- Pyeongsu Park, Heetaek Jeong, Jangwoo Kim:
TrainBox: An Extreme-Scale Neural Network Training Server Architecture by Systematically Balancing Operations. 825-838
- Sulav Malla, Qingyuan Deng, Zoh Ebrahimzadeh, Joe Gasperetti, Sajal Jain, Parimala Kondety, Thiara Ortiz, Debra Vieira:
Coordinated Priority-aware Charging of Distributed Batteries in Oversubscribed Data Centers. 839-851
- Amirhossein Mirhosseini, Hossein Golestani, Thomas F. Wenisch:
HyperPlane: A Scalable Low-Latency Notification Accelerator for Software Data Planes. 852-867
- Christian Pinto, Dimitris Syrivelis, Michele Gazzetti, Panos K. Koutsovasilis, Andrea Reale, Kostas Katrinis, H. Peter Hofstee:
ThymesisFlow: A Software-Defined, HW/SW co-Designed Interconnect Stack for Rack-Scale Memory Disaggregation. 868-880
- Tianyi Liu, Sen He, Sunzhou Huang, Danny H. K. Tsang, Lingjia Tang, Jason Mars, Wei Wang:
A Benchmarking Framework for Interactive 3D Applications in the Cloud. 881-894
Session 5C: Domain-Specific Architecture
- Pengcheng Yao, Long Zheng, Zhen Zeng, Yu Huang, Chuangyi Gui, Xiaofei Liao, Hai Jin, Jingling Xue:
A Locality-Aware Energy-Efficient Accelerator for Graph Mining Applications. 895-907
- Shafiur Rahman, Nael B. Abu-Ghazaleh, Rajiv Gupta:
GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph Processing. 908-921
- Tong Geng, Ang Li, Runbin Shi, Chunshu Wu, Tianqi Wang, Yanfei Li, Pouya Haghi, Antonino Tumeo, Shuai Che, Steven K. Reinhardt, Martin C. Herbordt:
AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing. 922-936
- Daichi Fujiki, Shunhao Wu, Nathan Ozog, Kush Goliya, David T. Blaauw, Satish Narayanasamy, Reetuparna Das:
SeedEx: A Genome Sequencing Accelerator for Optimal Alignments in Subminimal Space. 937-950
- Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gómez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu:
GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis. 951-966
Session 6A: GPGPU
- Xia Zhao, Magnus Jahre, Lieven Eeckhout:
Selective Replication in Memory-Side GPU Caches. 967-980
- Yuan-Hsi Chou, Christopher Ng, Shaylin Cattell, Jeremy Intan, Matthew D. Sinclair, Joseph Devietti, Timothy G. Rogers, Tor M. Aamodt:
Deterministic Atomic Buffering. 981-995
- Hodjat Asghari Esfeden, AmirAli Abdolrashidi, Shafiur Rahman, Daniel Wong, Nael B. Abu-Ghazaleh:
BOW: Breathing Operand Windows to Exploit Bypassing in GPUs. 996-1008
- Lu Wang, Magnus Jahre, Almutaz Adileh, Lieven Eeckhout:
MDM: The GPU Memory Divergence Model. 1009-1021
- Mahmoud Khairy, Vadim Nikiforov, David W. Nellans, Timothy G. Rogers:
Locality-Centric Data and Threadblock Management for Massive GPUs. 1022-1036
Session 6B: Mobile and Embedded Architecture
- Yu Feng, Boyuan Tian, Tiancheng Xu, Paul N. Whatmough, Yuhao Zhu:
Mesorasi: Architecture Support for Point Cloud Analytics via Delayed-Aggregation. 1037-1050
- Jawad Haj-Yahya, Mohammed Alser, Jeremie S. Kim, Lois Orosa, Efraim Rotem, Avi Mendelson, Anupam Chattopadhyay, Onur Mutlu:
FlexWatts: A Power- and Workload-Aware Hybrid Power Delivery Network for Energy-Efficient Microprocessors. 1051-1066
- Bo Yu, Wei Hu, Leimeng Xu, Jie Tang, Shaoshan Liu, Yuhao Zhu:
Building the Computing System for Autonomous Micromobility Vehicles: Design Constraints and Architectural Optimizations. 1067-1081
- Young Geun Kim, Carole-Jean Wu:
AutoScale: Energy Efficiency Optimization for Stochastic Edge Inference Using Reinforcement Learning. 1082-1096
- Tianyu Jia, Yuhao Ju, Russ Joseph, Jie Gu:
NCPU: An Embedded Neural CPU Architecture on Resource-Constrained Low Power Devices for Real-time End-to-End Performance. 1097-1109
Session 6C: Security and Privacy II
- Thomas Bourgeat, Jules Drean, Yuheng Yang, Lillian Tsai, Joel S. Emer, Mengjia Yan:
CaSA: End-to-end Quantitative Security Analysis of Randomly Mapped Caches. 1110-1123
- Samira Mirbagher Ajorpaz, Gilles Pokam, Esmaeil Mohammadian Koruyeh, Elba Garza, Nael B. Abu-Ghazaleh, Daniel A. Jiménez:
PerSpectron: Detecting Invariant Footprints of Microarchitectural Attacks with Perceptron. 1124-1137
- Zirui Neil Zhao, Houxiang Ji, Mengjia Yan, Jiyong Yu, Christopher W. Fletcher, Adam Morrison, Darko Marinov, Josep Torrellas:
Speculation Invariance (InvarSpec): Faster Safe Execution Through Program Analysis. 1138-1152
- Yonghae Kim, Jaekyu Lee, Hyesoon Kim:
Hardware-based Always-On Heap Memory Safety. 1153-1166