49th ISCA 2022: New York, NY, USA
- Valentina Salapura, Mohamed Zahran, Fred Chong, Lingjia Tang:
ISCA '22: The 49th Annual International Symposium on Computer Architecture, New York, New York, USA, June 18 - 22, 2022. ACM 2022, ISBN 978-1-4503-8610-4 - Abhishek Bhattacharyya, Abhijith Somashekhar, Joshua San Miguel:
NvMR: non-volatile memory renaming for intermittent computing. 1-13 - Ashkan Asgharzadeh, Juan M. Cebrian, Arthur Perais, Stefanos Kaxiras, Alberto Ros:
Free atomics: hardware atomic operations without fences. 14-26 - Jaewon Lee, Yonghae Kim, Jiashen Cao, Euna Kim, Jaekyu Lee, Hyesoon Kim:
Securing GPU via region-based bounds checking. 27-41 - Brian C. Schwedock, Piratach Yoovidhya, Jennifer Seibert, Nathan Beckmann:
täkō: a polymorphic cache hierarchy for general-purpose optimization of data movement. 42-58 - Samuel Alexander Stein, Nathan Wiebe, Yufei Ding, Bo Peng, Karol Kowalski, Nathan A. Baker, James A. Ang, Ang Li:
EQC: ensembled quantum computing for variational quantum algorithms. 59-71 - Nicholas Mosier, Hanna Lachnitt, Hamed Nemati, Caroline Trippel:
Axiomatic hardware-software contracts for security. 72-86 - Xing Zhou, Zhilei Xu, Cong Wang, Mingyu Gao:
PPMLAC: high performance chipset architecture for secure multi-party computation. 87-101 - Jilan Lin, Ling Liang, Zheng Qu, Ishtiyaque Ahmad, Liu Liu, Fengbin Tu, Trinabh Gupta, Yufei Ding, Yuan Xie:
INSPIRE: in-storage private information retrieval via protocol and architecture co-design. 102-115 - Jin Zhao, Yun Yang, Yu Zhang, Xiaofei Liao, Lin Gu, Ligang He, Bingsheng He, Hai Jin, Haikun Liu, Xinyu Jiang, Hui Yu:
TDGraph: a topology-driven accelerator for high-performance streaming graph processing. 116-129 - Guohao Dai, Zhenhua Zhu, Tianyu Fu, Chiyue Wei, Bangyan Wang, Xiangyu Li, Yuan Xie, Huazhong Yang, Yu Wang:
DIMMining: pruning-efficient and parallel graph mining on near-memory-computing. 130-145 - Nishil Talati, Haojie Ye, Yichen Yang, Leul Belayneh, Kuan-Yu Chen, David T. Blaauw, Trevor N. Mudge, Ronald G. Dreslinski:
NDMiner: accelerating graph pattern mining using near data processing. 146-159 - Muhammad Umar, Weizhe Hua, Zhiru Zhang, G. Edward Suh:
SoftVN: efficient memory protection via software-provided version numbers. 160-172 - Nikola Samardzic, Axel Feldmann, Aleksandar Krastev, Nathan Manohar, Nicholas Genise, Srinivas Devadas, Karim Eldefrawy, Chris Peikert, Daniel Sánchez:
CraterLake: a hardware accelerator for efficient unbounded computation on encrypted data. 173-187 - Gang Liu, Kenli Li, Zheng Xiao, Rujia Wang:
PS-ORAM: efficient crash consistency support for oblivious RAM on NVM. 188-203 - Jack Cook, Jules Drean, Jonathan Behrens, Mengjia Yan:
There's always a bigger fish: a clarifying analysis of a machine-learning-assisted side-channel attack. 204-217 - Marzieh Lenjani, Alif Ahmed, Mircea Stan, Kevin Skadron:
Gearbox: a case for supporting accumulation dispatching and hybrid partitioning in PIM-based accelerators. 218-230 - Alexandar Devic, Siddhartha Balakrishna Rai, Anand Sivasubramaniam, Ameen Akel, Sean Eilert, Justin Eno:
To PIM or not for emerging general purpose processing in DDR memory systems. 231-244 - Siying Feng, Xin He, Kuan-Yu Chen, Liu Ke, Xuan Zhang, David T. Blaauw, Trevor N. Mudge, Ronald G. Dreslinski:
MeNDA: a near-memory multi-way merge solution for sparse transposition and dataflows. 245-258 - Xingchen Man, Jianfeng Zhu, Guihuan Song, Shouyi Yin, Shaojun Wei, Leibo Liu:
CaSMap: agile mapper for reconfigurable spatial architectures by automatically clustering intermediate representations and scattering mapping process. 259-273 - Yuanchao Xu, Chencheng Ye, Yan Solihin, Xipeng Shen:
FFCCD: fence-free crash-consistent concurrent defragmentation for persistent memory. 274-288 - Sangwon Lee, Miryeong Kwon, Gyuyoung Park, Myoungsoo Jung:
LightPC: hardware and software co-design for energy-efficient full system persistence. 289-305 - Ahmed H. M. O. Abulila, Izzat El Hajj, Myoungsoo Jung, Nam Sung Kim:
ASAP: architecture support for asynchronous persistence. 306-319 - Gagandeep Singh, Rakesh Nadig, Jisung Park, Rahul Bera, Nastaran Hajinazar, David Novo, Juan Gómez-Luna, Sander Stuijk, Henk Corporaal, Onur Mutlu:
Sibyl: adaptive and extensible data placement in hybrid storage systems using online reinforcement learning. 320-336 - Anbang Wu, Gushu Li, Hezi Zhang, Gian Giacomo Guerreschi, Yufei Ding, Yuan Xie:
A synthesis framework for stitching surface code with superconducting quantum devices. 337-350 - Lingling Lao, Dan E. Browne:
2QAN: a quantum compiler for 2-local qubit Hamiltonian simulation algorithms. 351-365 - Ilkwon Byun, Junpyo Kim, Dongmoon Min, Ikki Nagaoka, Kosuke Fukumitsu, Iori Ishikawa, Teruo Tanimoto, Masamitsu Tanaka, Koji Inoue, Jangwoo Kim:
XQsim: modeling cross-technology control processors for 10+K qubit quantum computers. 366-382 - Tirthak Patel, Daniel Silver, Devesh Tiwari:
Geyser: a compilation framework for quantum computing with neutral atoms. 383-395 - Ali Sedaghati, Milad Hakimi, Reza Hojabr, Arrvindh Shriraman:
X-cache: a modular architecture for domain-specific caches. 396-409 - Sudhanshu Shukla, Sumeet Bandishte, Jayesh Gaur, Sreenivas Subramoney:
Register file prefetching. 410-423 - Jounghoo Lee, Yeonan Ha, Suhyun Lee, Jinyoung Woo, Jinho Lee, Hanhwi Jang, Youngsok Kim:
GCoM: a detailed GPU core model for accurate analytical modeling of modern GPUs. 424-436 - Gilead Posluns, Yan Zhu, Guowei Zhang, Mark C. Jeffrey:
A scalable architecture for reprioritizing ordered parallelism. 437-453 - Nathaniel Bleier, Muhammad Husnain Mubarik, Srijan Chakraborty, Shreyas Kishore, Rakesh Kumar:
Rethinking programmable wearable processors. 454-467 - Di Wu, Jingjie Li, Zhewen Pan, Younghyun Kim, Joshua San Miguel:
uBrain: a unary brain computer interface. 468-481 - Dehui Lin, Yasamin Tabatabaee, Yash Pote, Djordje Jevdjic:
Managing reliability skew in DNA storage. 482-494 - Robert Hanhan, Esteban Garzón, Zuher Jahshan, Adam Teman, Marco Lanuzza, Leonid Yavits:
EDAM: edit distance tolerant approximate matching content addressable memory. 495-507 - Anshujit Sharma, Richard Afoakwa, Zeljko Ignjatovic, Michael C. Huang:
Increasing Ising machine capacity with multi-chip architectures. 508-521 - Edward Hanson, Shiyu Li, Hai Helen Li, Yiran Chen:
Cascading structured pruning: enabling high data reuse for sparse DNN accelerators. 522-535 - Jonathan S. Lew, Yunpeng Liu, Wenyi Gong, Negar Goli, R. David Evans, Tor M. Aamodt:
Anticipating and eliminating redundant computations in accelerated sparse training. 536-551 - Yunan Zhang, Po-An Tsai, Hung-Wei Tseng:
SIMD2: a generalized matrix instruction set for accelerating tensor computation beyond GEMM. 552-566 - Dennis Abts, Garrin Kimmell, Andrew C. Ling, John Kim, Matthew Boyd, Andrew Bitar, Sahil Parmar, Ibrahim Ahmed, Roberto DiCecco, David Han, John Thompson, Michael Bye, Jennifer Hwang, Jeremy Fowers, Peter Lillian, Ashwin Murthy, Elyas Mehtabuddin, Chetan Tekur, Thomas Sohmers, Kris Kang, Stephen Maresh, Jonathan Ross:
A software-defined tensor streaming multiprocessor for large-scale machine learning. 567-580 - Saeed Rashidi, William Won, Sudarshan Srinivasan, Srinivas Sridharan, Tushar Krishna:
Themis: a network bandwidth-aware collective scheduling policy for distributed training of DL models. 581-596 - Mohammad Bakhshalipour, Seyed Borna Ehsani, Mohamad Qadri, Dominic Guri, Maxim Likhachev, Phillip B. Gibbons:
RACOD: algorithm/hardware co-design for mobile robot path planning. 597-609 - Haoran You, Cheng Wan, Yang Zhao, Zhongzhi Yu, Yonggan Fu, Jiayi Yuan, Shang Wu, Shunyao Zhang, Yongan Zhang, Chaojian Li, Vivek Boominathan, Ashok Veeraraghavan, Ziyun Li, Yingyan Lin:
EyeCoD: eye tracking system acceleration via flatcam-based algorithm & accelerator co-design. 610-622 - Helena Caminal, Yannis Chronis, Tianshu Wu, Jignesh M. Patel, José F. Martínez:
Accelerating database analytic query workloads using an associative processor. 623-637 - Damla Senol Cali, Konstantinos Kanellopoulos, Joël Lindegger, Zülal Bingöl, Gurpreet S. Kalsi, Ziyi Zuo, Can Firtina, Meryem Banu Cavlak, Jeremie S. Kim, Nika Mansouri-Ghiasi, Gagandeep Singh, Juan Gómez-Luna, Nour Almadhoun Alserr, Mohammed Alser, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu:
SeGraM: a universal hardware accelerator for genomic sequence-to-graph and sequence-to-sequence mapping. 638-655 - Zhuowen Zou, Hanning Chen, Prathyush Poduval, Yeseong Kim, Mahdi Imani, Elaheh Sadredini, Rosario Cammarota, Mohsen Imani:
BioHD: an efficient genome sequence search platform using HyperDimensional memorization. 656-669 - Kevin Loughlin, Stefan Saroiu, Alec Wolman, Yatin A. Manerkar, Baris Kasikci:
MOESI-prime: preventing coherence-induced hammering in commodity workloads. 670-684 - Joseph Ravichandran, Weon Taek Na, Jay Lang, Mengjia Yan:
PACMAN: attacking ARM pointer authentication with speculative execution. 685-698 - Moinuddin K. Qureshi, Aditya Rohan, Gururaj Saileshwar, Prashant J. Nair:
Hydra: enabling low-overhead mitigation of row-hammer at ultra-low thresholds via hybrid tracking. 699-710 - Sangpyo Kim, Jongmin Kim, Michael Jaemin Kim, Wonkyung Jung, John Kim, Minsoo Rhu, Jung Ho Ahn:
BTS: an accelerator for bootstrappable fully homomorphic encryption. 711-725 - Weizhe Hua, Muhammad Umar, Zhiru Zhang, G. Edward Suh:
MGX: near-zero overhead memory protection for data-intensive accelerators. 726-741 - Shixin Song, Tanvir Ahmed Khan, Sara Mahdizadeh-Shahri, Akshitha Sriraman, Niranjan K. Soundararajan, Sreenivas Subramoney, Daniel A. Jiménez, Heiner Litz, Baris Kasikci:
Thermometer: profile-guided BTB replacement for data center applications. 742-756 - David Schall, Artemiy Margaritov, Dmitrii Ustiugov, Andreas Sandberg, Boris Grot:
Lukewarm serverless functions: characterization and optimization. 757-770 - Hans Kasan, Gwangsun Kim, Yung Yi, John Kim:
Dynamic global adaptive routing in high-radix networks. 771-783 - Udit Gupta, Mariam Elgamal, Gage Hills, Gu-Yeon Wei, Hsien-Hsin S. Lee, David Brooks, Carole-Jean Wu:
ACT: designing sustainable computer systems with an architectural carbon modeling tool. 784-799 - Liam Patterson, David Pigorovsky, Brian Dempsey, Nikita Lazarev, Aditya Shah, Clara Steinhoff, Ariana Bruno, Justin Hu, Christina Delimitrou:
HiveMind: a hardware-software system stack for serverless edge swarms. 800-816 - Marcelo Orenes-Vera, Aninda Manocha, Jonathan Balkind, Fei Gao, Juan L. Aragón, David Wentzlaff, Margaret Martonosi:
Tiny but mighty: designing and realizing scalable latency tolerance for manycore SoCs. 817-830 - Nathaniel Bleier, Calvin Lee, Francisco Rodriguez, Antony Sou, Scott White, Rakesh Kumar:
FlexiCores: low footprint, high yield, field reprogrammable flexible microprocessors. 831-846 - Ceyu Xu, Chris Kjellqvist, Lisa Wu Wills:
SNS's not a synthesizer: a deep-learning-based synthesis predictor. 847-859 - Youngeun Kwon, Minsoo Rhu:
Training personalized recommendation systems from (GPU) scratch: look forward not backwards. 860-873 - Size Zheng, Renze Chen, Anjiang Wei, Yicheng Jin, Qin Han, Liqiang Lu, Bingyang Wu, Xiuhong Li, Shengen Yan, Yun Liang:
AMOS: enabling automatic mapping for tensor computations on spatial accelerators with hardware abstraction. 874-887 - Ali Hadi Zadeh, Mostafa Mahmoud, Ameer Abdelhadi, Andreas Moshovos:
Mokey: enabling narrow fixed-point inference for out-of-the-box floating-point transformer models. 888-901 - Zheng Li, Soroush Ghodrati, Amir Yazdanbakhsh, Hadi Esmaeilzadeh, Mingu Kang:
Accelerating attention through gradient-based learned runtime pruning. 902-915 - Zhangxiaowen Gong, Houxiang Ji, Yao Yao, Christopher W. Fletcher, Christopher J. Hughes, Josep Torrellas:
Graphite: optimizing graph neural networks on CPUs through cooperative software-hardware techniques. 916-931 - Yunjae Lee, Jinha Chung, Minsoo Rhu:
SmartSAGE: training large-scale graph neural networks using in-storage processing architectures. 932-945 - Shuangchen Li, Dimin Niu, Yuhao Wang, Wei Han, Zhe Zhang, Tianchan Guan, Yijin Guan, Heng Liu, Linyong Huang, Zhaoyang Du, Fei Xue, Yuanwei Fang, Hongzhong Zheng, Yuan Xie:
Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network. 946-961 - Yu Feng, Gunnar Hammonds, Yiming Gan, Yuhao Zhu:
Crescent: taming memory irregularities for accelerating deep point cloud analytics. 962-977 - Karthikeyan Sankaralingam, Tony Nowatzki, Vinay Gangadhar, Preyas Shah, Michael Davies, William Galliher, Ziliang Guo, Jitu Khare, Deepak Vijay, Poly Palamuttam, Maghawan Punde, Alex Tan, Vijay Thiruvengadam, Rongyi Wang, Shunmiao Xu:
The Mozart reuse exposed dataflow processor for AI and beyond: industrial product. 978-992 - Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, Yinbin Ma, Junjie Yang, Ellie Wen, Hong Li, Lin Yang, Chonglin Sun, Whitney Zhao, Dimitry Melts, Krishna Dhulipala, K. R. Kishore, Tyler Graf, Assaf Eisenman, Kiran Kumar Matam, Adi Gangidi, Guoqiang Jerry Chen, Manoj Krishnan, Avinash Nayak, Krishnakumar Nair, Bharath Muthiah, Mahmoud Khorashadi, Pallab Bhattacharya, Petr Lapukhov, Maxim Naumov, Ajit Mathews, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, Vijay Rao:
Software-hardware co-design for fast and scalable training of deep learning recommendation models. 993-1011 - Cédric Lichtenau, Alper Buyuktosunoglu, Ramon Bertran, Peter Figuli, Christian Jacobi, Nikolaos Papandreou, Haris Pozidis, Anthony Saporito, Andrew Sica, Elpida Tzortzatos:
AI accelerator on IBM Telum processor: industrial product. 1012-1028 - Jian Chen, Xiaoyu Zhang, Tao Wang, Ying Zhang, Tao Chen, Jiajun Chen, Mingxu Xie, Qiang Liu:
Fidas: fortifying the cloud via comprehensive FPGA-based offloading for intrusion detection: industrial product. 1029-1041 - Mark Zhao, Niket Agarwal, Aarti Basant, Bugra Gedik, Satadru Pan, Mustafa Ozdal, Rakesh Komuravelli, Jerry Pan, Tianshu Bao, Haowei Lu, Sundaram Narayanan, Jack Langman, Kevin Wilfong, Harsha Rastogi, Carole-Jean Wu, Christos Kozyrakis, Parik Pol:
Understanding data storage and ingestion for large-scale deep recommendation model training: industrial product. 1042-1057 - Daniel Lustig, Simon Cooksey, Olivier Giroux:
Mixed-proxy extensions for the NVIDIA PTX memory consistency model: industrial product. 1058-1070