default search action
43rd ISCA 2016: Seoul, South Korea
- 43rd ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2016, Seoul, South Korea, June 18-22, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-8947-1
Session 1A: Neural Networks I
- Jorge Albericio, Patrick Judd, Tayler H. Hetherington, Tor M. Aamodt, Natalie D. Enright Jerger, Andreas Moshovos:
Cnvlutin: Ineffectual-Neuron-Free Deep Neural Network Computing. 1-13 - Ali Shafiee, Anirban Nag, Naveen Muralimanohar, Rajeev Balasubramonian, John Paul Strachan, Miao Hu, R. Stanley Williams, Vivek Srikumar:
ISAAC: A Convolutional Neural Network Accelerator with In-Situ Analog Arithmetic in Crossbars. 14-26 - Ping Chi, Shuangchen Li, Cong Xu, Tao Zhang, Jishen Zhao, Yongpan Liu, Yu Wang, Yuan Xie:
PRIME: A Novel Processing-in-Memory Architecture for Neural Network Computation in ReRAM-Based Main Memory. 27-39
Session 1B: Heterogeneous Architecture/ Approximate Computing
- Christopher Torng, Moyang Wang, Christopher Batten:
Asymmetry-Aware Work-Stealing Runtimes. 40-52 - Hung-Wei Tseng, Qianchen Zhao, Yuxiao Zhou, Mark Gahagan, Steven Swanson:
Morpheus: Creating Application Objects Efficiently for Heterogeneous Computing. 53-65 - Divya Mahajan, Amir Yazdanbakhsh, Jongse Park, Bradley Thwaites, Hadi Esmaeilzadeh:
Towards Statistical Guarantees in Controlling Quality Tradeoffs for Approximate Acceleration. 66-77
Session 2A: Caches
- Akanksha Jain, Calvin Lin:
Back to the Future: Leveraging Belady's Algorithm for Improved Cache Replacement. 78-89 - Chang Hyun Park, Taekyung Heo, Jaehyuk Huh:
Efficient Synonym Filtering and Scalable Delayed Translation for Hybrid Virtual Caching. 90-102 - Hsiang-Yun Cheng, Jishen Zhao, Jack Sampson, Mary Jane Irwin, Aamer Jaleel, Yu Lu, Yuan Xie:
LAP: Loop-Block Aware Inclusion Properties for Energy-Efficient Asymmetric Last Level Caches. 103-114
Session 2B: Hardware Design
- David Koeplinger, Raghu Prabhakar, Yaqi Zhang, Christina Delimitrou, Christos Kozyrakis, Kunle Olukotun:
Automatic Generation of Efficient Accelerators for Reconfigurable Hardware. 115-127 - Donggyu Kim, Adam M. Izraelevitz, Christopher Celio, Hokeun Kim, Brian Zimmer, Yunsup Lee, Jonathan Bachrach, Krste Asanovic:
Strober: Fast and Accurate Sample-Based Energy Simulation for Arbitrary RTL. 128-139 - Michael A. Laurenzano, Yunqi Zhang, Jiang Chen, Lingjia Tang, Jason Mars:
PowerChop: Identifying and Managing Non-critical Units in Hybrid Processor Architectures. 140-152
Session 3A: Accelerators
- Boncheol Gu, Andre S. Yoon, Duck-Ho Bae, Insoon Jo, Jinyoung Lee, Jonghyun Yoon, Jeong-Uk Kang, Moonsang Kwon, Chanho Yoon, Sangyeun Cho, Jaeheon Jeong, Duckhyun Chang:
Biscuit: A Framework for Near-Data Processing of Big Data Workloads. 153-165 - Muhammet Mustafa Ozdal, Serif Yesil, Taemin Kim, Andrey Ayupov, John Greth, Steven M. Burns, Özcan Özturk:
Energy Efficient Architecture for Graph Analytics Accelerators. 166-177 - Ikuo Magaki, Moein Khazraee, Luis Vega Gutierrez, Michael Bedford Taylor:
ASIC Clouds: Specializing the Datacenter. 178-190
Session 3B: GPU I
- Yunho Oh, Keunsoo Kim, Myung Kuk Yoon, Jong Hyun Park, Yongjun Park, Won Woo Ro, Murali Annavaram:
APRES: Improving Cache Efficiency by Exploiting Load Characteristics on GPUs. 191-203 - Kevin Hsieh, Eiman Ebrahimi, Gwangsun Kim, Niladrish Chatterjee, Mike O'Connor, Nandita Vijaykumar, Onur Mutlu, Stephen W. Keckler:
Transparent Offloading and Mapping (TOM): Enabling Programmer-Transparent Near-Data Processing in GPU Systems. 204-216 - Qiumin Xu, Hyeran Jeon, Keunsoo Kim, Won Woo Ro, Murali Annavaram:
Warped-Slicer: Efficient Intra-SM Slicing through Dynamic Resource Partitioning for GPU Multiprogramming. 230-242
Session 4A: Neural Networks II
- Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, William J. Dally:
EIE: Efficient Inference Engine on Compressed Deep Neural Network. 243-254 - Robert LiKamWa, Yunhui Hou, Yuan Gao, Mia Polansky, Lin Zhong:
RedEye: Analog ConvNet Image Sensor Architecture for Continuous Mobile Vision. 255-266 - Brandon Reagen, Paul N. Whatmough, Robert Adolf, Saketh Rama, Hyunkwang Lee, Sae Kyu Lee, José Miguel Hernández-Lobato, Gu-Yeon Wei, David M. Brooks:
Minerva: Enabling Low-Power, Highly-Accurate Deep Neural Network Accelerators. 267-278
Session 4B: NoC/Virtualization
- Yuan Yao, Zhonghai Lu:
Opportunistic Competition Overhead Reduction for Expediting Critical Section in NoC Based CMPs. 279-290 - Channoh Kim, Sungmin Kim, Hyeon-Gyu Cho, Doo-Young Kim, Jaehyeok Kim, Young H. Oh, Hakbeom Jang, Jae W. Lee:
Short-Circuit Dispatch: Accelerating Virtual Machine Interpreters on Embedded Processors. 291-303 - Christoffer Dall, Shih-Wei Li, Jin Tack Lim, Jason Nieh, Georgios Koloventzos:
ARM Virtualization: Performance and Architectural Implications. 304-316
Session 5A: Cache/Memory Compression
- Jayesh Gaur, Alaa R. Alameldeen, Sreenivas Subramoney:
Base-Victim Compression: An Opportunistic Cache Compression Architecture. 317-328 - Jungrae Kim, Michael B. Sullivan, Esha Choukse, Mattan Erez:
Bit-Plane Compression: Transforming Data for Better Compression in Many-Core Architectures. 329-340
Session 5B: Reliability I
- Prashant J. Nair, Vilas Sridharan, Moinuddin K. Qureshi:
XED: Exposing On-Die Error Detection Information for Strong Memory Reliability. 341-353 - Mohammad Mejbah Ul Alam, Abdullah Muzahid:
Production-Run Software Failure Diagnosis via Adaptive Communication Tracking. 354-366
Session 6: Neural Networks III
- Yu-Hsin Chen, Joel S. Emer, Vivienne Sze:
Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks. 367-379 - Duckhwan Kim, Jaeha Kung, Sek M. Chai, Sudhakar Yalamanchili, Saibal Mukhopadhyay:
Neurocube: A Programmable Digital Neuromorphic Architecture with High-Density 3D Memory. 380-392 - Shaoli Liu, Zidong Du, Jinhua Tao, Dong Han, Tao Luo, Yuan Xie, Yunji Chen, Tianshi Chen:
Cambricon: An Instruction Set Architecture for Neural Networks. 393-405
Session 7A: Micro Architecture
- Ziqiang Huang, Andrew D. Hilton, Benjamin C. Lee:
Decoupling Loads for Nano-Instruction Set Computers. 406-417 - Timothy Hayes, Oscar Palomar, Osman S. Unsal, Adrián Cristal, Mateo Valero:
Future Vector Microprocessor Extensions for Data Aggregations. 418-430 - Faissal M. Sleiman, Thomas F. Wenisch:
Efficiently Scaling Out-of-Order Cores for Simultaneous Multithreading. 431-443 - Milad Hashemi, Khubaib, Eiman Ebrahimi, Onur Mutlu, Yale N. Patt:
Accelerating Dependent Cache Misses with an Enhanced Memory Controller. 444-455
Session 7B: Datacenter
- Yunqi Zhang, David Meisner, Jason Mars, Lingjia Tang:
Treadmill: Attributing the Source of Tail Latency through Precise Load Testing and Statistical Inference. 456-468 - Qiang Wu, Qingyuan Deng, Lakshmi Ganesh, Chang-Hong Hsu, Yun Jin, Sanjeev Kumar, Bin Li, Justin Meza, Yee Jiun Song:
Dynamo: Facebook's Data Center-Wide Power Management System. 469-480 - Daniel Wong:
Peak Efficiency Aware Scheduling for Highly Energy Proportional Servers. 481-492 - Chao Li, Zhenhua Wang, Xiaofeng Hou, Haopeng Chen, Xiaoyao Liang, Minyi Guo:
Power Attack Defense: Securing Battery-Backed Data Centers. 493-505
Session 8A: Memory I
- Mingyu Gao, Christina Delimitrou, Dimin Niu, Krishna T. Malladi, Hongzhong Zheng, Bob Brennan, Christos Kozyrakis:
DRAF: A Low-Power DRAM-Based Reconfigurable Acceleration Fabric. 506-518 - Lunkai Zhang, Brian Neely, Diana Franklin, Dmitri B. Strukov, Yuan Xie, Frederic T. Chong:
Mellow Writes: Extending Lifetime in Resistive Memories through Selective Slow Write Backs. 519-531 - Yanqi Zhou, David Wentzlaff:
MITTS: Memory Inter-arrival Time Traffic Shaping. 532-544
Session 8B: Emerging Architectures
- Joshua San Miguel, Natalie D. Enright Jerger:
The Anytime Automaton. 545-557 - Siyang Wang, Xiangyu Zhang, Yuxuan Li, Ramin Bashizade, Song Yang, Chris Dwyer, Alvin R. Lebeck:
Accelerating Markov Random Field Inference Using Molecular Optical Gibbs Sampling Units. 558-569 - Yipeng Huang, Ning Guo, Mingoo Seok, Yannis P. Tsividis, Simha Sethumadhavan:
Evaluation of an Analog Accelerator for Linear Algebra. 570-582
Session 9A: GPU II
- Jin Wang, Norm Rubin, Albert Sidelnik, Sudhakar Yalamanchili:
LaPerm: Locality Aware Scheduler for Dynamic Parallelism on GPUs. 583-595 - Sagi Shahar, Shai Bergman, Mark Silberstein:
ActivePointers: A Case for Software Address Translation on GPUs. 596-608 - Myung Kuk Yoon, Keunsoo Kim, Sangpil Lee, Won Woo Ro, Murali Annavaram:
Virtual Thread: Maximizing Thread-Level Parallelism beyond GPU Scheduling Limit. 609-621
Session 9B: Reliability II
- Jungrae Kim, Michael B. Sullivan, Sangkug Lym, Mattan Erez:
All-Inclusive ECC: Thorough End-to-End Protection for Reliable Computer Memory. 622-633 - Henry Duwe, Xun Jian, Daniel Petrisko, Rakesh Kumar:
Rescuing Uncorrectable Fault Patterns in On-Chip Memories through Error Pattern Transformation. 634-644 - Dong-Wan Kim, Mattan Erez:
RelaxFault Memory Repair. 645-657
Session 10A: Energy Efficient Computing
- Raghavendra Pradyumna Pothukuchi, Amin Ansari, Petros G. Voulgaris, Josep Torrellas:
Using Multiple Input, Multiple Output Formal Control to Maximize Resource Efficiency in Architectures. 658-670 - Hari Cherupalli, Rakesh Kumar, John Sartori:
Exploiting Dynamic Timing Slack for Energy Efficiency in Ultra-Low-Power Embedded Systems. 671-681 - Yanqi Zhou, Henry Hoffmann, David Wentzlaff:
CASH: Supporting IaaS Customers with a Sub-core Configurable Architecture. 682-694
Session 10B: Memory II
- Mohammad Arjomand, Mahmut T. Kandemir, Anand Sivasubramaniam, Chita R. Das:
Boosting Access Parallelism to PCM-Based Main Memory. 695-706 - Jayneel Gandhi, Mark D. Hill, Michael M. Swift:
Agile Paging: Exceeding the Best of Nested and Shadow Paging. 707-718 - Hoseok Seol, Wongyu Shin, Jaemin Jang, Jungwhan Choi, Jinwoong Suh, Lee-Sup Kim:
Energy Efficient Data Encoding in DRAM Channels Exploiting Data Value Similarity. 719-730
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.