default search action
21st HPCA 2015: Burlingame, CA, USA
- 21st IEEE International Symposium on High Performance Computer Architecture, HPCA 2015, Burlingame, CA, USA, February 7-11, 2015. IEEE Computer Society 2015, ISBN 978-1-4799-8930-0
- Nandhini Chandramoorthy, Giuseppe Tagliavini, Kevin M. Irick, Antonio Pullini, Siddharth Advani, Sulaiman Al Habsi, Matthew Cotter, John Sampson, Vijaykrishnan Narayanan, Luca Benini:
Exploring architectural heterogeneity in intelligent vision systems. 1-12 - Arthur Perais, André Seznec:
BeBoP: A cost effective predictor infrastructure for superscalar value prediction. 13-25 - Timothy Hayes, Oscar Palomar, Osman S. Unsal, Adrián Cristal, Mateo Valero:
VSR sort: A novel vectorised sorting algorithm & architecture extensions for future microprocessors. 26-38 - Víctor Jiménez, Alper Buyuktosunoglu, Pradip Bose, Francis P. O'Connell, Francisco J. Cazorla, Mateo Valero:
Increasing multicore system efficiency through intelligent bandwidth shifting. 39-50 - Gennady Pekhimenko, Tyler Huberty, Rui Cai, Onur Mutlu, Phillip B. Gibbons, Michael A. Kozuch, Todd C. Mowry:
Exploiting compressed block size as an indicator of future reuse. 51-63 - Nathan Beckmann, Daniel Sánchez:
Talus: A simple way to remove cliffs in cache performance. 64-75 - Xiaolong Xie, Yun Liang, Yu Wang, Guangyu Sun, Tao Wang:
Coordinated static and dynamic cache bypassing for GPUs. 76-88 - Dong Li, Minsoo Rhu, Daniel R. Johnson, Mike O'Connor, Mattan Erez, Doug Burger, Donald S. Fussell, Stephen W. Redder:
Priority-based cache allocation in throughput processors. 89-100 - Jungrae Kim, Michael B. Sullivan, Mattan Erez:
Bamboo ECC: Strong, safe, and flexible codes for reliable computer memory. 101-112 - Xiaodong Wang, José F. Martínez:
XChange: A market-based approach to scalable dynamic multi-resource allocation in multicore architectures. 113-125 - Mitesh R. Meswani, Sergey Blagodurov, David Roberts, John Slice, Mike Ignatowski, Gabriel H. Loh:
Heterogeneous memory architectures: A HW/SW approach for mixing die-stacked and off-package memories. 126-136 - Yuhao Zhu, Matthew Halpern, Vijay Janapa Reddi:
Event-based scheduling for energy-efficient QoS (eQoS) in mobile Web applications. 137-149 - Nachiappan Chidambaram Nachiappan, Praveen Yedlapalli, Niranjan Soundararajan, Anand Sivasubramaniam, Mahmut T. Kandemir, Ravishankar R. Iyer, Chita R. Das:
Domain knowledge based energy management in handhelds. 150-160 - Jingwen Leng, Yazhou Zu, Vijay Janapa Reddi:
GPU voltage noise: Characterization and hierarchical smoothing of spatial and temporal voltage noise interference in GPU architectures. 161-173 - Ankit Sethia, Davoud Anoushe Jamshidi, Scott A. Mahlke:
Mascar: Speeding up GPU warps by reducing memory pitstops. 174-185 - Alberto Ros, Mahdad Davari, Stefanos Kaxiras:
Hierarchical private/shared classification: The key to simple and efficient coherence for clustered cache hierarchies. 186-197 - Lucia G. Menezo, Valentin Puente, José-Ángel Gregorio:
Flask coherence: A morphable hybrid coherence protocol to balance energy, performance and scalability. 198-209 - Misel-Myrto Papadopoulou, Xin Tong, André Seznec, Andreas Moshovos:
Prediction-based superpage-friendly TLB designs. 210-222 - Yu Du, Miao Zhou, Bruce R. Childers, Daniel Mossé, Rami G. Melhem:
Supporting superpages in non-contiguous physical memory. 223-234 - Mohammad A. Islam, A. Hasan Mahmud, Shaolei Ren, Xiaorui Wang:
Paying to save: Reducing cost of colocation data center via rewards. 235-245 - Vinicius Petrucci, Michael A. Laurenzano, John Doherty, Yunqi Zhang, Daniel Mossé, Jason Mars, Lingjia Tang:
Octopus-Man: QoS-driven task management for heterogeneous multicores in warehouse-scale computers. 246-258 - Ming Liu, Tao Li, Neo Jia, Andy Currid, Vladimir Troy:
Understanding the virtualization "Tax" of scale-out pass-through GPUs in GaaS clouds: An empirical study. 259-270 - Chang-Hong Hsu, Yunqi Zhang, Michael A. Laurenzano, David Meisner, Thomas F. Wenisch, Jason Mars, Lingjia Tang, Ronald G. Dreslinski:
Adrenaline: Pinpointing and reining in tail queries with quick voltage boosting. 271-282 - Amin Farmahini Farahani, Jung Ho Ahn, Katherine Morrow, Nam Sung Kim:
NDA: Near-DRAM acceleration architecture leveraging commodity DRAM devices and standard memory modules. 283-295 - Hao Wang, Chang-Jae Park, Gyungsu Byun, Jung Ho Ahn, Nam Sung Kim:
Alloy: Parallel-serial memory channel architecture for single-chip heterogeneous processor systems. 296-308 - Prashant J. Nair, Chia-Chen Chou, Bipin Rajendran, Moinuddin K. Qureshi:
Reducing read latency of phase change memory via early read and Turbo Read. 309-319 - Rakan Maddah, Seyed Mohammad Seyedzadeh, Rami G. Melhem:
CAFO: Cost aware flip optimization for asymmetric memories. 320-330 - Devesh Tiwari, Saurabh Gupta, James H. Rogers, Don Maxwell, Paolo Rech, Sudharshan S. Vazhkudai, Daniel Oliveira, Dave Londo, Nathan DeBardeleben, Philippe Olivier Alexandre Navaux, Luigi Carro, Arthur S. Bland:
Understanding GPU errors on large-scale HPC systems and the implications for system design and operation. 331-342 - Aamer Jaleel, Joseph Nuzman, Adrian Moga, Simon C. Steely Jr., Joel S. Emer:
High performing cache hierarchies for server workloads: Relaxing inclusion to capture the latency benefits of exclusive caches. 343-353 - Neha Agarwal, David W. Nellans, Mike O'Connor, Stephen W. Keckler, Thomas F. Wenisch:
Unlocking bandwidth for GPUs in CC-NUMA systems. 354-365 - Manish Arora, Srilatha Manne, Indrani Paul, Nuwan Jayasena, Dean M. Tullsen:
Understanding idle behavior and power gating mechanisms in the context of modern benchmarks on CPU-GPU Integrated systems. 366-377 - Lizhong Chen, Di Zhu, Massoud Pedram, Timothy Mark Pinkston:
Power punch: Towards non-blocking power-gating of NoC routers. 378-389 - Ikki Fujiwara, Michihiro Koibuchi, Tomoya Ozaki, Hiroki Matsutani, Henri Casanova:
Augmenting low-latency HPC network with free-space optical links. 390-401 - Nikolaos Chrysos, Cyriel Minkenberg, Mark Rudquist, Claude Basso, Brian Vanderpool:
SCOC: High-radix switches made of bufferless clos networks. 402-414 - Jongmin Won, Gwangsun Kim, John Kim, Ted Jiang, Mike Parker, Steve Scott:
Overcoming far-end congestion in large-scale networks. 415-427 - David J. Palframan, Nam Sung Kim, Mikko H. Lipasti:
iPatch: Intelligent fault patching to improve energy efficiency. 428-438 - Dong-Wan Kim, Mattan Erez:
Balancing reliability, cost, and performance tradeoffs with FreeFault. 439-450 - Xinxin Jin, Soyeon Park, Tianwei Sheng, Rishan Chen, Zhiyong Shan, Yuanyuan Zhou:
FTXen: Making hypervisor resilient to hardware faults on relaxed cores. 451-462 - Henry Duwe, Xun Jian, Rakesh Kumar:
Correction prediction: Reducing error correction latency for on-chip memories. 463-475 - Cong Xu, Dimin Niu, Naveen Muralimanohar, Rajeev Balasubramonian, Tao Zhang, Shimeng Yu, Yuan Xie:
Overcoming the challenges of crossbar resistive memory architectures. 476-488 - Donghyuk Lee, Yoongu Kim, Gennady Pekhimenko, Samira Manabi Khan, Vivek Seshadri, Kevin Kai-Wei Chang, Onur Mutlu:
Adaptive-latency DRAM: Optimizing DRAM timing for the common-case. 489-501 - Young Hoon Son, Sukhan Lee, Seongil O, Sanghyuk Kwon, Nam Sung Kim, Jung Ho Ahn:
CiDRA: A cache-inspired DRAM resilience architecture. 502-513 - Sean Franey, Mikko H. Lipasti:
Tag tables. 514-525 - Kaisheng Ma, Yang Zheng, Shuangchen Li, Karthik Swaminathan, Xueqing Li, Yongpan Liu, Jack Sampson, Yuan Xie, Vijaykrishnan Narayanan:
Architecture exploration for ambient energy harvesting nonvolatile processors. 526-537 - Nathan Beckmann, Po-An Tsai, Daniel Sánchez:
Scaling distributed cache hierarchies through computation and data co-scheduling. 538-550 - Yu Cai, Yixin Luo, Erich F. Haratsch, Ken Mai, Onur Mutlu:
Data retention in MLC NAND flash memory: Characterization, optimization, and recovery. 551-563 - Gene Y. Wu, Joseph L. Greathouse, Alexander Lyashevsky, Nuwan Jayasena, Derek Chiou:
GPGPU performance and power estimation using machine learning. 564-576 - Sam Likun Xi, Hans M. Jacobson, Pradip Bose, Gu-Yeon Wei, David M. Brooks:
Quantifying sources of error in McPAT and potential impacts on architectural studies. 577-589 - Minshu Zhao, Donald Yeung:
Studying the impact of multicore processor scaling on directory techniques via reuse distance analysis. 590-602 - Thierry Moreau, Mark Wyse, Jacob Nelson, Adrian Sampson, Hadi Esmaeilzadeh, Luis Ceze, Mark Oskin:
SNNAP: Approximate computing on programmable SoCs via neural acceleration. 603-614 - Beayna Grigorian, Nazanin Farahpour, Glenn Reinman:
BRAINIAC: Bringing reliable accuracy into neurally-implemented approximate computing. 615-626 - Sarah Neuwirth, Dirk Frey, Mondrian Nuessle, Ulrich Brüning:
Scalable communication architecture for network-attached accelerators. 627-638 - Casen Hunger, Mikhail Kazdagli, Ankit Singh Rawat, Alexandros G. Dimakis, Sriram Vishwanath, Mohit Tiwari:
Understanding contention-based channels and using them for defense. 639-650 - Meltem Ozsoy, Caleb Donovick, Iakov Gorelik, Nael B. Abu-Ghazaleh, Dmitry V. Ponomarev:
Malware-aware processors: A framework for efficient online malware detection. 651-661 - Daniel Lo, Tao Chen, Mohamed Ismail, G. Edward Suh:
Run-time monitoring with adjustable overhead using dataflow-guided filtering. 662-674
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.