default search action
ACM Transactions on Architecture and Code Optimization, Volume 8
Volume 8, Number 1, April 2011
- Stijn Eyerman, Lieven Eeckhout:
Fine-grained DVFS using on-chip regulators. 1:1-1:24 - Chen-Yong Cher, Eren Kursun:
Exploring the effects of on-chip thermal variation on high-performance multicore architectures. 2:1-2:22 - Carole-Jean Wu, Margaret Martonosi:
Adaptive timekeeping replacement: Fine-grained capacity management for shared CMP caches. 3:1-3:26 - Lucas Vespa, Ning Weng:
Deterministic finite automata characterization and optimization for scalable pattern matching. 4:1-4:31 - Abhishek Bhattacharjee, Gilberto Contreras, Margaret Martonosi:
Parallelization libraries: Characterizing and reducing overheads. 5:1-5:29
Volume 8, Number 2, July 2011
- Xiangyu Dong, Yuan Xie, Naveen Muralimanohar, Norman P. Jouppi:
Hybrid checkpointing using emerging nonvolatile memories for future exascale systems. 6:1-6:29 - Jianjun Li, Chenggang Wu, Wei-Chung Hsu:
Efficient and effective misaligned data access handling in a dynamic binary translation system. 7:1-7:29 - Guru Venkataramani, Christopher J. Hughes, Sanjeev Kumar, Milos Prvulovic:
DeFT: Design space exploration for on-the-fly detection of coherence misses. 8:1-8:27 - Jason Hiser, Daniel W. Williams, Wei Hu, Jack W. Davidson, Jason Mars, Bruce R. Childers:
Evaluating indirect branch handling mechanisms in software dynamic translation systems. 9:1-9:28
Volume 8, Number 3, October 2011
- Xi E. Chen, Tor M. Aamodt:
Hybrid analytical modeling of pending cache hits, data prefetching, and MSHRs. 10:1-10:28 - Marios Kleanthous, Yiannakis Sazeides:
CATCH: A mechanism for dynamically detecting cache-content-duplication in instruction caches. 11:1-11:27 - Hans Vandierendonck, André Seznec:
Managing SMT resource usage through speculative instruction window weighting. 12:1-12:20 - Po-Han Wang, Chia-Lin Yang, Yen-Ming Chen, Yu-Jung Cheng:
Power gating strategies on GPUs. 13:1-13:25 - Min Feng, Chen Tian, Changhui Lin, Rajiv Gupta:
Dynamic access distance driven cache replacement. 14:1-14:30 - Ahmad Samih, Yan Solihin, Anil Krishna:
Evaluating placement policies for managing capacity sharing in CMP architectures with private caches. 15:1-15:23 - Chang-Ching Yeh, Kuei-Chung Chang, Tien-Fu Chen, Chingwei Yeh:
Maintaining performance on power gating of microprocessor functional units by using a predictive pre-wakeup strategy. 16:1-16:27 - Hyunjin Lee, Sangyeun Cho, Bruce R. Childers:
DEFCAM: A design and evaluation framework for defect-tolerant cache memories. 17:1-17:29
Volume 8, Number 4, January 2012
- Per Stenström, Koen De Bosschere:
Introduction to the special issue on high-performance and embedded architectures and compilers. 18:1-18:2 - Jorge Albericio, Ruben Gran Tejero, Pablo Ibáñez, Víctor Viñals, José María Llabería:
ABS: A low-cost adaptive controller for prefetching in a banked shared last-level cache. 19:1-19:20 - Ali Galip Bayrak, Nikola Velickovic, Paolo Ienne, Wayne P. Burleson:
An architecture-independent instruction shuffler to protect against side-channel attacks. 20:1-20:19 - John Demme, Simha Sethumadhavan:
Approximate graph clustering for program characterization. 21:1-21:21 - Mihai Pricopi, Tulika Mitra:
Bahurupi: A polymorphic heterogeneous multi-core architecture. 22:1-22:21 - Jeroen Van Cleemput, Bart Coppens, Bjorn De Sutter:
Compiler mitigations for time attacks on modern x86 processors. 23:1-23:20 - Jason McCandless, David Gregg:
Compiler techniques to improve dynamic branch prediction for indirect jump and call instructions. 24:1-24:20 - Antonio García-Guirado, Ricardo Fernández Pascual, Alberto Ros, José M. García:
DAPSCO: Distance-aware partially shared cache organization. 25:1-25:19 - Zhenjiang Wang, Chenggang Wu, Pen-Chung Yew, Jianjun Li, Di Xu:
On-the-fly structure splitting for heap objects. 26:1-26:20 - Dibyendu Das, Benoît Dupont de Dinechin, Ramakrishna Upadrasta:
Efficient liveness computation using merge sets and DJ-graphs. 27:1-27:18 - George Patsilaras, Niket K. Choudhary, James Tuck:
Efficiently exploiting memory level parallelism on asymmetric coupled cores in the dark silicon era. 28:1-28:21 - Roman Malits, Evgeny Bolotin, Avinoam Kolodny, Avi Mendelson:
Exploring the limits of GPGPU scheduling in control flow bound applications. 29:1-29:22 - Lois Orosa, Elisardo Antelo, Javier D. Bruguera:
FlexSig: Implementing flexible hardware signatures. 30:1-30:20 - J. Rubén Titos Gil, Manuel E. Acacio, José M. García, Tim Harris, Adrián Cristal, Osman S. Unsal, Ibrahim Hur, Mateo Valero:
Hardware transactional memory with software-defined conflicts. 31:1-31:20 - Yongjoo Kim, Jongeun Lee, Toan X. Mai, Yunheung Paek:
Improving performance of nested loops on reconfigurable array processors. 32:1-32:23 - Madhura Purnaprajna, Paolo Ienne:
Making wide-issue VLIW processors viable on FPGAs. 33:1-33:16 - Petar Radojkovic, Sylvain Girbal, Arnaud Grasset, Eduardo Quiñones, Sami Yehia, Francisco J. Cazorla:
On the evaluation of the impact of shared resources in multithreaded COTS processors in time-critical environments. 34:1-34:25 - Leonid Domnitser, Aamer Jaleel, Jason Loew, Nael B. Abu-Ghazaleh, Dmitry Ponomarev:
Non-monopolizable caches: Low-complexity mitigation of cache side channel attacks. 35:1-35:21 - Alejandro Rico, Felipe Cabarcas, Carlos Villavieja, Milan Pavlovic, Augusto Vega, Yoav Etsion, Alex Ramírez, Mateo Valero:
On the simulation of large-scale architectures using multiple application abstraction levels. 36:1-36:20 - Selma Saidi, Pranav Tendulkar, Thierry Lepley, Oded Maler:
Optimizing explicit data transfers for data parallel applications on the cell architecture. 37:1-37:20 - Min Feng, Changhui Lin, Rajiv Gupta:
PLDS: Partitioning linked data structures for parallelism. 38:1-38:21 - Benoît Pradelle, Alain Ketterlin, Philippe Clauss:
Polyhedral parallelization of binary code. 39:1-39:21 - Yaozu Dong, Yu Chen, Zhenhao Pan, Jinquan Dai, Yunhong Jiang:
ReNIC: Architectural extension to SR-IOV I/O virtualization for efficient replication. 40:1-40:22 - Tom M. Bruintjes, Karel H. G. Walters, Sabih H. Gerez, Bert Molenkamp, Gerard J. M. Smit:
Sabrewing: A lightweight architecture for combined floating-point and integer arithmetic. 41:1-41:22 - Mario Kicherer, Fabian Nowak, Rainer Buchty, Wolfgang Karl:
Seamlessly portable applications: Managing the diversity of modern heterogeneous systems. 42:1-42:20 - Nathanaël Prémillieu, André Seznec:
SYRANT: SYmmetric resource allocation on not-taken and taken paths. 43:1-43:20 - William Hasenplaugh, Pritpal S. Ahuja, Aamer Jaleel, Simon C. Steely Jr., Joel S. Emer:
The gradient-based cache partitioning algorithm. 44:1-44:21 - Javier Lira, Timothy M. Jones, Carlos Molina, Antonio González:
The migration prefetcher: Anticipating data promotion in dynamic NUCA caches. 45:1-45:20 - Kishore Kumar Pusukuri, Rajiv Gupta, Laxmi N. Bhuyan:
Thread Tranquilizer: Dynamically reducing performance variation. 46:1-46:21 - Dongsong Zhang, Deke Guo, Fang-Yuan Chen, Fei Wu, Tong Wu, Ting Cao, Shiyao Jin:
TL-plane-based multi-core energy-efficient real-time scheduling algorithm for sporadic tasks. 47:1-47:20 - Michael J. Lyons, Mark Hempstead, Gu-Yeon Wei, David M. Brooks:
The accelerator store: A shared memory framework for accelerator-based systems. 48:1-48:22 - Daniel A. Orozco, Elkin Garcia, Rishi Khan, Kelly Livingston, Guang R. Gao:
Toward high-throughput algorithms on many-core architectures. 49:1-49:21 - Kevin Stock, Louis-Noël Pouchet, P. Sadayappan:
Using machine learning to improve automatic vectorization. 50:1-50:23 - Kanit Therdsteerasukdi, Gyungsu Byun, Jason Cong, M. Frank Chang, Glenn Reinman:
Utilizing RF-I and intelligent scheduling for better throughput/watt in a mobile GPU memory system. 51:1-51:19 - Frederick Ryckbosch, Stijn Polfliet, Lieven Eeckhout:
VSim: Simulating multi-server setups at near native hardware speed. 52:1-52:20 - Miao Zhou, Yu Du, Bruce R. Childers, Rami G. Melhem, Daniel Mossé:
Writeback-aware partitioning and replacement for last-level caches in phase change main memory systems. 53:1-53:21 - Qingping Wang, Sameer Kulkarni, John Cavazos, Michael F. Spear:
A transactional memory with automatic performance tuning. 54:1-54:23 - Bartosz Bogdanski, Sven-Arne Reinemo, Frank Olaf Sem-Jacobsen, Ernst Gunnar Gran:
sFtree: A fully connected and deadlock-free switch-to-switch routing algorithm for fat-trees. 55:1-55:20
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.