default search action
PACT 2010: Vienna, Austria
- Valentina Salapura, Michael Gschwind, Jens Knoop:
19th International Conference on Parallel Architectures and Compilation Techniques, PACT 2010, Vienna, Austria, September 11-15, 2010. ACM 2010, ISBN 978-1-4503-0178-7
Keynote address I
- David A. Ferrucci:
Build Watson: an overview of DeepQA for the Jeopardy! challenge. 1-2
Keynote address II
- Keshav Pingali:
Towards a science of parallel programming. 3-4
Keynote address III
- Wen-mei W. Hwu:
Raising the level of many-core programming with compiler technology: meeting a grand challenge. 5-6
Power-aware design
- Víctor Jiménez, Francisco J. Cazorla, Roberto Gioiosa, Mateo Valero, Carlos Boneti, Eren Kursun, Chen-Yong Cher, Canturk Isci, Alper Buyuktosunoglu, Pradip Bose:
Power and thermal characterization of POWER6 system. 7-18 - Karthik Ganesan, Jungho Jo, William Lloyd Bircher, Dimitris Kaseridis, Zhibin Yu, Lizy K. John:
System-level max power (SYMPO): a systematic approach for escalating system-level power consumption using synthetic benchmarks. 19-28 - Jonathan A. Winter, David H. Albonesi, Christine A. Shoemaker:
Scalable thread scheduling and global power management for heterogeneous many-core architectures. 29-40 - Matthew A. Watkins, David H. Albonesi:
Dynamically managed multithreaded reconfigurable architectures for chip multiprocessors. 41-52
Analysis & optimization
- Derek L. Schuff, Milind Kulkarni, Vijay S. Pai:
Accelerating multicore reuse distance analysis with sampling and parallelization. 53-64 - Nalini Vasudevan, Kedar S. Namjoshi, Stephen A. Edwards:
Simple and fast biased locks. 65-74 - Hari K. Pyla, Srinidhi Varadarajan:
Avoiding deadlock avoidance. 75-86 - Yun Zhang, Jae W. Lee, Nick P. Johnson, David I. August:
DAFT: decoupled acyclic fault tolerance. 87-98
Caches & coherence I
- John H. Kelm, Matthew R. Johnson, Steven S. Lumetta, Sanjay J. Patel:
WAYPOINT: scaling coherence to thousand-core architectures. 99-110 - Daehoon Kim, Jeongseob Ahn, Jaehong Kim, Jaehyuk Huh:
Subspace snooping: filtering snoops with operating system support. 111-122 - Nick Barrow-Williams, Christian Fensch, Simon W. Moore:
Proximity coherence for chip multiprocessors. 123-134 - Hongzhou Zhao, Arrvindh Shriraman, Sandhya Dwarkadas:
SPACE: sharing pattern-based directory coherence for multicore scalability. 135-146
Parallelization and parallel programming I
- M. Aater Suleman, Moinuddin K. Qureshi, Khubaib, Yale N. Patt:
Feedback-directed pipeline parallelism. 147-156 - Zheng Li, Olivier Certner, José Duato, Olivier Temam:
Scalable hardware support for conditional parallelization. 157-168 - Jisheng Zhao, Jun Shirako, V. Krishna Nandivada, Vivek Sarkar:
Reducing task creation and termination overhead in explicitly parallel programs. 169-180 - Ganesh S. Dasika, Ankit Sethia, Vincentius Robby, Trevor N. Mudge, Scott A. Mahlke:
MEDICS: ultra-portable processing for medical image reconstruction. 181-192
Heterogeneous platforms and frameworks
- Jaejin Lee, Jungwon Kim, Sangmin Seo, Seungkyun Kim, Jung-Ho Park, Honggyu Kim, Thanh Tuan Dao, Yongjin Cho, Sung Jong Seo, Seung Hak Lee, Seung Mo Cho, Hyo Jung Song, Sang-Bum Suh, Jong-Deok Choi:
An OpenCL framework for heterogeneous multicores with local memory. 193-204 - Jayanth Gummaraju, Laurent Morichetti, Michael Houston, Ben Sander, Benedict R. Gaster, Bixia Zheng:
Twin peaks: a software platform for heterogeneous computing on general-purpose and graphics processors. 205-216 - Chuntao Hong, Dehao Chen, Wenguang Chen, Weimin Zheng, Haibo Lin:
MapCG: writing parallel program portable between CPU and GPU. 217-226 - Pradip Hari, John B. P. McCabe, Jonathan Banafato, Marcus Henry, Kevin Ko, Emmanouil Koukoumidis, Ulrich Kremer, Margaret Martonosi, Li-Shiuan Peh:
Adaptive spatiotemporal node selection in dynamic networks. 227-236
Scheduling and design optimization
- Di Xu, Chenggang Wu, Pen-Chung Yew:
On mitigating memory bandwidth contention through bandwidth-aware scheduling. 237-248 - Sergey Zhuravlev, Sergey Blagodurov, Alexandra Fedorova:
AKULA: a toolset for experimenting and developing thread placement algorithms on multicore systems. 249-260 - Srihari Cadambi, Abhinandan Majumdar, Michela Becchi, Srimat T. Chakradhar, Hans Peter Graf:
A programmable parallel accelerator for learning and classification. 273-284
Best papers
- Ferad Zyulkyarov, Srdjan Stipic, Tim Harris, Osman S. Unsal, Adrián Cristal, Ibrahim Hur, Mateo Valero:
Discovering and understanding performance bottlenecks in transactional applications. 285-294 - Changhui Lin, Vijay Nagarajan, Rajiv Gupta:
Efficient sequential consistency using conditional fences. 295-306 - Zheng Wang, Michael F. P. O'Boyle:
Partitioning streaming parallelism for multi-cores: a machine learning based approach. 307-318 - Manu Awasthi, David W. Nellans, Kshitij Sudan, Rajeev Balasubramonian, Al Davis:
Handling the problems and opportunities posed by multiple on-chip memory controllers. 319-330
Languages and compilers
- Amit Kumar, Lorenzo De Carli, Sung Jin Kim, Marc de Kruijf, Karthikeyan Sankaralingam, Cristian Estan, Somesh Jha:
Design and implementation of the PLUG architecture for programmable and efficient network lookups. 331-342 - Uday Bondhugula, Oktay Günlük, Sanjeeb Dash, Lakshminarayanan Renganarayanan:
A model for fusion and code motion in an automatic parallelizing compiler. 343-352 - Gregory Frederick Diamos, Andrew Kerr, Sudhakar Yalamanchili, Nathan Clark:
Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems. 353-364 - William Thies, Saman P. Amarasinghe:
An empirical characterization of stream programs and its implications for language and compiler design. 365-376
Parallelization and parallel programming II
- Hans Vandierendonck, Sean Rul, Koen De Bosschere:
The Paralax infrastructure: automatic parallelization with a helping hand. 389-400 - Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine:
AM++: a generalized active message framework. 401-410 - I-Ting Angelina Lee, Silas Boyd-Wickizer, Zhiyi Huang, Charles E. Leiserson:
Using memory mapping to support cactus stacks in work-stealing runtime systems. 411-420
Speculation
- Rania H. Mameesh, Manoj Franklin:
Speculative-aware execution: a simple and efficient technique for utilizing multi-cores to improve single-thread performance. 421-430 - Walid J. Ghandour, Haitham Akkary, Wes Masri:
The potential of using dynamic information flow analysis in data value prediction. 431-442 - Tanausú Ramírez, Alex Pajuelo, Oliverio J. Santana, Onur Mutlu, Mateo Valero:
Efficient runahead threads. 443-452 - Yangchun Luo, Venkatesan Packirisamy, Wei-Chung Hsu, Antonia Zhai:
Energy efficient speculative threads: dynamic thread allocation in Same-ISA heterogeneous multicore systems. 453-464
Caches and coherence II
- Seth H. Pugsley, Josef B. Spjut, David W. Nellans, Rajeev Balasubramonian:
SWEL: hardware cache coherence protocols to map shared data onto shared caches. 465-476 - George Kurian, Jason E. Miller, James Psota, Jonathan Eastep, Jifeng Liu, Jürgen Michel, Lionel C. Kimerling, Anant Agarwal:
ATAC: a 1000-core cache-coherent processor with on-chip optical network. 477-488 - Samira Manabi Khan, Daniel A. Jiménez, Doug Burger, Babak Falsafi:
Using dead blocks as a virtual victim cache. 489-500
Data distribution and tiling
- Yong Li, Ahmed Abousamra, Rami G. Melhem, Alex K. Jones:
Compiler-assisted data distribution for chip multiprocessors. 501-512 - I-Jui Sung, John A. Stratton, Wen-mei W. Hwu:
Data layout transformation exploiting memory-level parallelism in structured grid many-core applications. 513-522 - Rong Chen, Haibo Chen, Binyu Zang:
Tiled-MapReduce: optimizing resource usages of data-parallel applications on multicore with tiling. 523-534
Poster session
- Ali Bakhoda, John Kim, Tor M. Aamodt:
On-chip network design considerations for compute accelerators. 535-536 - Rajesh Bordawekar, Uday Bondhugula, Ravi Rao:
Believe it or not!: mult-core CPUs can match GPU performance for a FLOP-intensive application! 537-538 - Muhammad Amber Hassaan, Martin Burtscher, Keshav Pingali:
Ordered and unordered algorithms for parallel breadth first search. 539-540 - Matthew Misler, Natalie D. Enright Jerger:
Moths: mobile threads for on-chip networks. 541-542 - Santhosh Sharma Ananthramu, Deepak Majeti, Sanjeev Kumar Aggarwal, Mainak Chaudhuri:
Improving speculative loop parallelization via selective squash and speculation reuse. 543-544 - Duane Merrill, Andrew S. Grimshaw:
Revisiting sorting for GPGPU stream architectures. 545-546 - Sandya S. Mannarswamy, Ramaswamy Govindarajan:
Analyzing cache performance bottlenecks of STM applications and addressing them with compiler's help. 547-548 - Mohammad Hammoud, Sangyeun Cho, Rami G. Melhem:
An intra-tile cache set balancing scheme. 549-550 - David Eklov, David Black-Schaffer, Erik Hagersten:
StatCC: a statistical cache contention model. 551-552 - Wenjing Ma, Gagan Agrawal:
An integer programming framework for optimizing shared memory use on GPUs. 553-554 - Rafael Ubal, Julio Sahuquillo, Salvador Petit, Pedro López, José Duato:
Exploiting subtrace-level parallelism in clustered processors. 555-556 - Sergey Blagodurov, Sergey Zhuravlev, Alexandra Fedorova, Ali Kamali:
A case for NUMA-aware contention management on multicore systems. 557-558 - Haibo Lin, Tao Liu, Huoding Li, Tong Chen, Lakshminarayanan Renganarayanan, Kevin O'Brien, Ling Shao:
DMATiler: revisiting loop tiling for direct memory access. 559-560 - Christian Bienia, Kai Li:
Scaling of the PARSEC benchmark inputs. 561-562 - Richard West, Puneet Zaroo, Carl A. Waldspurger, Xiao Zhang:
Online cache modeling for commodity multicore processors. 563-564 - Ahmed Abousamra, Rami G. Melhem, Alex K. Jones:
NoC-aware cache design for chip multiprocessors. 565-566 - Jun Lee, Sangmin Seo, Jaejin Lee:
A software-SVM-based transactional memory for multicore accelerator architectures with local memory. 567-568 - R. Manikantan, Kaushik Rajan, R. Govindarajan:
NUcache: a multicore cache organization based on next-use distance. 569-570 - Shantanu Gupta, Shuguang Feng, Amin Ansari, Ganesh S. Dasika, Scott A. Mahlke:
CoreGenesis: erasing core boundaries for robust and configurable performance. 571-572 - Rajkishore Barik, Jisheng Zhao, Vivek Sarkar:
Automatic vector instruction selection for dynamic compilation. 573-574 - Michael Mihn-Jong Lee, John Kim, Dennis Abts, Michael R. Marty, Jae W. Lee:
Approximating age-based arbitration in on-chip networks. 575-576
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.