default search action
15th PPOPP 2010: Bangalore, India
- R. Govindarajan, David A. Padua, Mary W. Hall:
Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2010, Bangalore, India, January 9-14, 2010. ACM 2010, ISBN 978-1-60558-877-3 - Tilak Agerwala:
Exascale computing: the challenges and opportunities in the next decade. 1-2
Analysis and optimization of parallel programs
- Mario Méndez-Lojo, Donald Nguyen, Dimitrios Prountzos, Xin Sui, Muhammad Amber Hassaan, Milind Kulkarni, Martin Burtscher, Keshav Pingali:
Structure-driven optimizations for amorphous data-parallel programs. 3-14 - Katherine E. Coons, Sebastian Burckhardt, Madanlal Musuvathi:
GAMBIT: effective unit testing for concurrency libraries. 15-24 - Jonathan K. Lee, Jens Palsberg:
Featherweight X10: a core calculus for async-finish parallelism. 25-36 - Sandya Mannarswamy, Dhruva R. Chakrabarti, Kaushik Rajan, Sujoy Saraswati:
Compiler aided selective lock assignment for improving the performance of software transactional memory. 37-46
Supporting transactional memory
- Christopher J. Rossbach, Owen S. Hofmann, Emmett Witchel:
Is transactional programming actually easier? 47-56 - Ferad Zyulkyarov, Tim Harris, Osman S. Unsal, Adrián Cristal, Mateo Valero:
Debugging programs that use atomic blocks and transactional memory. 57-66 - Luke Dalessandro, Michael F. Spear, Michael L. Scott:
NOrec: streamlining STM by abolishing ownership records. 67-78 - Walther Maldonado, Patrick Marlier, Pascal Felber, Adi Suissa, Danny Hendler, Alexandra Fedorova, Julia L. Lawall, Gilles Muller:
Scheduling support for transactional memory contention management. 79-90 - João Pedro Barreto, Aleksandar Dragojevic, Paulo Ferreira, Rachid Guerraoui, Michal Kapalka:
Leveraging parallel nesting in transactional memory. 91-100
Panel
- Josep Torrellas, Bill Gropp, Jaime H. Moreno, Kunle Olukotun, Vivek Sarkar:
Extreme scale computing: challenges and opportunities. 101-102 - Arvind:
Is hardware innovation over? 103-104
Parallel libraries, applications and tools on GPUs
- Sara S. Baghsorkhi, Matthieu Delahaye, Sanjay J. Patel, William D. Gropp, Wen-mei W. Hwu:
An adaptive performance modeling tool for GPU architectures. 105-114 - JeeWhan Choi, Amik Singh, Richard W. Vuduc:
Model-driven autotuning of sparse matrix-vector multiply on GPUs. 115-126 - Yao Zhang, Jonathan Cohen, John D. Owens:
Fast tridiagonal solvers on the GPU. 127-136 - Edans Flavius de Oliveira Sandes, Alba Cristina Magalhaes Alves de Melo:
CUDAlign: using GPU to accelerate the comparison of megabase genomic sequences. 137-146
Scalable parallel computing
- Steven A. Hofmeyr, Costin Iancu, Filip Blagojevic:
Load balancing on speed. 147-158 - Torsten Hoefler, Christian Siebert, Andrew Lumsdaine:
Scalable communication protocols for dynamic sparse data exchange. 159-168 - John W. Romein, P. Chris Broekema, Jan David Mol, Rob van Nieuwpoort:
The LOFAR correlator: implementation and performance analysis. 169-178
Thread management and scheduling
- Alexandros Tzannes, George C. Caragea, Rajeev Barua, Uzi Vishkin:
Lazy binary-splitting: a run-time adaptive work-stealing scheduler. 179-190 - Petar Radojkovic, Vladimir Cakarevic, Javier Verdú, Alex Pajuelo, Francisco J. Cazorla, Mario Nemirovsky, Mateo Valero:
Thread to strand binding of parallel network applications in massive multi-threaded systems. 191-202
Locality-aware parallelism
- Eddy Z. Zhang, Yunlian Jiang, Xipeng Shen:
Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs? 203-212 - Lixia Liu, Zhiyuan Li:
Improving parallelism and locality with asynchronous algorithms. 213-222 - Anthony M. Castaldo, R. Clint Whaley:
Scaling LAPACK panel operations using parallel cache assignment. 223-232 - Dean F. Sutherland, William L. Scherlis:
Composable thread coloring. 233-244
Locks and concurrent data structures
- Kunal Agrawal, Charles E. Leiserson, Jim Sukha:
Helper locks for fork-join parallel programming. 245-256 - Nathan Grasso Bronson, Jared Casper, Hassan Chafi, Kunle Olukotun:
A practical concurrent binary search tree. 257-268 - Nathan R. Tallent, John M. Mellor-Crummey, Allan Porterfield:
Analyzing lock contention in multithreaded applications. 269-280 - Gautam Upadhyaya, Samuel P. Midkiff, Vijay S. Pai:
Using data structure knowledge for efficient lock generation and strong atomicity. 281-292
Performance modeling and prediction
- Qasim Ali, Samuel P. Midkiff, Vijay S. Pai:
Modeling advanced collective communication algorithms on cell-based systems. 293-304 - Jidong Zhai, Wenguang Chen, Weimin Zheng:
PHANTOM: predicting performance of parallel applications on large-scale parallel machines using a single node. 305-314 - Farhana Aleen, Monirul Sharif, Santosh Pande:
Input-driven dynamic execution prediction of streaming applications. 315-324
Posters
- Daniel Lupei, Bogdan Simion, Don Pinto, Matthew Misler, Mihai Burcea, William Krick, Cristiana Amza:
Towards scalable and transparent parallelization of multiplayer games using transactional memory support. 325-326 - Swann Perarnau, Guillaume Huard:
KRASH: reproducible CPU load generation on many cores machines. 327-328 - Sai Prashanth Muralidhara, Mahmut T. Kandemir, Padma Raghavan:
Intra-application shared cache partitioning for multithreaded applications. 329-330 - Alokika Dash, Brian Demsky:
Symbolic prefetching in transactional distributed shared memory. 331-332 - Dhruva R. Chakrabarti:
New abstractions for effective performance analysis of STM programs. 333-334 - Chao Zhang, Chen Ding, Xiaoming Gu, Kirk Kelsey, Tongxin Bai, Xiaobing Feng:
Continuous speculative program parallelization in software. 335-336 - Vladimir Marjanovic, Jesús Labarta, Eduard Ayguadé, Mateo Valero:
Effective communication and computation overlap with hybrid MPI/SMPSs. 337-338 - Daniel Cederman, Philippas Tsigas:
Supporting lock-free composition of concurrent data objects. 339-340 - Yi Guo, Yisheng Zhao, Vincent Cavé, Vivek Sarkar:
SLAW: a scalable locality-aware adaptive work-stealing scheduler for multi-core systems. 341-342 - Yi Yang, Ping Xiang, Jingfei Kong, Huiyang Zhou:
An optimizing compiler for GPGPU programs with input-data sharing. 343-344 - Aparna Chandramowlishwaran, Kathleen Knobe, Richard W. Vuduc:
Applying the concurrent collections programming model to asynchronous parallel dense linear algebra. 345-346 - Henry Hoffmann, Jonathan Eastep, Marco D. Santambrogio, Jason E. Miller, Anant Agarwal:
Application heartbeats for software performance and health. 347-348 - Donald E. Porter, Emmett Witchel:
Modeling transactional memory workload performance. 349-350 - John D. Carter, William B. Gardner, Gary Gréwal:
The pilot library for novice MPI programmers. 351-352 - Byunghyun Jang, Perhaad Mistry, Dana Schaa, Rodrigo Dominguez, David R. Kaeli:
Data transformations enabling loop vectorization on multithreaded data parallel architectures. 353-354 - Gregory Buehrer, Srinivasan Parthasarathy, Shirish Tatikonda:
A distributed placement service for graph-structured and tree-structured data. 355-356 - Guodong Li, Ganesh Gopalakrishnan, Robert M. Kirby, Daniel J. Quinlan:
A symbolic verifier for CUDA programs. 357-358
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.