SC 2010: New Orleans, LA, USA
- Conference on High Performance Computing Networking, Storage and Analysis, SC 2010, New Orleans, LA, USA, November 13-19, 2010. IEEE 2010, ISBN 978-1-4244-7559-9
- Tsuyoshi Hamada, Keigo Nitadori:
190 TFlops Astrophysical N-body Simulation on a Cluster of GPUs. 1-9
- Carsten Burstedde, Omar Ghattas, Michael Gurnis, Tobin Isaac, Georg Stadler, Tim Warburton, Lucas C. Wilcox:
Extreme-Scale AMR. 1-12
- Yifeng Cui, Kim B. Olsen, Thomas H. Jordan, Kwangyoon Lee, Jun Zhou, Patrick Small, Daniel Roten, Geoffrey Ely, Dhabaleswar K. Panda, Amit Chourasia, John M. Levesque, Steven M. Day, Philip Maechling:
Scalable Earthquake Simulation on Petascale Supercomputers. 1-20
- Amanda Peters Randles, Simone Melchionna, Efthimios Kaxiras, Jonas Lätt, Joy K. Sircar, Massimo Bernaschi, Mauro Bisson, Sauro Succi:
Multiscale Simulation of Cardiovascular flows on the IBM Bluegene/P: Full Heart-Circulation System at Red-Blood Cell Resolution. 1-10
- Abtin Rahimian, Ilya Lashuk, Shravan K. Veerapaneni, Aparna Chandramowlishwaran, Dhairya Malhotra, Logan Moon, Rahul S. Sampath, Aashay Shringarpure, Jeffrey S. Vetter, Richard W. Vuduc, Denis Zorin, George Biros:
Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures. 1-11
- Anton Kozhevnikov, Adolfo G. Eguiluz, Thomas C. Schulthess:
Toward First Principles Electronic Structure Simulations of Excited States and Strong Correlations in Nano- and Materials Science. 1-10
- Fengguang Song, Hatem Ltaief, Bilel Hadri, Jack J. Dongarra:
Scalable Tile Communication-Avoiding QR Factorization on Multicore Cluster Systems. 1-11
- Rahul S. Sampath, Hari Sundar, Shravan K. Veerapaneni:
Parallel Fast Gauss Transform. 1-10
- Jun Doi, Yasushi Negishi:
Overlapping Methods of All-to-All Communication and FFT Algorithms for Torus-Connected Massively Parallel Supercomputers. 1-9
- Hanjoon Kim, Seulki Heo, Junghoon Lee, Jaehyuk Huh, John Kim:
On-Chip Network Evaluation Framework. 10
- Gilbert Hendry, Eric Robinson, Vitaliy Gleyzer, Johnnie Chan, Luca P. Carloni, Nadya Travinin Bliss, Keren Bergman:
Circuit-Switched Memory Access in Photonic Interconnection Networks for High-Performance Embedded Computing. 1-12
- Asit K. Mishra, Shekhar Srikantaiah, Mahmut T. Kandemir, Chita R. Das:
CPM in CMPs: Coordinated Power Management in Chip-Multiprocessors. 1-12
- Akira Hosoi, Takumi Washio, Jun-ichi Okada, Yoshimasa Kadooka, Kengo Nakajima, Toshiaki Hisada:
A Multi-Scale Heart Simulation on Massively Parallel Computers. 1-11
- Anthony D. Nguyen, Nadathur Satish, Jatin Chhugani, Changkyu Kim, Pradeep Dubey:
3.5-D Blocking Optimization for Stencil Computations on Modern CPUs and GPUs. 1-13
- Takashi Shimokawabe, Takayuki Aoki, Chiashi Muroi, Junichi Ishida, Kohei Kawano, Toshio Endo, Akira Nukada, Naoya Maruyama, Satoshi Matsuoka:
An 80-Fold Speedup, 15.0 TFlops Full GPU Acceleration of Non-Hydrostatic Weather Model ASUCA Production Code. 1-11
- Adrian M. Caulfield, Joel Coburn, Todor I. Mollov, Arup De, Ameen Akel, Jiahua He, Arun Jagatheesan, Rajesh K. Gupta, Allan Snavely, Steven Swanson:
Understanding the Impact of Emerging Non-Volatile Memories on High-Performance, IO-Intensive Computing. 1-11
- Jiahua He, Arun Jagatheesan, Sandeep K. S. Gupta, Jeffrey Bennett, Allan Snavely:
DASH: a Recipe for a Flash-based Data Intensive Supercomputer. 1-11
- Xiangyu Dong, Yuan Xie, Naveen Muralimanohar, Norman P. Jouppi:
Simple but Effective Heterogeneous Main Memory with On-Chip Memory Controller Support. 1-11
- Ron O. Dror, J. P. Grossman, Kenneth M. Mackenzie, Brian Towles, Edmond Chow, John K. Salmon, Cliff Young, Joseph A. Bank, Brannon Batson, Martin M. Deneroff, Jeffrey Kuskin, Richard H. Larson, Mark A. Moraes, David E. Shaw:
Exploiting 162-Nanosecond End-to-End Communication Latency on Anton. 1-12
- Ardalan Kangarlou, Sahan Gamage, Ramana Rao Kompella, Dongyan Xu:
vSnoop: Improving TCP Throughput in Virtualized Environments via Acknowledgement Offload. 1-11
- Mehmet Balman, Evangelos Chaniotakis, Arie Shoshani, Alex Sim:
A Flexible Reservation Algorithm for Advance Network Provisioning. 1-11
- Adam Moody, Greg Bronevetsky, Kathryn M. Mohror, Bronis R. de Supinski:
Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System. 1-11
- Mark Giampapa, Thomas Gooding, Todd Inglett, Robert W. Wisniewski:
Experiences with a Lightweight Supercomputer Kernel: Lessons Learned from Blue Gene's CNK. 1-10
- Torsten Hoefler, Timo Schneider, Andrew Lumsdaine:
Characterizing the Influence of System Noise on Large-Scale Applications by Simulation. 1-11
- Guojing Cong, George Almási, Vijay A. Saraswat:
Fast PGAS Implementation of Distributed Graph Algorithms. 1-11
- Roger A. Pearce, Maya B. Gokhale, Nancy M. Amato:
Multithreaded Asynchronous Graph Traversal for In-Memory and Semi-External Memory. 1-11
- Virat Agarwal, Fabrizio Petrini, Davide Pasetto, David A. Bader:
Scalable Graph Exploration on Multicore Processors. 1-11
- Jae-Seung Yeom, Dimitrios S. Nikolopoulos:
Strider: Runtime Support for Optimizing Strided Data Accesses on Multi-Cores with Explicitly Managed Memories. 1-11
- Louis-Noël Pouchet, Uday Bondhugula, Cédric Bastoul, Albert Cohen, J. Ramanujam, P. Sadayappan:
Combined Iterative and Model-driven Optimization in an Automatic Parallelization Framework. 1-11
- Preeti Malakar, Vijay Natarajan, Sathish S. Vadhiyar:
An Adaptive Framework for Simulation and Online Remote Visualization of Critical Climate Applications in Resource-constrained Environments. 1-11
- Seyong Lee, Rudolf Eigenmann:
OpenMPC: Extended OpenMP Programming and Tuning for GPUs. 1-11
- Timothy G. Mattson, Michael Riepen, Thomas Lehnig, Paul Brett, Werner Haas, Patrick Kennedy, Jason Howard, Sriram R. Vangal, Nitin Borkar, Gregory Ruhl, Saurabh Dighe:
The 48-core SCC Processor: the Programmer's View. 1-11
- Beverly A. Sanders, Rodney J. Bartlett, Erik Deumens, Victor Lotrich, Mark Ponton:
A Block-Oriented Language and Runtime System for Tensor Algebra with Very Large Arrays. 1-11
- Venkatram Vishwanath, Mark Hereld, Kamil Iskra, Dries Kimpe, Vitali A. Morozov, Michael E. Papka, Robert B. Ross, Kazutomo Yoshii:
Accelerating I/O Forwarding in IBM Blue Gene/P Systems. 1-10
- Jay F. Lofstead, Fang Zheng, Qing Liu, Scott Klasky, Ron A. Oldfield, Todd Kordenbrock, Karsten Schwan, Matthew Wolf:
Managing Variability in the IO Performance of Petascale Storage Systems. 1-12
- Xuechen Zhang, Kei Davis, Song Jiang:
IOrchestrator: Improving the Performance of Multi-node I/O Systems via Inter-Server Coordination. 1-11
- Anh Vo, Sriram Aananthakrishnan, Ganesh Gopalakrishnan, Bronis R. de Supinski, Martin Schulz, Greg Bronevetsky:
A Scalable and Distributed Dynamic Formal Verifier for MPI Programs. 1-10
- Zhezhe Chen, Qi Gao, Wenbin Zhang, Feng Qin:
FlowChecker: Detecting Bugs in MPI Libraries via Message Flow Checking. 1-11
- Nathan R. Tallent, Laksono Adhianto, John M. Mellor-Crummey:
Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles. 1-11
- Xiaodan Wang, Eric A. Perlman, Randal C. Burns, Tanu Malik, Tamas Budavari, Charles Meneveau, Alexander S. Szalay:
JAWS: Job-Aware Workload Scheduling for the Exploration of Turbulence Simulations. 1-11
- Thorvald Natvig, Anne C. Elster, Jan Christian Meyer:
Automatic Run-time Parallelization and Transformation of I/O. 1-10
- Min Li, Sudharshan S. Vazhkudai, Ali Raza Butt, Fei Meng, Xiaosong Ma, Youngjae Kim, Christian Engelmann, Galen M. Shipman:
Functional Partitioning to Optimize End-to-End Performance on Many-core Architectures. 1-12
- Aparna Chandramowlishwaran, Kamesh Madduri, Richard W. Vuduc:
Diagnosis, Tuning, and Redesign for Multicore Performance: A Case Study of the Fast Multipole Method. 1-12
- Guy E. Blelloch, Ioannis Koutis, Gary L. Miller, Kanat Tangwongsan:
Hierarchical Diagonal Blocking and Precision Reduction Applied to Combinatorial Multigrid. 1-12
- Mohammad Jowkar, Raúl de la Cruz, José María Cela:
Exploring a Novel Gathering Method for Finite Element Codes on the Cell/B.E. Architecture. 1-11
- Pritish Jetley, Lukasz Wesolowski, Filippo Gioachin, Laxmikant V. Kalé, Thomas R. Quinn:
Scaling Hierarchical N-body Simulations on GPU Clusters. 1-11
- Abdullah Gharaibeh, Matei Ripeanu:
Size Matters: Space/Time Tradeoffs to Improve GPGPU Applications Performance. 1-12
- Scott S. Hampton, Sadaf R. Alam, Paul S. Crozier, Pratul K. Agarwal:
Optimal Utilization of Heterogeneous Resources for Biomolecular Simulations. 1-11
- David Chiu, Apeksha Shetty, Gagan Agrawal:
Elastic Cloud Caches for Accelerating Service-Oriented Computations. 1-11
- Gideon Juve, Ewa Deelman, Karan Vahi, Gaurang Mehta, G. Bruce Berriman, Benjamin P. Berman, Philip Maechling:
Data Sharing Options for Scientific Workflows on Amazon EC2. 1-9
- Qian Zhu, Jiedan Zhu, Gagan Agrawal:
Power-Aware Consolidation of Scientific Workflows in Virtualized Environments. 1-12
- Ronald Babich, Michael A. Clark, Bálint Joó:
Parallelizing the QUDA Library for Multi-GPU Calculations in Lattice Quantum Chromodynamics. 1-11
- Mathieu Luisier:
A Parallel Implementation of Electron-Phonon Scattering in Nanoelectronic Devices up to 95k Cores. 1-11
- Jan Götz, Klaus Iglberger, Markus Stürmer, Ulrich Rüde:
Direct Numerical Simulation of Particulate Flows on 294912 Processor Cores. 1-11
- David Tarjan, Kevin Skadron:
The Sharing Tracker: Using Ideas from Cache Coherence Hardware to Reduce Off-Chip Memory Traffic with Non-Coherent Caches. 1-10
- Andreas Sandberg, David Eklov, Erik Hagersten:
Reducing Cache Pollution Through Detection and Elimination of Non-Temporal Memory Accesses. 1-11
- Martin Burtscher, Byoung-Do Kim, Jeffrey R. Diamond, John D. McCalpin, Lars Koesterke, James C. Browne:
PerfExpert: An Easy-to-Use Performance Diagnosis Tool for HPC Applications. 1-11