Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleNovember 2008
Linearly scaling 3D fragment method for large-scale electronic structure calculations
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 65, Pages 1–10We present a new linearly scaling three-dimensional fragment (LS3DF) method for large scale ab initio electronic structure calculations. LS3DF is based on a divide-and-conquer approach, which incorporates a novel patching scheme that effectively cancels ...
- research-articleNovember 2008
369 Tflop/s molecular dynamics simulations on the Roadrunner general-purpose heterogeneous supercomputer
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 64, Pages 1–10We present timing and performance numbers for a short-range parallel molecular dynamics (MD) code, SPaSM, that has been rewritten for the heterogeneous Roadrunner supercomputer. Each Roadrunner compute node consists of two AMD Opteron dualcore ...
- research-articleNovember 2008
0.374 Pflop/s trillion-particle kinetic modeling of laser plasma interaction on Roadrunner
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 63, Pages 1–11We demonstrate the outstanding performance and scalability of the VPIC kinetic plasma modeling code on the heterogeneous IBM Roadrunner supercomputer at Los Alamos National Laboratory. VPIC is a three-dimensional, relativistic, electromagnetic, particle-...
- research-articleNovember 2008
Scalable adaptive mantle convection simulation on petascale supercomputers
- Carsten Burstedde,
- Omar Ghattas,
- Michael Gurnis,
- Georg Stadler,
- Eh Tan,
- Tiankai Tu,
- Lucas C. Wilcox,
- Shijie Zhong
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 62, Pages 1–15Mantle convection is the principal control on the thermal and geological evolution of the Earth. Mantle convection modeling involves solution of the mass, momentum, and energy equations for a viscous, creeping, incompressible non-Newtonian fluid at high ...
- research-articleNovember 2008
New algorithm to enable 400+ TFlop/s sustained performance in simulations of disorder effects in high-Tc superconductors
- G. Alvarez,
- M. S. Summers,
- D. E. Maxwell,
- M. Eisenbach,
- J. S. Meredith,
- J. M. Larkin,
- J. Levesque,
- T. A. Maier,
- P. R. C. Kent,
- E. F. D'Azevedo,
- T. C. Schulthess
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 61, Pages 1–10Staggering computational and algorithmic advances in recent years now make possible systematic Quantum Monte Carlo (QMC) simulations of high temperature (high-Tc) superconductivity in a microscopic model, the two dimensional (2D) Hubbard model, with ...
-
- research-articleNovember 2008
High-frequency simulations of global seismic wave propagation using SPECFEM3D_GLOBE on 62K processors
- Laura Carrington,
- Dimitri Komatitsch,
- Michael Laurenzano,
- Mustafa M Tikir,
- David Michéa,
- Nicolas Le Goff,
- Allan Snavely,
- Jeroen Tromp
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 60, Pages 1–11SPECFEM3D_GLOBE is a spectral-element application enabling the simulation of global seismic wave propagation in 3D anelastic, anisotropic, rotating and self-gravitating Earth models at unprecedented resolution. A fundamental challenge in global ...
- research-articleNovember 2008
Prefetch throttling and data pinning for improving performance of shared caches
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 59, Pages 1–12In this paper, we (i) quantify the impact of compiler-directed I/O prefetching on shared caches at I/O nodes. The experimental data collected shows that while I/O prefetching brings some benefits, its effectiveness reduces significantly as the number of ...
- research-articleNovember 2008
Parallel exact inference on the cell broadband engine processor
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 58, Pages 1–12We present the design and implementation of a parallel exact inference algorithm on the Cell Broadband Engine (Cell BE). Exact inference is a key problem in exploring probabilistic graphical models. In such a model, the computation complexity increases ...
- research-articleNovember 2008
Global trees: a framework for linked data structures on distributed memory parallel systems
- D. Brian Larkins,
- James Dinan,
- Sriram Krishnamoorthy,
- Srinivasan Parthasarathy,
- Atanas Rountev,
- P. Sadayappan
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 57, Pages 1–13This paper describes the Global Trees (GT) system that provides a multi-layered interface to a global address space view of distributed tree data structures, while providing scalable performance on distributed memory systems. The Global Trees system ...
- research-articleNovember 2008
A scalable parallel framework for analyzing terascale molecular dynamics simulation trajectories
- Tiankai Tu,
- Charles A. Rendleman,
- David W. Borhani,
- Ron O. Dror,
- Justin Gullingsrud,
- Morten Ø. Jensen,
- John L. Klepeis,
- Paul Maragakis,
- Patrick Miller,
- Kate A. Stafford,
- David E. Shaw
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 56, Pages 1–12As parallel algorithms and architectures drive the longest molecular dynamics (MD) simulations towards the millisecond scale, traditional sequential post-simulation data analysis methods are becoming increasingly untenable. Inspired by the programming ...
- research-articleNovember 2008
Positivity, posynomials and tile size selection
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 55, Pages 1–12Tiling is a widely used loop transformation for exposing/exploiting parallelism and data locality. Effective use of tiling requires selection and tuning of the tile sizes. This is usually achieved by developing cost models that characterize the ...
- research-articleNovember 2008
Materialized community ground models for large-scale earthquake simulation
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 54, Pages 1–12Large-scale earthquake simulation requires source datasets which describe the highly heterogeneous physical characteristics of the earth in the region under simulation. Physical characteristic datasets are the first stage in a simulation pipeline which ...
- research-articleNovember 2008
Server-storage virtualization: integration and load balancing in data centers
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 53, Pages 1–12We describe the design of an agile data center with integrated server and storage virtualization technologies. Such data centers form a key building block for new cloud computing architectures. We also show how to leverage this integrated agility for ...
- research-articleNovember 2008
Analysis of application heartbeats: learning structural and temporal features in time series data for identification of performance problems
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 52, Pages 1–12Grids promote new modes of scientific collaboration and discovery by connecting distributed instruments, data and computing facilities. Because many resources are shared, application performance can vary widely and unexpectedly. We describe a novel ...
- research-articleNovember 2008
High performance multivariate visual data exploration for extremely large data
- Oliver Rübel,
- Prabhat,
- Kesheng Wu,
- Hank Childs,
- Jeremy Meredith,
- Cameron G. R. Geddes,
- Estelle Cormier-Michel,
- Sean Ahern,
- Gunther H. Weber,
- Peter Messmer,
- Hans Hagen,
- Bernd Hamann,
- E. Wes Bethel
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 51, Pages 1–12One of the central challenges in modern science is the need to quickly derive knowledge and understanding from large, complex collections of data. We present a new approach that deals with this challenge by combining and extending techniques from high ...
- research-articleNovember 2008
The cost of doing science on the cloud: the Montage example
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 50, Pages 1–12Utility grids such as the Amazon EC2 cloud and Amazon S3 offer computational and storage resources that can be used on-demand for a fee by compute and data-intensive applications. The cost of running an application on such a cloud depends on the compute,...
- research-articleNovember 2008
Capturing performance knowledge for automated analysis
- Kevin A. Huck,
- Oscar Hernandez,
- Van Bui,
- Sunita Chandrasekaran,
- Barbara Chapman,
- Allen D. Malony,
- Lois Curfman McInnes,
- Boyana Norris
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 49, Pages 1–10Automating the process of parallel performance experimentation, analysis, and problem diagnosis can enhance environments for performance-directed application development, compilation, and execution. This is especially true when parametric studies, ...
- research-articleNovember 2008
Massively parallel volume rendering using 2-3 swap image compositing
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 48, Pages 1–11The ever-increasing amounts of simulation data produced by scientists demand high-end parallel visualization capability. However, image compositing, which requires interprocessor communication, is often the bottleneck stage for parallel rendering of ...
- research-articleNovember 2008
Using overlays for efficient data transfer over shared wide-area networks
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 47, Pages 1–12Data-intensive applications frequently transfer large amounts of data over wide-area networks. The performance achieved in such settings can often be improved by routing data via intermediate nodes chosen to increase aggregate bandwidth. We explore the ...
- research-articleNovember 2008
Scalable load-balance measurement for SPMD codes
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 46, Pages 1–12Good load balance is crucial on very large parallel systems, but the most sophisticated algorithms introduce dynamic imbalances through adaptation in domain decomposition or use of adaptive solvers. To observe and diagnose imbalance, developers need ...