Architectures

Applied Filters

People

Publications

Conferences

Publication Date

29 Results for: Book/Issue: SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingEdit SearchSave SearchRSS

Searched The ACM Guide to Computing Literature (3,765,100 records)|Limit your search to The ACM Full-Text Collection (758,141 records)

Showing 1 - 20of29 Results

Filters

Select All

Export Citations Save to Binder

per page:

Recency

research-article
November 2008
369 Tflop/s molecular dynamics simulations on the Roadrunner general-purpose heterogeneous supercomputer
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 64, Pages 1–10

We present timing and performance numbers for a short-range parallel molecular dynamics (MD) code, SPaSM, that has been rewritten for the heterogeneous Roadrunner supercomputer. Each Roadrunner compute node consists of two AMD Opteron dualcore ...
5
422
Metrics
Total Citations5
Total Downloads422
Last 12 Months2
Last 6 weeks0
Get Access
research-article
November 2008
0.374 Pflop/s trillion-particle kinetic modeling of laser plasma interaction on Roadrunner
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 63, Pages 1–11

We demonstrate the outstanding performance and scalability of the VPIC kinetic plasma modeling code on the heterogeneous IBM Roadrunner supercomputer at Los Alamos National Laboratory. VPIC is a three-dimensional, relativistic, electromagnetic, particle-...
14
696
Metrics
Total Citations14
Total Downloads696
Last 12 Months42
Last 6 weeks1
Get Access
research-article
November 2008
Prefetch throttling and data pinning for improving performance of shared caches
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 59, Pages 1–12

In this paper, we (i) quantify the impact of compiler-directed I/O prefetching on shared caches at I/O nodes. The experimental data collected shows that while I/O prefetching brings some benefits, its effectiveness reduces significantly as the number of ...
0
336
Metrics
Total Citations0
Total Downloads336
Last 12 Months2
Last 6 weeks1
Get Access
research-article
November 2008
Global trees: a framework for linked data structures on distributed memory parallel systems
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 57, Pages 1–13

This paper describes the Global Trees (GT) system that provides a multi-layered interface to a global address space view of distributed tree data structures, while providing scalable performance on distributed memory systems. The Global Trees system ...
4
482
Metrics
Total Citations4
Total Downloads482
Last 12 Months1
Last 6 weeks0
Get Access
research-article
November 2008
Server-storage virtualization: integration and load balancing in data centers
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 53, Pages 1–12

We describe the design of an agile data center with integrated server and storage virtualization technologies. Such data centers form a key building block for new cloud computing architectures. We also show how to leverage this integrated agility for ...
40
3,560
Metrics
Total Citations40
Total Downloads3,560
Last 12 Months3
Last 6 weeks0
Get Access
research-article
November 2008
Analysis of application heartbeats: learning structural and temporal features in time series data for identification of performance problems
- Emma S. Buneci,
- Daniel A. Reed
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 52, Pages 1–12

Grids promote new modes of scientific collaboration and discovery by connecting distributed instruments, data and computing facilities. Because many resources are shared, application performance can vary widely and unexpectedly. We describe a novel ...
4
321
Metrics
Total Citations4
Total Downloads321
Last 12 Months1
Last 6 weeks0
Get Access
research-article
November 2008
The cost of doing science on the cloud: the Montage example
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 50, Pages 1–12

Utility grids such as the Amazon EC2 cloud and Amazon S3 offer computational and storage resources that can be used on-demand for a fee by compute and data-intensive applications. The cost of running an application on such a cloud depends on the compute,...
78
2,348
Metrics
Total Citations78
Total Downloads2,348
Last 12 Months2
Last 6 weeks0
Get Access
research-article
November 2008
Scalable load-balance measurement for SPMD codes
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 46, Pages 1–12

Good load balance is crucial on very large parallel systems, but the most sophisticated algorithms introduce dynamic imbalances through adaptation in domain decomposition or use of adaptive solvers. To observe and diagnose imbalance, developers need ...
15
350
Metrics
Total Citations15
Total Downloads350
Last 12 Months2
Last 6 weeks0
Get Access
research-article
November 2008
BitDew: a programmable environment for large-scale data management and distribution
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 45, Pages 1–12

Desktop Grids use the computing, network and storage resources from idle desktop PC's distributed over multiple-LAN's or the Internet to compute a large variety of resource-demanding distributed applications. While these applications need to access, ...
11
389
Metrics
Total Citations11
Total Downloads389
Last 12 Months4
Last 6 weeks0
Get Access
research-article
November 2008
Proactive process-level live migration in HPC environments
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 43, Pages 1–12

As the number of nodes in high-performance computing environments keeps increasing, faults are becoming common place. Reactive fault tolerance (FT) often does not scale due to massive I/O requirements and relies on manual job resubmission.

This work ...
33
823
Metrics
Total Citations33
Total Downloads823
Last 12 Months2
Last 6 weeks0
Get Access
research-article
November 2008
A dynamic scheduler for balancing HPC applications
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 41, Pages 1–12

Load imbalance cause significant performance degradation in High Performance Computing applications. In our previous work we showed that load imbalance can be alleviated by modern MT processors that provide mechanisms for controlling the allocation of ...
11
665
Metrics
Total Citations11
Total Downloads665
Last 12 Months3
Last 6 weeks0
Get Access
research-article
November 2008
PAM: a novel performance/power aware meta-scheduler for multi-core systems
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 39, Pages 1–12

Sharing resources such as caches and main memory bandwidth in multi-core systems requires a more sophisticated scheduling scheme. PAM is a low-overhead, user-level meta-scheduler which does not require any hardware or software changes. In particular, it ...
20
585
Metrics
Total Citations20
Total Downloads585
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2008
An adaptive cut-off for task parallelism
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 36, Pages 1–11

In task parallel languages, an important factor for achieving a good performance is the use of a cut-off technique to reduce the number of tasks created. Using a cut-off to avoid an excessive number of tasks helps the runtime system to reduce the total ...
36
526
Metrics
Total Citations36
Total Downloads526
Last 12 Months4
Last 6 weeks0
Get Access
research-article
November 2008
Massively parallel genomic sequence search on the Blue Gene/P architecture
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 33, Pages 1–11

This paper presents our first experiences in mapping and optimizing genomic sequence search onto the massively parallel IBM Blue Gene/P (BG/P) platform. Specifically, we performed our work on mpiBLAST, a parallel sequence-search code that has been ...
7
365
Metrics
Total Citations7
Total Downloads365
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2008
High-radix crossbar switches enabled by proximity communication
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 32, Pages 1–12

We describe a novel way to implement high-radix crossbar switches. Our work is enabled by a new chip interconnect technology called Proximity Communication (PxC) that offers unparalleled chip IO density. First, we show how a crossbar architecture is ...
4
293
Metrics
Total Citations4
Total Downloads293
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2008
Extending CC-NUMA systems to support write update optimizations
- Liqun Cheng,
- John B. Carter
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 30, Pages 1–12

Processor stalls and protocol messages caused by coherence misses limit the performance of shared memory applications. Modern multiprocessors employ write-invalidate coherence protocols, which induce read misses to ensure consistency. Previous research ...
2
283
Metrics
Total Citations2
Total Downloads283
Last 12 Months10
Last 6 weeks0
Get Access
research-article
November 2008
A novel migration-based NUCA design for chip multiprocessors
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 28, Pages 1–12

Chip Multiprocessors (CMFs) and Non-Uniform Cache Architectures (NUCAs) represent two emerging trends in computer architecture. Targeting future CMP based systems with NUCA type L2 caches, this paper proposes a novel data migration algorithm for ...
18
470
Metrics
Total Citations18
Total Downloads470
Last 12 Months1
Last 6 weeks0
Get Access
research-article
November 2008
Applying double auctions for scheduling of workflows on the Grid
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 27, Pages 1–11

Grid economy models have long been considered as a promising alternative for the classical Grid resource management, due to their dynamic and decentralized nature, and because the financial valuation of resources and services is inherent in any such ...
2
287
Metrics
Total Citations2
Total Downloads287
Last 12 Months0
Last 6 weeks0
Get Access
research-article
November 2008
SMARTMAP: operating system support for efficient data sharing among processes on a multi-core processor
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 25, Pages 1–12

This paper describes SMARTMAP, an operating system technique that implements fixed offset virtual memory addressing. SMARTMAP allows the application processes on a multi-core processor to directly access each other's memory without the overhead of ...
8
315
Metrics
Total Citations8
Total Downloads315
Last 12 Months2
Last 6 weeks0
Get Access
research-article
November 2008
Nimrod/K: towards massively parallel dynamic grid workflows
SC '08: Proceedings of the 2008 ACM/IEEE conference on SupercomputingArticle No.: 24, Pages 1–11

A challenge for Grid computing is the difficulty in developing software that is parallel, distributed and highly dynamic. Whilst there have been many general purpose mechanisms developed over the years, Grid programming still remains a low level, error ...
12
333
Metrics
Total Citations12
Total Downloads333
Last 12 Months1
Last 6 weeks0
Get Access