SAGE-HPCA: Vol 26, No 2

Volume 26, Issue 2May 2012

Volume 26, Issue 2

May 2012

Publisher:

Sage Publications, Inc.
2455 Teller Road Thousand Oaks, CA
United States

ISSN:1094-3420

Tags:

virtualization
FFT
GPU
OpenMP
Operating systems

Get Alerts for this PeriodicalAlerts Save to BinderBinder Export CitationCitation

Share on

Reflects downloads up to 30 Sep 2024Bibliometrics

Citation count

Downloads (6 weeks)

Downloads (12 months)

Downloads (cumulative)

Sections

Volume 26 , Issue 2

May 2012

PreviousIssue NextIssue

Skip Table Of Content Section

Select All

Export Citations Save to Binder

other

Operating systems and runtime environments on supercomputers

Torsten Hoefler,
Kamil Iskra

Pages 93–94https://doi.org/10.1177/1094342012442456

- 0
Metrics
Total Citations0

research-article

A lightweight virtual machine monitor for Blue Gene/P

Jan Stoess,
Udo Steinberg,
Volkmar Uhlig,
Jens Kehne,
Jonathan Appavoo,
Amos Waterland

Pages 95–109https://doi.org/10.1177/1094342011434815

In this paper, we present a lightweight, micro-kernel-based virtual machine monitor (VMM) for the Blue Gene/P supercomputer. Our VMM comprises a small µ-kernel with virtualization capabilities and, atop, a user-level VMM component that manages virtual ...

- 0
Metrics
Total Citations0

Abstract

research-article

OpenMP task scheduling strategies for multicore NUMA systems

Stephen L Olivier,
Allan K Porterfield,
Kyle B Wheeler,
Michael Spiegel,
Jan F Prins

Pages 110–124https://doi.org/10.1177/1094342011434065

The recent addition of task parallelism to the OpenMP shared memory API allows programmers to express concurrency at a high level of abstraction and places the burden of scheduling parallel execution on the OpenMP run-time system. Efficient scheduling ...

- 35
Metrics
Total Citations35

Abstract

research-article

Virtual-machine-based emulation of future generation high-performance computing systems

Patrick G Bridges,
Dorian Arnold,
Kevin T Pedretti,
Madhav Suresh,
Feng Lu,
Peter Dinda,
Russ Joseph,
Jack Lange

Pages 125–135https://doi.org/10.1177/1094342012436619

This paper describes the design of a system to enable research, development, and testing of new software stacks and hardware features for future high-end computing systems. Motivating uses include both small-scale research and development on simulated ...

- 5
Metrics
Total Citations5

Abstract

research-article

Linux kernel co-scheduling and bulk synchronous parallelism

Terry Jones

Pages 136–145https://doi.org/10.1177/1094342011433523

This paper describes a kernel scheduling algorithm that is based on co-scheduling principles and that is intended for parallel applications running on 1000 cores or more. Experimental results for a Linux implementation on a Cray XT5 machine are ...

- 4
Metrics
Total Citations4

Abstract

other

Applications for the Heterogeneous Computing Era

Pavan Balaji,
Jiayuan Meng

Pages 146–147https://doi.org/10.1177/1094342012442457

- 0
Metrics
Total Citations0

research-article

Large-scale fast Fourier transform on a heterogeneous multi-core system

Yan Li,
Jeffrey R Diamond,
Xu Wang,
Haibo Lin,
Yudong Yang,
Zhenxing Han

Pages 148–158https://doi.org/10.1177/1094342011435158

As interest in hybrid computing systems increases, people are eager to find new ways to exploit the unique and efficient computational power of the heterogeneous multi-core systems. Although there has been much interest in implementing high-performance ...

- 1
Metrics
Total Citations1

Abstract

research-article

Network-theoretic classification of parallel computation patterns

Sean Whalen,
Sophie Engle,
Sean Peisert,
Matt Bishop

Pages 159–169https://doi.org/10.1177/1094342012436618

Parallel computation in a high-performance computing environment can be characterized by the distributed memory access patterns of the underlying algorithm. During execution, networks of compute nodes exchange messages that indirectly exhibit these ...

- 4
Metrics
Total Citations4

Abstract

research-article

Characterization and transformation of unstructured control flow in bulk synchronous GPU applications

Haicheng Wu,
Gregory Diamos,
Jin Wang,
Si Li,
Sudhakar Yalamanchili

Pages 170–185https://doi.org/10.1177/1094342011434814

In this paper we identify important classes of program control flows in applications targeted to commercially available graphics processing units (GPUs) and characterize their presence in real workloads such as those that occur in CUDA and OpenCL. ...

- 6
Metrics
Total Citations6

Abstract

Save to Binder

Create a New Binder

Name

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Export Citations

Select Citation format

Please download or close your previous search result export first before starting a new bulk export.
Preview is not available.
By clicking download,a status dialog will open to start the export process. The process may takea few minutes but once it finishes a file will be downloadable from your browser. You may continue to browse the DL while the export process is in progress.
Download
- Download citation
- Copy citation