- research-article, November 2013
The origin of mass
- Peter Boyle,
- Michael I. Buchoff,
- Norman Christ,
- Taku Izubuchi,
- Chulwoo Jung,
- Thomas C. Luu,
- Robert Mawhinney,
- Chris Schroeder,
- Ron Soltz,
- Pavlos Vranas,
- Joseph Wasem
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 4, Pages 1–10
https://doi.org/10.1145/2503210.2504561
The origin of mass is one of the deepest mysteries in science. Neutrons and protons, which account for almost all visible mass in the Universe, emerged from a primordial plasma through a cataclysmic phase transition microseconds after the Big Bang. ...
- research-article, November 2013
Distributed-memory parallel algorithms for generating massive scale-free networks using preferential attachment model
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 91, Pages 1–12
https://doi.org/10.1145/2503210.2503291
Recently, there has been substantial interest in the study of various random networks as mathematical models of complex systems. As these complex systems grow larger, the ability to generate progressively large random networks becomes all the more ...
- research-article, November 2013
MVAPICH-PRISM: a proxy-based communication framework using InfiniBand and SCIF for Intel MIC clusters
- Sreeram Potluri,
- Devendar Bureddy,
- Khaled Hamidouche,
- Akshay Venkatesh,
- Krishna Kandalla,
- Hari Subramoni,
- Dhabaleswar K. (Dk) Panda
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 54, Pages 1–11
https://doi.org/10.1145/2503210.2503288
Xeon Phi, based on the Intel Many Integrated Core (MIC) architecture, packs up to 1 TFLOPS of performance on a single chip while providing x86_64 compatibility. On the other hand, InfiniBand is one of the most popular choices of interconnect for ...
- research-article, November 2013
Swendsen-Wang multi-cluster algorithm for the 2D/3D Ising model on Xeon Phi and GPU
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 83, Pages 1–12
https://doi.org/10.1145/2503210.2503254
Simulations of the critical Ising model by means of local update algorithms suffer from critical slowing down. One way to partially compensate for the influence of this phenomenon on the runtime of simulations is using increasingly faster and parallel ...
- research-article, November 2013
On fast parallel detection of strongly connected components (SCC) in small-world graphs
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 92, Pages 1–11
https://doi.org/10.1145/2503210.2503246
Detecting strongly connected components (SCCs) in a directed graph is a fundamental graph analysis algorithm that is used in many science and engineering domains. Traditional approaches in parallel SCC detection, however, show limited performance and ...
- research-article, November 2013
Tera-scale 1D FFT with low-communication algorithm and Intel® Xeon Phi™ coprocessors
- Jongsoo Park,
- Ganesh Bikshandi,
- Karthikeyan Vaidyanathan,
- Ping Tak Peter Tang,
- Pradeep Dubey,
- Daehyun Kim
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 34, Pages 1–12
https://doi.org/10.1145/2503210.2503242
This paper demonstrates the first tera-scale performance of Intel® Xeon Phi™ coprocessors on 1D FFT computations. Applying a disciplined performance programming methodology of sound algorithm choice, valid performance model, and well-executed ...
- research-article, November 2013
Accelerating sparse matrix-vector multiplication on GPUs using bit-representation-optimized schemes
- Wai Teng Tang,
- Wen Jun Tan,
- Rajarshi Ray,
- Yi Wen Wong,
- Weiguang Chen,
- Shyh-hao Kuo,
- Rick Siow Mong Goh,
- Stephen John Turner,
- Weng-Fai Wong
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 26, Pages 1–12
https://doi.org/10.1145/2503210.2503234
The sparse matrix-vector (SpMV) multiplication routine is an important building block used in many iterative algorithms for solving scientific and engineering problems. One of the main challenges of SpMV is its memory-boundedness. Although compression ...
- research-article, November 2013
Deterministic scale-free pipeline parallelism with hyperqueues
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 32, Pages 1–12
https://doi.org/10.1145/2503210.2503233
Ubiquitous parallel computing aims to make parallel programming accessible to a wide variety of programming areas using deterministic and scale-free programming models built on a task abstraction. However, it remains hard to reconcile these attributes ...
- research-article, November 2013
Parallelizing the execution of sequential scripts
SC '13: Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, Article No.: 31, Pages 1–12
https://doi.org/10.1145/2503210.2503222
Scripting is often used in science to create applications via the composition of existing programs. Parallel scripting systems allow the creation of such applications, but each system introduces the need to adopt a somewhat specialized programming ...