default search action
SC 2018: Dallas, TX, USA
- Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC 2018, Dallas, TX, USA, November 11-16, 2018. IEEE / ACM 2018
Data and storage
- Yinghao Yu, Renfei Huang, Wei Wang, Jun Zhang, Khaled Ben Letaief:
SP-cache: load-balanced, redundancy-free cluster caching with selective partition. 1:1-1:13 - Ali Anwar, Yue Cheng, Hai Huang, Jingoo Han, Hyogi Sim, Dongyoon Lee, Fred Douglis, Ali Raza Butt:
bespoKV: application tailored scale-out key-value stores. 2:1-2:16 - Qing Zheng, Charles D. Cranor, Danhao Guo, Gregory R. Ganger, George Amvrosiadis, Garth A. Gibson, Bradley W. Settlemyer, Gary Grider, Fan Guo:
Scaling embedded in-situ indexing with deltaFS. 3:1-3:15
Next-generation networking
- Matthias A. Blumrich, Nan Jiang, Larry R. Dennison:
Exploiting idle resources in a high-radix switch for supplemental storage. 4:1-4:13 - Qiao Xiang, J. Jensen Zhang, Xin Tony Wang, Y. Jace Liu, Chin Guok, Franck Le, John MacAuley, Harvey B. Newman, Yang Richard Yang:
Fine-grained, multi-domain network resource abstraction as a fundamental primitive to enable high-performance, collaborative data sciences. 5:1-5:13 - Hans Eberle, Larry Dennison:
Light-weight protocols for wire-speed ordering. 6:1-6:12
Resilience
- Christopher Zimmer, Don Maxwell, Stephen Taylor McNally, Scott Atchley, Sudharshan S. Vazhkudai:
GPU age-aware scheduling to improve the reliability of leadership jobs on Titan. 7:1-7:11 - Luanzheng Guo, Dong Li, Ignacio Laguna, Martin Schulz:
FlipTracker: understanding natural error resilience in HPC applications. 8:1-8:14 - Anwesha Das, Frank Mueller, Paul Hargrove, Eric Roman, Scott B. Baden:
Doomsday: predicting which node will fail when on supercomputers. 9:1-9:14
Biology applications
- Evangelos Georganas, Rob Egan, Steven A. Hofmeyr, Eugene Goltsman, Bill Arndt, Andrew Tritt, Aydin Buluç, Leonid Oliker, Katherine A. Yelick:
Extreme scale de novo metagenome assembly. 10:1-10:13 - Tony C. Pan, Sanchit Misra, Srinivas Aluru:
Optimizing high performance distributed memory parallel hash tables for DNA k-mer counting. 11:1-11:13 - Xiaohui Duan, Ping Gao, Tingjian Zhang, Meng Zhang, Weiguo Liu, Wusheng Zhang, Wei Xue, Haohuan Fu, Lin Gan, Dexun Chen, Xiangxu Meng, Guangwen Yang:
Redesigning LAMMPS for peta-scale and hundred-billion-atom simulation on Sunway TaihuLight. 12:1-12:12
Large-scale algorithms
- Liandeng Li, Teng Yu, Wenlai Zhao, Haohuan Fu, Chenyu Wang, Li Tan, Guangwen Yang, John Thomson:
Large-scale hierarchical k-means for heterogeneous many-core supercomputers. 13:1-13:11 - Yang Hu, Hang Liu, H. Howie Huang:
TriCore: parallel triangle counting on GPUs. 14:1-14:12 - Chenhan D. Yu, Severin Reiz, George Biros:
Distributed-memory hierarchical compression of dense SPD matrices. 15:1-15:15
Performance and energy analysis
- Nader Boushehrinejadmoradi, Adarsh Yoga, Santosh Nagarakatte:
A parallelism profiler with what-if analyses for OpenMP programs. 16:1-16:14 - Mark Endrei, Chao Jin, Minh Ngoc Dinh, David Abramson, Heidi Poxon, Luiz DeRose, Bronis R. de Supinski:
Energy efficiency modeling of parallel applications. 17:1-17:13 - John D. McCalpin:
HPL and DGEMM performance variability on the Xeon Platinum 8160 processor. 18:1-18:13
Algorithms on sparse data
- Jiajia Li, Jimeng Sun, Richard W. Vuduc:
HiCOO: hierarchical storage of sparse tensors. 19:1-19:15 - Aryan Eftekhari, Matthias Bollhöfer, Olaf Schenk:
Distributed memory sparse inverse covariance matrix estimation on high-performance computing architectures. 20:1-20:12 - Tahsin Reza, Matei Ripeanu, Nicolas Tripoul, Geoffrey Sanders, Roger Pearce:
PruneJuice: pruning trillion-edge graphs to a precise pattern-matching solution. 21:1-21:17
Performance optimization studies
- Stijn Eyerman, Wim Heirman, Kristof Du Bois, Joshua B. Fryman, Ibrahim Hur:
Many-core graph workload analysis. 22:1-22:11 - Shintaro Iwasaki, Abdelhalim Amer, Kenjiro Taura, Pavan Balaji:
Lessons learned from analyzing dynamic promotion for user-level threading. 23:1-23:12 - Preeti Malakar, Todd S. Munson, Christopher Knight, Venkatram Vishwanath, Michael E. Papka:
Topology-aware space-shared co-analysis of large-scale molecular dynamics simulations. 24:1-24:15
Resource management and interference
- Maxime Martinasso, Miguel Gila, Mauro Bianco, Sadaf R. Alam, Colin McMurtrie, Thomas C. Schulthess:
RM-replay: a high-fidelity tuning, optimization and exploration tool for resource management. 25:1-25:13 - Samuel D. Pollard, Nikhil Jain, Stephen Herbein, Abhinav Bhatele:
Evaluation of an interference-free node allocation policy on fat-tree clusters. 26:1-26:13 - Staci A. Smith, Clara E. Cromey, David K. Lowenthal, Jens Domke, Nikhil Jain, Jayaraman J. Thiagarajan, Abhinav Bhatele:
Mitigating inter-job interference using adaptive flow-aware routing. 27:1-27:15
MPI optimization and characterization
- Sourav Chakraborty, Mohammadreza Bayatpour, Jahanzeb Maqbool Hashmi, Hari Subramoni, Dhabaleswar K. Panda:
Cooperative rendezvous protocols for improved performance and overlap. 28:1-28:13 - Surabhi Jain, Rashid Kaleem, Marc Gamell Balmana, Akhil Langer, Dmitry Durnov, Alexander Sannikov, Maria Garzaran:
Framework for scalable intra-node collective operations using shared memory. 29:1-29:12 - Sudheer Chunduri, Scott Parker, Pavan Balaji, Kevin Harms, Kalyan Kumaran:
Characterization of MPI usage on a production supercomputer. 30:1-30:15
Non-volatile memory
- Kai Wu, Jie Ren, Dong Li:
Runtime data management on non-volatile memory-based heterogeneous memory for task-parallel programs. 31:1-31:13 - Pak Markthub, Mehmet E. Belviranli, Seyong Lee, Jeffrey S. Vetter, Satoshi Matsuoka:
DRAGON: breaking GPU memory capacity limits with direct NVM access. 32:1-32:13 - Ivy Bo Peng, Jeffrey S. Vetter:
Siena: exploring the design space of heterogeneous memory systems. 33:1-33:14
Task-based programming
- Wonchan Lee, Elliott Slaughter, Michael Bauer, Sean Treichler, Todd Warszawski, Michael Garland, Alex Aiken:
Dynamic tracing: memoization of task graphs for dynamic task-based runtimes. 34:1-34:13 - Paul Caheny, Lluc Alvarez, Mateo Valero, Miquel Moretó, Marc Casas:
Runtime-assisted cache coherence deactivation in task parallel programs. 35:1-35:12 - Gökalp Demirci, Ivana Marincic, Henry Hoffmann:
A divide and conquer algorithm for DAG scheduling under power constraints. 36:1-36:12
Clouds and distributed computing
- Georgios Andreadis, Laurens Versluis, Fabian Mastenbroek, Alexandru Iosup:
A reference architecture for datacenter scheduling: design, validation, and experiments. 37:1-37:15 - Feng Liu, Kate Keahey, Pierre Riteau, Jon B. Weissman:
Dynamically negotiating capacity between on-demand and batch clusters. 38:1-38:11 - Nathaniel Kremer-Herman, Benjamín Tovar, Douglas Thain:
A lightweight model for right-sizing master-worker applications. 39:1-39:13
Physics and tensor applications
- Bingwei Chen, Haohuan Fu, Yanwen Wei, Conghui He, Wenqiang Zhang, Yuxuan Li, Wubin Wan, Wei Zhang, Lin Gan, Wei Zhang, Zhenguo Zhang, Guangwen Yang, Xiaofei Chen:
Simulating the Wenchuan earthquake with accurate surface topography on Sunway TaihuLight. 40:1-40:12 - Hua Huang, Edmond Chow:
Accelerating quantum chemistry with vectorized and batched integrals. 41:1-41:14 - Jee W. Choi, Xing Liu, Venkatesan T. Chakaravarthy:
High-performance dense tucker decomposition on GPU clusters. 42:1-42:11
Resilience II
- Scott Levy, Kurt B. Ferreira, Nathan DeBardeleben, Taniya Siddiqua, Vilas Sridharan, Elisabeth Baseman:
Lessons learned from memory errors observed over the lifetime of Cielo. 43:1-43:12 - Zaeem Hussain, Taieb Znati, Rami G. Melhem:
Partial redundancy in HPC systems with non-uniform node reliabilities. 44:1-44:11 - Chun-Kai Chang, Sangkug Lym, Nicholas Kelly, Michael B. Sullivan, Mattan Erez:
Evaluating and accelerating high-fidelity error injection for HPC. 45:1-45:13
Arithmetic and optimization
- Prashant Singh Rawat, Aravind Sukumaran-Rajam, Atanas Rountev, Fabrice Rastello, Louis-Noël Pouchet, P. Sadayappan:
Associative instruction reordering to alleviate register pressure. 46:1-46:13 - Azzam Haidar, Stanimire Tomov, Jack J. Dongarra, Nicholas J. Higham:
Harnessing GPU tensor cores for fast FP16 arithmetic to speed up mixed-precision iterative refinement solvers. 47:1-47:11 - Harshitha Menon, Michael O. Lam, Daniel Osei-Kuffuor, Markus Schordan, Scott Lloyd, Kathryn M. Mohror, Jeffrey Hittinger:
ADAPT: algorithmic differentiation applied to floating-point precision tuning. 48:1-48:13
Gordon bell prize finalist #1
- Tsuyoshi Ichimura, Kohei Fujita, Takuma Yamaguchi, Akira Naruse, Jack C. Wells, Thomas C. Schulthess, Tjerk P. Straatsma, Christopher Zimmer, Maxime Martinasso, Kengo Nakajima, Muneo Hori, Lalith Maddegedara:
A fast scalable implicit solver for nonlinear time-evolution earthquake city problem on low-ordered unstructured finite elements with artificial intelligence and transprecision computing. 49:1-49:11 - Robert M. Patton, J. Travis Johnston, Steven R. Young, Catherine D. Schuman, Don D. March, Thomas E. Potok, Derek C. Rose, Seung-Hwan Lim, Thomas P. Karnowski, Maxim A. Ziatdinov, Sergei V. Kalinin:
167-PFlops deep learning for electron microscopy: from learning physics to atomic manipulation. 50:1-50:11 - Thorsten Kurth, Sean Treichler, Joshua Romero, Mayur Mudigonda, Nathan Luehr, Everett H. Phillips, Ankur Mahesh, Michael A. Matheson, Jack Deslippe, Massimiliano Fatica, Prabhat, Michael Houston:
Exascale deep learning for climate analytics. 51:1-51:12
Large scale system deployments
- Sudharshan S. Vazhkudai, Bronis R. de Supinski, Arthur S. Bland, Al Geist, James C. Sexton, Jim Kahle, Christopher Zimmer, Scott Atchley, Sarp Oral, Don E. Maxwell, Verónica G. Vergara Larrea, Adam Bertsch, Robin Goldstone, Wayne Joubert, Chris Chambreau, David Appelhans, Robert Blackmore, Ben Casses, George Chochia, Gene Davison, Matthew A. Ezell, Tom Gooding, Elsa Gonsiorowski, Leopold Grinberg, Bill Hanson, Bill Hartner, Ian Karlin, Matthew L. Leininger, Dustin Leverman, Chris Marroquin, Adam Moody, Martin Ohmacht, Ramesh Pankajakshan, Fernando Pizzano, James H. Rogers, Bryan S. Rosenburg, Drew Schmidt, Mallikarjun Shankar, Feiyi Wang, Py Watson, Bob Walkup, Lance D. Weems, Junqi Yin:
The design, deployment, and evaluation of the CORAL pre-exascale systems. 52:1-52:12 - Gregory H. Bauer, Brett M. Bode, Jeremy Enos, William T. Kramer, Scott A. Lathrop, Celso L. Mendes, Robert Sisneros:
Best practices and lessons from deploying and operating a sustained-petascale system: the blue waters experience. 53:1-53:12 - Kazuhiko Komatsu, Shintaro Momose, Yoko Isobe, Osamu Watanabe, Akihiro Musa, Mitsuo Yokokawa, Toshikazu Aoyama, Masayuki Sato, Hiroaki Kobayashi:
Performance evaluation of a vector supercomputer SX-aurora TSUBASA. 54:1-54:12
Gordon bell prize finalist #2
- Evan Berkowitz, Michael A. Clark, Arjun Singh Gambhir, Kenneth McElvain, Amy N. Nicholson, Enrico Rinaldi, Pavlos Vranas, André Walker-Loud, Chia-Cheng Chang, Bálint Joó, Thorsten Kurth, Kostas Orginos:
Simulating the weak death of the Neutron in a femtoscale universe with near-exascale computing. 55:1-55:9 - Heng Lin, Xiaowei Zhu, Bowen Yu, Xiongchao Tang, Wei Xue, Wenguang Chen, Lufei Zhang, Torsten Hoefler, Xiaosong Ma, Xin Liu, Weimin Zheng, Jingfang Xu:
ShenTu: processing multi-trillion edge graphs on millions of cores in seconds. 56:1-56:11 - Wayne Joubert, Deborah A. Weighill, David Kainer, Sharlee Climer, Amy Justice, Kjiersten Fagnan, Daniel A. Jacobson:
Attacking the opioid epidemic: determining the epistatic and pleiotropic genetic architectures for chronic pain and opioid addiction. 57:1-57:14
Graph algorithms and systems
- Yuede Ji, Hang Liu, H. Howie Huang:
iSpan: parallel identification of strongly connected components with spanning trees. 58:1-58:12 - Arif Khan, Krzysztof Choromanski, Alex Pothen, S. M. Ferdous, Mahantesh Halappanavar, Antonino Tumeo:
Adaptive anonymization of data using b-edge cover. 59:1-59:11 - Martin Winter, Daniel Mlakar, Rhaleb Zayer, Hans-Peter Seidel, Markus Steinberger:
faimGraph: high performance management of fully-dynamic graphs under tight memory constraints on the GPU. 60:1-60:13
Programming systems tools
- Yizi Gu, John M. Mellor-Crummey:
Dynamic data race detection for OpenMP programs. 61:1-61:12 - Kazem Cheshmi, Shoaib Kamil, Michelle Mills Strout, Maryam Mehri Dehnavi:
ParSy: inspection and transformation of sparse matrix computations for parallelism. 62:1-62:15 - Fangke Ye, Jisheng Zhao, Vivek Sarkar:
Detecting MPI usage anomalies via partial program symbolic execution. 63:1-63:5
Deep learning
- Randall Pittman, Hui Guan, Xipeng Shen, Seung-Hwan Lim, Robert M. Patton:
Exploring flexible communications for streamlining DNN ensemble training pipelines. 64:1-64:12 - Amrita Mathuriya, Deborah Bard, Peter Mendygral, Lawrence Meadows, James Arnemann, Lei Shao, Siyu He, Tuomas Kärnä, Diana Moise, Simon J. Pennycook, Kristyn J. Maschhoff, Jason Sewall, Nalini Kumar, Shirley Ho, Michael F. Ringenburg, Prabhat, Victor W. Lee:
CosmoFlow: using deep learning to learn the universe at scale. 65:1-65:11 - Evangelos Georganas, Sasikanth Avancha, Kunal Banerjee, Dhiraj D. Kalamkar, Greg Henry, Hans Pabst, Alexander Heinecke:
Anatomy of high-performance deep learning convolutions on SIMD architectures. 66:1-66:12
Resilience III: GPUs
- Abdulrahman Mahmoud, Siva Kumar Sastry Hari, Michael B. Sullivan, Timothy Tsai, Stephen W. Keckler:
Optimizing software-directed instruction replication for GPU error detection. 67:1-67:12 - Jieyang Chen, Hongbo Li, Sihuan Li, Xin Liang, Panruo Wu, Dingwen Tao, Kaiming Ouyang, Yuanlai Liu, Kai Zhao, Qiang Guan, Zizhong Chen:
Fault tolerant one-sided matrix decompositions on heterogeneous systems with GPUs. 68:1-68:12 - Cham Kalra, Fritz Previlon, Xiangyu Li, Norman Rubin, David R. Kaeli:
PRISM: predicting resilience of GPU applications using statistical methods. 69:1-69:14
Astrophysics applications
- Muhammed Nufail Farooqi, Tan Nguyen, Weiqun Zhang, Ann S. Almgren, John Shalf, Didem Unat:
Phase asynchronous AMR execution for productive and performant astrophysical flows. 70:1-70:14 - Jia Shi, Ruipeng Li, Yuanzhe Xi, Yousef Saad, Maarten V. de Hoop:
Computing planetary interior normal modes with a highly parallel polynomial filtering eigensolver. 71:1-71:13
File systems: data movement and provenance
- Devarshi Ghoshal, Lavanya Ramakrishnan, Deborah A. Agarwal:
Dac-Man: data change management for scientific datasets on HPC systems. 72:1-72:13 - Pradeep Subedi, Philip E. Davis, Shaohua Duan, Scott Klasky, Hemanth Kolla, Manish Parashar:
Stacker: an autonomic data movement engine for extreme-scale data staging-based in-situ workflows. 73:1-73:11 - Glenn K. Lockwood, Shane Snyder, Teng Wang, Suren Byna, Philip H. Carns, Nicholas J. Wright:
A year in the life of a parallel file system. 74:1-74:13
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.