Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2311.12833 (cs)

[Submitted on 3 Oct 2023]

Title:HPC-GPT: Integrating Large Language Model for High-Performance Computing

Authors:Xianzhong Ding, Le Chen, Murali Emani, Chunhua Liao, Pei-Hung Lin, Tristan Vanderbruggen, Zhen Xie, Alberto E. Cerpa, Wan Du

View PDF

Abstract:Large Language Models (LLMs), including the LLaMA model, have exhibited their efficacy across various general-domain natural language processing (NLP) tasks. However, their performance in high-performance computing (HPC) domain tasks has been less than optimal due to the specialized expertise required to interpret the model responses. In response to this challenge, we propose HPC-GPT, a novel LLaMA-based model that has been supervised fine-tuning using generated QA (Question-Answer) instances for the HPC domain. To evaluate its effectiveness, we concentrate on two HPC tasks: managing AI models and datasets for HPC, and data race detection. By employing HPC-GPT, we demonstrate comparable performance with existing methods on both tasks, exemplifying its excellence in HPC-related scenarios. Our experiments on open-source benchmarks yield extensive results, underscoring HPC-GPT's potential to bridge the performance gap between LLMs and HPC-specific tasks. With HPC-GPT, we aim to pave the way for LLMs to excel in HPC domains, simplifying the utilization of language models in complex computing applications.

Comments:	9 pages
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2311.12833 [cs.DC]
	(or arXiv:2311.12833v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2311.12833
Related DOI:	https://doi.org/10.1145/3624062.3624172

Submission history

From: Xianzhong Ding [view email]
[v1] Tue, 3 Oct 2023 01:34:55 UTC (109 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:HPC-GPT: Integrating Large Language Model for High-Performance Computing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:HPC-GPT: Integrating Large Language Model for High-Performance Computing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators