- research-article, May 2024
A Multidimensional Communication Scheduling Method for Hybrid Parallel DNN Training
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 35, Issue 8, Pages 1415–1428. https://doi.org/10.1109/TPDS.2024.3406420
Transformer-based deep neural network (DNN) models have shown considerable success across diverse tasks, prompting widespread adoption of distributed training methods such as data parallelism and pipeline parallelism. With the increasing parameter ...
- research-article, July 2024
Advances of Pipeline Model Parallelism for Deep Learning Training: An Overview
Journal of Computer Science and Technology (JCST), Volume 39, Issue 3, Pages 567–584. https://doi.org/10.1007/s11390-024-3872-3
Deep learning has become the cornerstone of artificial intelligence, playing an increasingly important role in production and daily life. However, as the complexity of problem-solving increases, deep learning models become increasingly ...
- research-article, May 2023
Merak: An Efficient Distributed DNN Training Framework With Automated 3D Parallelism for Giant Foundation Models
IEEE Transactions on Parallel and Distributed Systems (TPDS), Volume 34, Issue 5, Pages 1466–1478. https://doi.org/10.1109/TPDS.2023.3247001
Foundation models are in the process of becoming the dominant deep learning technology. Pretraining a foundation model is always time-consuming due to the large scale of both the model parameters and the training dataset. Besides being computing-intensive, the ...
- research-article, April 2023
Compressed Collective Sparse-Sketch for Distributed Data-Parallel Training of Deep Learning Models
IEEE Journal on Selected Areas in Communications (JSAC), Volume 41, Issue 4, Pages 941–963. https://doi.org/10.1109/JSAC.2023.3242733
Distributed data-parallel training (DDP) is prevalent in large-scale deep learning. To increase the training throughput and scalability, high-performance collective communication methods such as AllReduce have recently proliferated for DDP use. However, ...