Cited By
View all- Rebai AOjewale MUllah ACanini MFahmy S(2024)SqueezeNIC: Low-Latency In-NIC Compression for Distributed Deep LearningProceedings of the 2024 SIGCOMM Workshop on Networks for AI Computing10.1145/3672198.3673801(61-68)Online publication date: 4-Aug-2024
- Strati FMa XKlimovic A(2024)Orion: Interference-aware, Fine-grained GPU Sharing for ML ApplicationsProceedings of the Nineteenth European Conference on Computer Systems10.1145/3627703.3629578(1075-1092)Online publication date: 22-Apr-2024
- Wang HWang LXu HWang YLi YHan YTsafrir DMusuvathi MGupta RAbu-Ghazaleh N(2024)PrimePar: Efficient Spatial-temporal Tensor Partitioning for Large Transformer Model TrainingProceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 310.1145/3620666.3651357(801-817)Online publication date: 27-Apr-2024
- Show More Cited By