Cited By
View all- Parasyris KGeorgakoudis GRangel ELaguna IDoerfert JMohror KArnold DBadia R(2023)Scalable Tuning of (OpenMP) GPU Applications via Kernel Record and ReplayProceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis10.1145/3581784.3607098(1-14)Online publication date: 12-Nov-2023
- Xie ZRaskar SEmani MVishwanath V(2023)TrainBF: High-Performance DNN Training Engine Using BFloat16 on AI AcceleratorsEuro-Par 2023: Parallel Processing10.1007/978-3-031-39698-4_31(458-473)Online publication date: 28-Aug-2023