Cited By
View all- Arai MFukumoto NMurai H(2024)Introducing software pipelining for the A64FX processor into LLVMProceedings of the International Conference on High Performance Computing in Asia-Pacific Region Workshops10.1145/3636480.3637093(1-6)Online publication date: 11-Jan-2024
- Lin ZMiao YXu GLi CSaarikivi OMaleki SYang F(2024)Efficient Schedule Construction for Distributed Execution of Large DNN ModelsIEEE Transactions on Parallel and Distributed Systems10.1109/TPDS.2024.346691335:12(2375-2391)Online publication date: Dec-2024
- Lin ZMiao YXu GLi CSaarikivi OMaleki SYang F(2024)Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA57654.2024.00067(803-816)Online publication date: 2-Mar-2024
- Show More Cited By