Cited By
View all- Bastoul CZhang ZRazanajato HLossing NSusungi Ade Juan JFilhol EJarry BConsolaro GZhang RLee J(2022)Optimizing GPU deep learning operators with polyhedral scheduling constraint injectionProceedings of the 20th IEEE/ACM International Symposium on Code Generation and Optimization10.1109/CGO53902.2022.9741260(313-324)Online publication date: 2-Apr-2022
- Shirako JHayashi APaul STumanov ASarkar V(2022)Automatic Parallelization of Python Programs for Distributed Heterogeneous ComputingEuro-Par 2022: Parallel Processing10.1007/978-3-031-12597-3_22(350-366)Online publication date: 22-Aug-2022
- Tripathy DAbdolrashidi AFan QWong DSatpathy M(2021)LocalityGuru: A PTX Analyzer for Extracting Thread Block-level Locality in GPGPUs2021 IEEE International Conference on Networking, Architecture and Storage (NAS)10.1109/NAS51552.2021.9605411(1-8)Online publication date: Oct-2021
- Show More Cited By