default search action
Wei Niu 0002
Person information
- affiliation: University of Georgia, Athens, GA, USA
- affiliation (PhD): College of William & Mary, Williamsburg, VA, USA
Other persons with the same name
- Wei Niu — disambiguation page
- Wei Niu 0001 — Beihang University, Ecole Centrale de Pékin, Beijing, China (and 1 more)
- Wei Niu 0003 — Texas A&M University, College Station, TX, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c37]Wei Niu, Gagan Agrawal, Bin Ren:
SoD2: Statically Optimizing Dynamic Deep Neural Network Execution. ASPLOS (1) 2024: 386-400 - [c36]Wei Niu, Md. Musfiqur Rahman Sanim, Zhihao Shu, Jiexiong Guan, Xipeng Shen, Miao Yin, Gagan Agrawal, Bin Ren:
SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile. ASPLOS (3) 2024: 916-931 - [c35]Wenqi Jia, Sian Jin, Jinzhen Wang, Wei Niu, Dingwen Tao, Miao Yin:
GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data. FlexScience@HPDC 2024: 34-41 - [c34]Gen Li, Lu Yin, Jie Ji, Wei Niu, Minghai Qin, Bin Ren, Linke Guo, Shiwei Liu, Xiaolong Ma:
NeurRev: Train Better Sparse Neural Network Practically via Neuron Revitalization. ICLR 2024 - [i37]Xuan Shen, Zhenglun Kong, Changdi Yang, Zhaoyang Han, Lei Lu, Peiyan Dong, Cheng Lyu, Chih-hsiang Li, Xuehang Guo, Zhihao Shu, Wei Niu, Miriam Leeser, Pu Zhao, Yanzhi Wang:
EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge. CoRR abs/2402.10787 (2024) - [i36]Wei Niu, Gagan Agrawal, Bin Ren:
SoD2: Statically Optimizing Dynamic Deep Neural Network. CoRR abs/2403.00176 (2024) - [i35]Jun Liu, Chao Wu, Changdi Yang, Hao Tang, Haoye Dong, Zhenglun Kong, Geng Yuan, Wei Niu, Dong Huang, Yanzhi Wang:
Efficient Pruning of Large Language Model with Adaptive Estimation Fusion. CoRR abs/2403.10799 (2024) - [i34]Wenqi Jia, Sian Jin, Jinzhen Wang, Wei Niu, Dingwen Tao, Miao Yin:
GWLZ: A Group-wise Learning-based Lossy Compression Framework for Scientific Data. CoRR abs/2404.13470 (2024) - [i33]Wei Niu, Md. Musfiqur Rahman Sanim, Zhihao Shu, Jiexiong Guan, Xipeng Shen, Miao Yin, Gagan Agrawal, Bin Ren:
SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile. CoRR abs/2404.13528 (2024) - [i32]Gen Li, Zhihao Shu, Jie Ji, Minghai Qin, Fatemeh Afghah, Wei Niu, Xiaolong Ma:
Data Overfitting for On-Device Super-Resolution with Dynamic Algorithm and Compiler Co-Design. CoRR abs/2407.02813 (2024) - [i31]Wenqi Jia, Youyuan Liu, Zhewen Hu, Jinzhen Wang, Boyuan Zhang, Wei Niu, Junzhou Huang, Stavros Kalafatis, Sian Jin, Miao Yin:
NeurLZ: On Enhancing Lossy Compression Performance based on Error-Controlled Neural Learning for Scientific Data. CoRR abs/2409.05785 (2024) - [i30]Zheng Zhan, Zhenglun Kong, Yifan Gong, Yushu Wu, Zichong Meng, Hangyu Zheng, Xuan Shen, Stratis Ioannidis, Wei Niu, Pu Zhao, Yanzhi Wang:
Exploring Token Pruning in Vision State Space Models. CoRR abs/2409.18962 (2024) - 2023
- [j6]Jou-An Chen, Wei Niu, Bin Ren, Yanzhi Wang, Xipeng Shen:
Survey: Exploiting Data Redundancy for Optimization of Deep Learning. ACM Comput. Surv. 55(10): 212:1-212:38 (2023) - [c33]Yanyu Li, Changdi Yang, Pu Zhao, Geng Yuan, Wei Niu, Jiexiong Guan, Hao Tang, Minghai Qin, Qing Jin, Bin Ren, Xue Lin, Yanzhi Wang:
Towards Real-Time Segmentation on the Edge. AAAI 2023: 1468-1476 - [c32]Jun Liu, Chao Wu, Geng Yuan, Wei Niu, Wenbin Zhang, Houbing Herbert Song:
A Scalable Real-time Semantic Segmentation Network for Autonomous Driving. AMC-SME 2023: 3-12 - [c31]Gen Li, Jie Ji, Minghai Qin, Wei Niu, Bin Ren, Fatemeh Afghah, Linke Guo, Xiaolong Ma:
Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting. CVPR 2023: 10259-10269 - [c30]Changdi Yang, Pu Zhao, Yanyu Li, Wei Niu, Jiexiong Guan, Hao Tang, Minghai Qin, Bin Ren, Xue Lin, Yanzhi Wang:
Pruning Parameterization with Bi-level Optimization for Efficient Semantic Segmentation on the Edge. CVPR 2023: 15402-15412 - [c29]Hsin-Hsuan Sung, Jou-An Chen, Wei Niu, Jiexiong Guan, Bin Ren, Xipeng Shen:
Decentralized Application-Level Adaptive Scheduling for Multi-Instance DNNs on Open Mobile Devices. USENIX ATC 2023: 865-877 - [i29]Gen Li, Jie Ji, Minghai Qin, Wei Niu, Bin Ren, Fatemeh Afghah, Linke Guo, Xiaolong Ma:
Towards High-Quality and Efficient Video Super-Resolution via Spatial-Temporal Data Overfitting. CoRR abs/2303.08331 (2023) - 2022
- [j5]Wei Niu, Zhengang Li, Xiaolong Ma, Peiyan Dong, Gang Zhou, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices Based on Fine-Grained Structured Weight Sparsity. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6224-6239 (2022) - [j4]Geng Yuan, Peiyan Dong, Mengshu Sun, Wei Niu, Zhengang Li, Yuxuan Cai, Yanyu Li, Jun Liu, Weiwen Jiang, Xue Lin, Bin Ren, Xulong Tang, Yanzhi Wang:
Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework. ACM Trans. Embed. Comput. Syst. 21(5): 65:1-65:22 (2022) - [j3]Yifan Gong, Geng Yuan, Zheng Zhan, Wei Niu, Zhengang Li, Pu Zhao, Yuxuan Cai, Sijia Liu, Bin Ren, Xue Lin, Xulong Tang, Yanzhi Wang:
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration. ACM Trans. Design Autom. Electr. Syst. 27(5): 47:1-47:26 (2022) - [c28]Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, Yanzhi Wang:
Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution. ECCV (19) 2022: 92-111 - [c27]Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Xuan Shen, Geng Yuan, Bin Ren, Hao Tang, Minghai Qin, Yanzhi Wang:
SPViT: Enabling Faster Vision Transformers via Latency-Aware Soft Token Pruning. ECCV (11) 2022: 620-640 - [c26]Yanyu Li, Xuan Shen, Geng Yuan, Jiexiong Guan, Wei Niu, Hao Tang, Bin Ren, Yanzhi Wang:
Real-Time Portrait Stylization on the Edge. IJCAI 2022: 5928-5931 - [c25]Xiaolong Ma, Geng Yuan, Zhengang Li, Yifan Gong, Tianyun Zhang, Wei Niu, Zheng Zhan, Pu Zhao, Ning Liu, Jian Tang, Xue Lin, Bin Ren, Yanzhi Wang:
BLCR: Towards Real-time DNN Execution with Block-based Reweighted Pruning. ISQED 2022: 1-8 - [c24]Wei Niu, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Gagan Agrawal, Bin Ren:
GCD2: A Globally Optimizing Compiler for Mapping DNNs to Mobile DSPs. MICRO 2022: 512-529 - [c23]Zifeng Wang, Zheng Zhan, Yifan Gong, Geng Yuan, Wei Niu, Tong Jian, Bin Ren, Stratis Ioannidis, Yanzhi Wang, Jennifer G. Dy:
SparCL: Sparse Continual Learning on the Edge. NeurIPS 2022 - [c22]Hsin-Hsuan Sung, Yuanchao Xu, Jiexiong Guan, Wei Niu, Bin Ren, Yanzhi Wang, Shaoshan Liu, Xipeng Shen:
Brief Industry Paper: Enabling Level-4 Autonomous Driving on a Single $1k Off-the-Shelf Card. RTAS 2022: 297-300 - [i28]Yanyu Li, Xuan Shen, Geng Yuan, Jiexiong Guan, Wei Niu, Hao Tang, Bin Ren, Yanzhi Wang:
Real-Time Portrait Stylization on the Edge. CoRR abs/2206.01244 (2022) - [i27]Yushu Wu, Yifan Gong, Pu Zhao, Yanyu Li, Zheng Zhan, Wei Niu, Hao Tang, Minghai Qin, Bin Ren, Yanzhi Wang:
Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution. CoRR abs/2207.12577 (2022) - [i26]Jou-An Chen, Wei Niu, Bin Ren, Yanzhi Wang, Xipeng Shen:
Survey: Exploiting Data Redundancy for Optimization of Deep Learning. CoRR abs/2208.13363 (2022) - [i25]Zifeng Wang, Zheng Zhan, Yifan Gong, Geng Yuan, Wei Niu, Tong Jian, Bin Ren, Stratis Ioannidis, Yanzhi Wang, Jennifer G. Dy:
SparCL: Sparse Continual Learning on the Edge. CoRR abs/2209.09476 (2022) - 2021
- [j2]Hui Guan, Shaoshan Liu, Xiaolong Ma, Wei Niu, Bin Ren, Xipeng Shen, Yanzhi Wang, Pu Zhao:
CoCoPIE: enabling real-time AI on off-the-shelf mobile devices via compression-compilation co-design. Commun. ACM 64(6): 62-68 (2021) - [c21]Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang:
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design. AAAI 2021: 955-963 - [c20]Wei Niu, Mengshu Sun, Zhengang Li, Jou-An Chen, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Sijia Liu, Xue Lin, Bin Ren:
RT3D: Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices. AAAI 2021: 9179-9187 - [c19]Yuxuan Cai, Geng Yuan, Hongjia Li, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang:
A Compression-Compilation Co-Design Framework Towards Real-Time Object Detection on Mobile Devices. AAAI 2021: 15997-16000 - [c18]Hongjia Li, Geng Yuan, Wei Niu, Yuxuan Cai, Mengshu Sun, Zhengang Li, Bin Ren, Xue Lin, Yanzhi Wang:
Real-Time Mobile Acceleration of DNNs: From Computer Vision to Medical Applications. ASP-DAC 2021: 581-586 - [c17]Zhengang Li, Geng Yuan, Wei Niu, Pu Zhao, Yanyu Li, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin:
NPAS: A Compiler-Aware Framework of Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration. CVPR 2021: 14255-14266 - [c16]Pu Zhao, Geng Yuan, Yuxuan Cai, Wei Niu, Qi Liu, Wujie Wen, Bin Ren, Yanzhi Wang, Xue Lin:
Neural Pruning Search for Real-Time Object Detection of Autonomous Vehicles. DAC 2021: 835-840 - [c15]Qihan Wang, Wei Niu, Li Chen, Ruoming Jin, Bin Ren:
HEALS: A Parallel eALS Recommendation System on CPU/GPU Heterogeneous Platforms. HiPC 2021: 252-261 - [c14]Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David R. Kaeli, Bin Ren, Xue Lin, Yanzhi Wang:
Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search. ICCV 2021: 4801-4811 - [c13]Chengming Zhang, Geng Yuan, Wei Niu, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, Dingwen Tao:
ClickTrain: efficient and accurate end-to-end deep learning training via fine-grained architecture-preserving pruning. ICS 2021: 266-278 - [c12]Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang:
A Compression-Compilation Framework for On-mobile Real-time BERT Applications. IJCAI 2021: 5000-5003 - [c11]Xuan Shen, Geng Yuan, Wei Niu, Xiaolong Ma, Jiexiong Guan, Zhengang Li, Bin Ren, Yanzhi Wang:
Towards Fast and Accurate Multi-Person Pose Estimation on Mobile Devices. IJCAI 2021: 5012-5015 - [c10]Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, Siyue Wang, Minghai Qin, Bin Ren, Yanzhi Wang, Sijia Liu, Xue Lin:
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge. NeurIPS 2021: 20838-20850 - [c9]Wei Niu, Jiexiong Guan, Yanzhi Wang, Gagan Agrawal, Bin Ren:
DNNFusion: accelerating deep neural networks execution with advanced operator fusion. PLDI 2021: 883-898 - [c8]Pu Zhao, Wei Niu, Geng Yuan, Yuxuan Cai, Hsin-Hsuan Sung, Shaoshan Liu, Sijia Liu, Xipeng Shen, Bin Ren, Yanzhi Wang, Xue Lin:
Brief Industry Paper: Towards Real-Time 3D Object Detection for Autonomous Vehicles with Pruning Search. RTAS 2021: 425-428 - [c7]Geng Yuan, Peiyan Dong, Mengshu Sun, Wei Niu, Zhengang Li, Yuxuan Cai, Jun Liu, Weiwen Jiang, Xue Lin, Bin Ren, Xulong Tang, Yanzhi Wang:
Work in Progress: Mobile or FPGA? A Comprehensive Evaluation on Energy Efficiency and a Unified Optimization Framework. RTAS 2021: 493-496 - [i24]Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang:
A Compression-Compilation Framework for On-mobile Real-time BERT Applications. CoRR abs/2106.00526 (2021) - [i23]Pu Zhao, Wei Niu, Geng Yuan, Yuxuan Cai, Bin Ren, Yanzhi Wang, Xue Lin:
Achieving Real-Time Object Detection on MobileDevices with Neural Pruning Search. CoRR abs/2106.14943 (2021) - [i22]Xuan Shen, Geng Yuan, Wei Niu, Xiaolong Ma, Jiexiong Guan, Zhengang Li, Bin Ren, Yanzhi Wang:
Towards Fast and Accurate Multi-Person Pose Estimation on Mobile Devices. CoRR abs/2106.15304 (2021) - [i21]Zheng Zhan, Yifan Gong, Pu Zhao, Geng Yuan, Wei Niu, Yushu Wu, Tianyun Zhang, Malith Jayaweera, David R. Kaeli, Bin Ren, Xue Lin, Yanzhi Wang:
Achieving on-Mobile Real-Time Super-Resolution with Neural Architecture and Pruning Search. CoRR abs/2108.08910 (2021) - [i20]Wei Niu, Zhengang Li, Xiaolong Ma, Peiyan Dong, Gang Zhou, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
GRIM: A General, Real-Time Deep Learning Inference Framework for Mobile Devices based on Fine-Grained Structured Weight Sparsity. CoRR abs/2108.11033 (2021) - [i19]Wei Niu, Jiexiong Guan, Yanzhi Wang, Gagan Agrawal, Bin Ren:
DNNFusion: Accelerating Deep Neural Networks Execution with Advanced Operator Fusion. CoRR abs/2108.13342 (2021) - [i18]Hsin-Hsuan Sung, Yuanchao Xu, Jiexiong Guan, Wei Niu, Shaoshan Liu, Bin Ren, Yanzhi Wang, Xipeng Shen:
Enabling Level-4 Autonomous Driving on a Single 1 Off-the-Shelf Card. CoRR abs/2110.06373 (2021) - [i17]Geng Yuan, Xiaolong Ma, Wei Niu, Zhengang Li, Zhenglun Kong, Ning Liu, Yifan Gong, Zheng Zhan, Chaoyang He, Qing Jin, Siyue Wang, Minghai Qin, Bin Ren, Yanzhi Wang, Sijia Liu, Xue Lin:
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge. CoRR abs/2110.14032 (2021) - [i16]Yifan Gong, Geng Yuan, Zheng Zhan, Wei Niu, Zhengang Li, Pu Zhao, Yuxuan Cai, Sijia Liu, Bin Ren, Xue Lin, Xulong Tang, Yanzhi Wang:
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration. CoRR abs/2111.11581 (2021) - [i15]Zhenglun Kong, Peiyan Dong, Xiaolong Ma, Xin Meng, Wei Niu, Mengshu Sun, Bin Ren, Minghai Qin, Hao Tang, Yanzhi Wang:
SPViT: Enabling Faster Vision Transformers via Soft Token Pruning. CoRR abs/2112.13890 (2021) - 2020
- [c6]Xiaolong Ma, Fu-Ming Guo, Wei Niu, Xue Lin, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang:
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-Time Execution on Mobile Devices. AAAI 2020: 5117-5124 - [c5]Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning. ASPLOS 2020: 907-922 - [c4]Peiyan Dong, Siyue Wang, Wei Niu, Chengming Zhang, Sheng Lin, Zhengang Li, Yifan Gong, Bin Ren, Xue Lin, Dingwen Tao:
RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition. DAC 2020: 1-6 - [c3]Xiaolong Ma, Wei Niu, Tianyun Zhang, Sijia Liu, Sheng Lin, Hongjia Li, Wujie Wen, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang:
An Image Enhancing Pattern-Based Sparsity for Real-Time Inference on Mobile Devices. ECCV (13) 2020: 629-645 - [c2]Yifan Gong, Zheng Zhan, Zhengang Li, Wei Niu, Xiaolong Ma, Wenhao Wang, Bin Ren, Caiwen Ding, Xue Lin, Xiaolin Xu, Yanzhi Wang:
A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration Framework. ACM Great Lakes Symposium on VLSI 2020: 119-124 - [c1]Wei Niu, Pu Zhao, Zheng Zhan, Xue Lin, Yanzhi Wang, Bin Ren:
Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization. IJCAI 2020: 5306-5308 - [i14]Wei Niu, Xiaolong Ma, Sheng Lin, Shihao Wang, Xuehai Qian, Xue Lin, Yanzhi Wang, Bin Ren:
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning. CoRR abs/2001.00138 (2020) - [i13]Xiaolong Ma, Wei Niu, Tianyun Zhang, Sijia Liu, Fu-Ming Guo, Sheng Lin, Hongjia Li, Xiang Chen, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang:
An Image Enhancing Pattern-based Sparsity for Real-time Inference on Mobile Devices. CoRR abs/2001.07710 (2020) - [i12]Xiaolong Ma, Zhengang Li, Yifan Gong, Tianyun Zhang, Wei Niu, Zheng Zhan, Pu Zhao, Jian Tang, Xue Lin, Bin Ren, Yanzhi Wang:
BLK-REW: A Unified Block-based DNN Pruning Framework using Reweighted Regularization Method. CoRR abs/2001.08357 (2020) - [i11]Peiyan Dong, Siyue Wang, Wei Niu, Chengming Zhang, Sheng Lin, Zhengang Li, Yifan Gong, Bin Ren, Xue Lin, Yanzhi Wang, Dingwen Tao:
RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition. CoRR abs/2002.11474 (2020) - [i10]Zheng Zhan, Yifan Gong, Zhengang Li, Pu Zhao, Xiaolong Ma, Wei Niu, Xiaolin Xu, Bin Ren, Yanzhi Wang, Xue Lin:
A Privacy-Preserving DNN Pruning and Mobile Acceleration Framework. CoRR abs/2003.06513 (2020) - [i9]Wei Niu, Pu Zhao, Zheng Zhan, Xue Lin, Yanzhi Wang, Bin Ren:
Towards Real-Time DNN Inference on Mobile Platforms with Model Pruning and Compiler Optimization. CoRR abs/2004.11250 (2020) - [i8]Wei Niu, Mengshu Sun, Zhengang Li, Jou-An Chen, Jiexiong Guan, Xipeng Shen, Yanzhi Wang, Xue Lin, Bin Ren:
Achieving Real-Time Execution of 3D Convolutional Neural Networks on Mobile Devices. CoRR abs/2007.09835 (2020) - [i7]Yuxuan Cai, Hongjia Li, Geng Yuan, Wei Niu, Yanyu Li, Xulong Tang, Bin Ren, Yanzhi Wang:
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design. CoRR abs/2009.05697 (2020) - [i6]Wei Niu, Zhenglun Kong, Geng Yuan, Weiwen Jiang, Jiexiong Guan, Caiwen Ding, Pu Zhao, Sijia Liu, Bin Ren, Yanzhi Wang:
Achieving Real-Time Execution of Transformer-based Large-scale Models on Mobile with Compiler-aware Neural Architecture Optimization. CoRR abs/2009.06823 (2020) - [i5]Chengming Zhang, Geng Yuan, Wei Niu, Jiannan Tian, Sian Jin, Donglin Zhuang, Zhe Jiang, Yanzhi Wang, Bin Ren, Shuaiwen Leon Song, Dingwen Tao:
An Efficient End-to-End Deep Learning Training Framework via Fine-Grained Pattern-Based Pruning. CoRR abs/2011.10170 (2020) - [i4]Zhengang Li, Geng Yuan, Wei Niu, Yanyu Li, Pu Zhao, Yuxuan Cai, Xuan Shen, Zheng Zhan, Zhenglun Kong, Qing Jin, Zhiyu Chen, Sijia Liu, Kaiyuan Yang, Bin Ren, Yanzhi Wang, Xue Lin:
6.7ms on Mobile with over 78% ImageNet Accuracy: Unified Network Pruning and Architecture Search for Beyond Real-Time Mobile Acceleration. CoRR abs/2012.00596 (2020) - [i3]Pu Zhao, Wei Niu, Geng Yuan, Yuxuan Cai, Hsin-Hsuan Sung, Wujie Wen, Sijia Liu, Xipeng Shen, Bin Ren, Yanzhi Wang, Xue Lin:
Achieving Real-Time LiDAR 3D Object Detection on a Mobile Device. CoRR abs/2012.13801 (2020)
2010 – 2019
- 2019
- [i2]Wei Niu, Xiaolong Ma, Yanzhi Wang, Bin Ren:
26ms Inference Time for ResNet-50: Towards Real-Time Execution of all DNNs on Smartphone. CoRR abs/1905.00571 (2019) - [i1]Xiaolong Ma, Fu-Ming Guo, Wei Niu, Xue Lin, Jian Tang, Kaisheng Ma, Bin Ren, Yanzhi Wang:
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices. CoRR abs/1909.05073 (2019) - 2017
- [j1]Jianwei Niu, Shihao Wang, Wei Niu, Mohammed Atiquzzaman:
User-aware partitioning algorithm for mobile cloud computing based on maximum graph cuts. Comput. Networks 129: 193-206 (2017)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:34 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint