default search action
Fan Yang 0024
Person information
- affiliation: Microsoft Research Asia, Beijing, China
- affiliation (former): Nanjing Universiiy, Department of Computer Science, State Key Lab for Novel Software Technology, China
Other persons with the same name
- Fan Yang — disambiguation page
- Fan Yang 0001 — Fudan University, State Key Lab of ASIC & System, School of Microelectronics, Shanghai, China
- Fan Yang 0002 — Cornell University, Ithaca, NY, USA
- Fan Yang 0003 — FernUniversität Hagen, Faculty of Mathematics and Computer Science, Germany (and 1 more)
- Fan Yang 0004 — Utrecht University, The Netherlands (and 2 more)
- Fan Yang 0005 — Tsinghua University, Department of Automation, Beijing, China (and 1 more)
- Fan Yang 0006 — Hong Kong University of Science and Technology, Department of Electronic and Computer Engineering, Clear Water Bay, Hong Kong
- Fan Yang 0007 — Beihang University, School of Biological Science and Medical Engineering, Beijing, China
- Fan Yang 0008 — Tongji University, School of Aerospace Engineering and Applied Mechanics, Shanghai, China (and 1 more)
- Fan Yang 0009 — Zhejiang Sci-Tech University, Department of Mathematics, Hangzhou, China
- Fan Yang 0010 — Xiamen University, Department of Automation, China
- Fan Yang 0011 — Huangshan University, School of Information and Engineering, Huangshan City, China
- Fan Yang 0012 — Southeast University, Instrument and Meter Engineering, Nanjing, China
- Fan Yang 0013 — New York University Shanghai, Economics Area, China (and 1 more)
- Fan Yang 0014 — Nantong University, School of Geographical Sciences, China (and 1 more)
- Fan Yang 0015 — University of Chicago, Department of Computer Science, IL, USA
- Fan Yang 0016 — University of Maryland, College Park, MD, USA
- Fan Yang 0017 — University of Paris-Saclay, France
- Fan Yang 0018 — University of Birmingham, UK
- Fan Yang 0019 — University Bourgogne Franche-Comté, CNRS, Arts et Métiers, Dijon, France (and 1 more)
- Fan Yang 0020 — Space Engineering University, Beijing, China
- Fan Yang 0021 — Chongqing University, College of Communication Engineering, Chongqing Engineering Laboratory of High Performance Integrated Circuits, China
- Fan Yang 0022 — Nuance Communications, Inc. (and 1 more)
- Fan Yang 0023 — Wake Forest University, Department of Computer Science, Winston-Salem, NC, USA (and 3 more)
- Fan Yang 0025 — Wuhan University of Science and Technology, City College, China (and 1 more)
- Fan Yang 0026 — Lanzhou University of Technology, School of Science, China (and 1 more)
- Fan Yang 0027 — Tsinghua University, Department of Electronic Engineering, TNList, Beijing, China (and 2 more)
- Fan Yang 0028 — Wuhan University, State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, China (and 1 more)
- Fan Yang 0029 — China University of Geosciences Beijing, School of Earth Sciences and Resources, China (and 1 more)
- Fan Yang 0030 — Huaqiao University, College of Mechanical Engineering and Automation, Xiamen, China (and 1 more)
- Fan Yang 0031 — Chongqing University of Technology, School of Electrical and Electronic Engineering, China
- Fan Yang 0032 — Nara Institute of Science and Technology, Division of Information Science, Japan
- Fan Yang 0033 — Xi'an Jiaotong University, China (and 2 more)
- Fan Yang 0034 — Zhongnan University of Economics and Law, School of Information and Safety Engineering, Wuhan, China (and 1 more)
- Fan Yang 0035 — Temple University, Department of Computer and Information Sciences, Philadelphia, PA, USA
- Fan Yang 0036 — Beijing Jiaotong University, School of Electronic and Information Engineering, China
- Fan Yang 0037 — Hubei Technology of University, School of Computer Science, Wuhan, China
- Fan Yang 0038 — University of Tokyo, Japan
- Fan Yang 0039 — Jiangxi University of Finance and Economics, School of Software and Internet of Things Engineering, Nanchang, China
- Fan Yang 0040 — Northwestern Polytechnical University, School of Computer Science and Engineering, Xi'an, China
- Fan Yang 0041 — Nanjing University of Aeronautics and Astronautics, College of Aerospace Engineering / State Key Laboratory of Mechanics and Control of Mechanical Structures, China (and 1 more)
- Fan Yang 0042 — Jiangxi Science and Technology Normal University, School of Communications and Electronics, Nanchang, China
- Fan Yang 0043 — Xiamen University, School of Informatics, China
- Fan Yang 0044 — Central South University of Forestry and Technology, School of Computer and Information Engineering, Changsha, China (and 1 more)
- Fan Yang 0045 — Chongqing University, School of Electrical Engineering, State Key Laboratory of Power Transmission Equipment and System Security and New Technology, China
- Fan Yang 0046 — Beijing University of Posts and Telecommunications, State Key Laboratory of Networking and Switching Technology, China
- Fan Yang 0047 — Beijing University of Posts and Telecommunications, MoE Key Laboratory of Universal Wireless Communications, China
- Fan Yang 0048 — Xi'an University of Posts and Telecommunications, School of Communication and Information, China
- Fan Yang 0049 — Chinese Academy of Science, Institute of Computing Technology, Beijing, China
- Fan Yang 0050 — Monash University, Faculty of Business and Economics, Melbourne, VIC, Australia
- Fan Yang 0051 — imec-DistriNet, KU Leuven, Belgium
- Fan Yang 0052 — State Grid Hubei Electric Power Company, Electric Power Research Institute, Wuhan, China
- Fan Yang 0053 — Peking University, National Engineering Laboratory for Video Technology, Beijing, China
- Fan Yang 0054 — Inception Institute of Artificial Intelligence, Abu Dhabi, UAE (and 2 more)
- Fan Yang 0055 — University of British Columbia, School of Engineering, Kelowna, BC, Canada (and 1 more)
- Fan Yang 0056 — Southeast University, School of Information Science and Engineering, National Mobile Communications Research Laboratory, Nanjing, China (and 1 more)
- Fan Yang 0057 — State University of New York at Buffalo, NY, USA
- Fan Yang 0058 — Carnegie Mellon University, Pittsburgh, PA, USA
- Fan Yang 0059 — DeepCode Robotics Co. Ltd. (and 1 more)
- Fan Yang 0060 — Avago Technologies, San Jose, CA, USA (and 2 more)
- Fan Yang 0061 — Wuhan University, LIEMARS, China
- Fan Yang 0062 — Beihang University, College of Software, Beijing, China
- Fan Yang 0063 — Hefei University of Technology, School of Computer and Information, China
- Fan Yang 0064 — Southwest University, College of Electronic and Information Engineering, Chongqing Key Laboratory of Nonlinear Circuits and Intelligent Information Processing, Chongqing, China
- Fan Yang 0065 — Guangxi University of Science and Technology, School of Computer Science and Communication Engineering, Liuzhou, China (and 1 more)
- Fan Yang 0066 — Hebei University of Technology, School of Electronic and Information Engineering, China
- Fan Yang 0067 — Jiangsu Normal University, School of Mathematics and Statistics, Xuzhou, China (and 1 more)
- Fan Yang 0068 — Shandong University, Cheeloo College of Medicine, School of Public Health, Department of Epidemiology and Biostatistics, Jinan, China
- Fan Yang 0069 — University of Electronic Science and Technology of China, School of Information and Communication Engineering, Chengdu, China (and 1 more)
- Fan Yang 0070 — Huazhong University of Science and Technology, School of Electrical and Electronic Engineering, State Key Laboratory of Advanced Electromagnetic Engineering and Technology, Wuhan, China
- Fan Yang 0071 — Nanjing University of Finance and Economics, College of Information Engineering, China
- Fan Yang 0072 — Beijing University of Posts and Telecommunications, MOE Key Laboratory of Universal Wireless Communications, China
- Fan Yang 0073 — Southwest Petroleum University, School of Electrical Engineering and Information, Chengdu, China
- Fan Yang 0074 — Jiangsu University of Science and Technology, Department of Mathematics and Physics, Zhenjiang, China (and 1 more)
- Fan Yang 0075 — University of Houston, Department of Computer Science, TX, USA
- Fan Yang 0076 — eBay Inc., San Jose, CA, USA
- Fan Yang 0077 — Peking University, Institute of Microelectronics, MOE Key Laboratory of Microelectronic Devices and Circuits, Beijing, China
- Fan Yang 0078 — Beijing University of Posts and Telecommunications, Pattern Recognition and Intelligent Vision Laboratory, China
- Fan Yang 0079 — Beihang University, School of Reliability and Systems Engineering, Science and Technology on Reliability and Environmental Engineering Laboratory, Beijing, China
- Fan Yang 0080 — Carnegie Mellon University, PA, USA
- Fan Yang 0081 — Tencent AI Lab
- Fan Yang 0082 — South China University of Technology, Guangzhou, China
- Fan Yang 0083 — Tsinghua University, School of Software, Beijing, China
- Fan Yang 0084 — Amazon.com
- Fan Yang 0085 — Shandong University, Data Science Institute, School of Mathematics, Jinan, China
- Fan Yang 0086 — Tsinghua University, Department of Electronic Engineering, Shenzhen International Graduate School, China
- Fan Yang 0087 — Meituan-Dianping Group, Beijing, China (and 1 more)
- Fan Yang 0088 — Huazhong University of Science and Technology, HUST, School of Artificial Intelligence and Automation, Wuhan, Hubei, China
- Fan Yang 0089 — SenseTime Research
- Fan Yang 0090 — University of Illinois Urbana-Champaign, IL, USA
- Fan Yang 0091 — Chinese University of Hong Kong, Department of Computer Science and Engineering, Hong Kong
- Fan Yang 0092 — ETH Zurich, Robotic Systems Lab, Switzerland (and 1 more)
- Fan Yang 0093 — Nankai University, Tianjin, China
- Fan Yang 0094 — KuaiShou Inc., Beijing, China
- Fan Yang 0095 — University of Technology Sydney, Australia
- Fan Yang 0096 — CAS, Institute of Computing Technology, Beijing, China (and 1 more)
- Fan Yang 0097 — University of Electronic Science and Technology of China, School of Information and Communication Engineering, Chengdu, China (and 1 more)
- Fan Yang 0098 — Beijing Institute of Technology, School of Mechanical Engineering, China
- Fan Yang 0099 — Harbin University of Science and Technology, Electrical and Electronic Engineering Department, China (and 1 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j9]Zhiqi Lin, Youshan Miao, Guanbin Xu, Cheng Li, Olli Saarikivi, Saeed Maleki, Fan Yang:
Efficient Schedule Construction for Distributed Execution of Large DNN Models. IEEE Trans. Parallel Distributed Syst. 35(12): 2375-2391 (2024) - [c55]Yue Guan, Yuxian Qiu, Jingwen Leng, Fan Yang, Shuo Yu, Yunxin Liu, Yu Feng, Yuhao Zhu, Lidong Zhou, Yun Liang, Chen Zhang, Chao Li, Minyi Guo:
Amanda: Unified Instrumentation Framework for Deep Neural Networks. ASPLOS (1) 2024: 1-18 - [c54]Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Fan Yang, Mao Yang:
Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning. EMNLP 2024: 13674-13695 - [c53]Guodong Liu, Youshan Miao, Zhiqi Lin, Xiaoxiang Shi, Saeed Maleki, Fan Yang, Yungang Bao, Sa Wang:
Aceso: Efficient Parallel DNN Training through Iterative Bottleneck Alleviation. EuroSys 2024: 163-181 - [c52]Zhiqi Lin, Youshan Miao, Guanbin Xu, Cheng Li, Olli Saarikivi, Saeed Maleki, Fan Yang:
Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search. HPCA 2024: 803-816 - [c51]Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. ICME 2024: 1-6 - [c50]Yiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang, Mao Yang:
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens. ICML 2024 - [c49]Mingzhe Xing, Rongkai Zhang, Hui Xue, Qi Chen, Fan Yang, Zhen Xiao:
Understanding the Weakness of Large Language Model Agents within a Complex Android Environment. KDD 2024: 6061-6072 - [c48]Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi, Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang:
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation. OSDI 2024: 307-323 - [c47]Zhiqi Lin, Youshan Miao, Quanlu Zhang, Fan Yang, Yi Zhu, Cheng Li, Saeed Maleki, Xu Cao, Ning Shang, Yilei Yang, Weijiang Xu, Mao Yang, Lintao Zhang, Lidong Zhou:
nnScaler: Constraint-Guided Parallelization Plan Generation for Deep Learning Training. OSDI 2024: 347-363 - [c46]Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu:
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable. OSDI 2024: 929-945 - [c45]Qi Chen, Xiubo Geng, Corby Rosset, Carolyn Buractaon, Jingwen Lu, Tao Shen, Kun Zhou, Chenyan Xiong, Yeyun Gong, Paul N. Bennett, Nick Craswell, Xing Xie, Fan Yang, Bryan Tower, Nikhil Rao, Anlei Dong, Wenqi Jiang, Zheng Liu, Mingqin Li, Chuanjie Liu, Zengzhong Li, Rangan Majumder, Jennifer Neville, Andy Oakley, Knut Magne Risvik, Harsha Vardhan Simhadri, Manik Varma, Yujing Wang, Linjun Yang, Mao Yang, Ce Zhang:
MS MARCO Web Search: A Large-scale Information-rich Web Dataset with Millions of Real Click Labels. WWW (Companion Volume) 2024: 292-301 - [c44]Yaoqi Chen, Ruicheng Zheng, Qi Chen, Shuotao Xu, Qianxi Zhang, Xue Wu, Weihao Han, Hua Yuan, Mingqin Li, Yujing Wang, Jason Li, Fan Yang, Hao Sun, Weiwei Deng, Feng Sun, Qi Zhang, Mao Yang:
OneSparse: A Unified System for Multi-index Vector Search. WWW (Companion Volume) 2024: 393-402 - [i31]Mingzhe Xing, Rongkai Zhang, Hui Xue, Qi Chen, Fan Yang, Zhen Xiao:
Understanding the Weakness of Large Language Model Agents within a Complex Android Environment. CoRR abs/2402.06596 (2024) - [i30]Yiran Ding, Li Lyna Zhang, Chengruidong Zhang, Yuanyuan Xu, Ning Shang, Jiahang Xu, Fan Yang, Mao Yang:
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens. CoRR abs/2402.13753 (2024) - [i29]Marah I Abdin, Sam Ade Jacobs, Ammar Ahmad Awan, Jyoti Aneja, Ahmed Awadallah, Hany Awadalla, Nguyen Bach, Amit Bahree, Arash Bakhtiari, Harkirat S. Behl, Alon Benhaim, Misha Bilenko, Johan Bjorck, Sébastien Bubeck, Martin Cai, Caio César Teodoro Mendes, Weizhu Chen, Vishrav Chaudhary, Parul Chopra, Allie Del Giorno, Gustavo de Rosa, Matthew Dixon, Ronen Eldan, Dan Iter, Amit Garg, Abhishek Goswami, Suriya Gunasekar, Emman Haider, Junheng Hao, Russell J. Hewett, Jamie Huynh, Mojan Javaheripi, Xin Jin, Piero Kauffmann, Nikos Karampatziakis, Dongwoo Kim, Mahoud Khademi, Lev Kurilenko, James R. Lee, Yin Tat Lee, Yuanzhi Li, Chen Liang, Weishung Liu, Eric Lin, Zeqi Lin, Piyush Madan, Arindam Mitra, Hardik Modi, Anh Nguyen, Brandon Norick, Barun Patra, Daniel Perez-Becker, Thomas Portet, Reid Pryzant, Heyang Qin, Marko Radmilac, Corby Rosset, Sambudha Roy, Olatunji Ruwase, Olli Saarikivi, Amin Saied, Adil Salim, Michael Santacroce, Shital Shah, Ning Shang, Hiteshi Sharma, Xia Song, Masahiro Tanaka, Xin Wang, Rachel Ward, Guanhua Wang, Philipp Witte, Michael Wyatt, Can Xu, Jiahang Xu, Sonali Yadav, Fan Yang, Ziyi Yang, Donghan Yu, Chengruidong Zhang, Cyril Zhang, Jianwen Zhang, Li Lyna Zhang, Yi Zhang, Yue Zhang, Yunan Zhang, Xiren Zhou:
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone. CoRR abs/2404.14219 (2024) - [i28]Bin Xiao, Chunan Shi, Xiaonan Nie, Fan Yang, Xiangwei Deng, Lei Su, Weipeng Chen, Bin Cui:
Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge. CoRR abs/2405.00263 (2024) - [i27]Qi Chen, Xiubo Geng, Corby Rosset, Carolyn Buractaon, Jingwen Lu, Tao Shen, Kun Zhou, Chenyan Xiong, Yeyun Gong, Paul N. Bennett, Nick Craswell, Xing Xie, Fan Yang, Bryan Tower, Nikhil Rao, Anlei Dong, Wenqi Jiang, Zheng Liu, Mingqin Li, Chuanjie Liu, Zengzhong Li, Rangan Majumder, Jennifer Neville, Andy Oakley, Knut Magne Risvik, Harsha Vardhan Simhadri, Manik Varma, Yujing Wang, Linjun Yang, Mao Yang, Ce Zhang:
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels. CoRR abs/2405.07526 (2024) - [i26]Chaofan Lin, Zhenhua Han, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu:
Parrot: Efficient Serving of LLM-based Applications with Semantic Variable. CoRR abs/2405.19888 (2024) - [i25]Miao Zheng, Hao Liang, Fan Yang, Haoze Sun, Tianpeng Li, Lingchu Xiong, Yan Zhang, Youzhen Wu, Kun Li, Yanjun Shen, Mingan Lin, Tao Zhang, Guosheng Dong, Yujing Qiao, Kun Fang, Weipeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou:
PAS: Data-Efficient Plug-and-Play Prompt Augmentation System. CoRR abs/2407.06027 (2024) - [i24]Tao Zhang, Yanjun Shen, Wenjing Luo, Yan Zhang, Hao Liang, Tao Zhang, Fan Yang, Mingan Lin, Yujing Qiao, Weipeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou:
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs. CoRR abs/2408.01122 (2024) - [i23]Zhiwen Mo, Lei Wang, Jianyu Wei, Zhichen Zeng, Shijie Cao, Lingxiao Ma, Naifeng Jing, Ting Cao, Jilong Xue, Fan Yang, Mao Yang:
LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration. CoRR abs/2408.06003 (2024) - [i22]Zhenting Qi, Mingyuan Ma, Jiahang Xu, Li Lyna Zhang, Fan Yang, Mao Yang:
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers. CoRR abs/2408.06195 (2024) - [i21]Di Liu, Meng Chen, Baotong Lu, Huiqiang Jiang, Zhenhua Han, Qianxi Zhang, Qi Chen, Chengruidong Zhang, Bailu Ding, Kai Zhang, Chen Chen, Fan Yang, Yuqing Yang, Lili Qiu:
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval. CoRR abs/2409.10516 (2024) - 2023
- [c43]Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Ming Gong, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan:
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. ACL (1) 2023: 1309-1320 - [c42]Diandian Gu, Yihao Zhao, Yinmin Zhong, Yifan Xiong, Zhenhua Han, Peng Cheng, Fan Yang, Gang Huang, Xin Jin, Xuanzhe Liu:
ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning. ASPLOS (2) 2023: 266-280 - [c41]Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-Scale DNN Training. ECAI 2023: 3058-3065 - [c40]Hanyu Zhao, Zhenhua Han, Zhi Yang, Quanlu Zhang, Mingxia Li, Fan Yang, Qianxi Zhang, Binyang Li, Yuqing Yang, Lili Qiu, Lintao Zhang, Lidong Zhou:
SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters. EuroSys 2023: 883-898 - [c39]Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. IJCAI 2023: 1506-1514 - [c38]Cong Guo, Jiaming Tang, Weiming Hu, Jingwen Leng, Chen Zhang, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization. ISCA 2023: 3:1-3:15 - [c37]Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, HoYuen Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong:
Tutel: Adaptive Mixture-of-Experts at Scale. MLSys 2023 - [c36]Bin Lin, Ningxin Zheng, Lei Wang, Shijie Cao, Lingxiao Ma, Quanlu Zhang, Yi Zhu, Ting Cao, Jilong Xue, Yuqing Yang, Fan Yang:
Efficient GPU Kernels for N: M-Sparse Weights in Deep Learning. MLSys 2023 - [c35]Hailin Zhang, Yujing Wang, Qi Chen, Ruiheng Chang, Ting Zhang, Ziming Miao, Yingyan Hou, Yang Ding, Xupeng Miao, Haonan Wang, Bochen Pang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Xing Xie, Mao Yang, Bin Cui:
Model-enhanced Vector Index. NeurIPS 2023 - [c34]Chieh-Jan Mike Liang, Zilin Fang, Yuqing Xie, Fan Yang, Zhao Lucis Li, Li Lyna Zhang, Mao Yang, Lidong Zhou:
On Modular Learning of Distributed Systems for Predicting End-to-End Latency. NSDI 2023: 1081-1095 - [c33]Qianxi Zhang, Shuotao Xu, Qi Chen, Guoxin Sui, Jiadong Xie, Zhizhen Cai, Yaoqi Chen, Yinxuan He, Yuqing Yang, Fan Yang, Mao Yang, Lidong Zhou:
VBASE: Unifying Online Vector Similarity Search and Relational Queries via Relaxed Monotonicity. OSDI 2023: 377-395 - [c32]Chen Zhang, Lingxiao Ma, Jilong Xue, Yining Shi, Ziming Miao, Fan Yang, Jidong Zhai, Zhi Yang, Mao Yang:
Cocktailer: Analyzing and Optimizing Dynamic Control Flow in Deep Learning. OSDI 2023: 681-699 - [c31]Yining Shi, Zhi Yang, Jilong Xue, Lingxiao Ma, Yuqing Xia, Ziming Miao, Yuxiao Guo, Fan Yang, Lidong Zhou:
Welder: Scheduling Deep Learning Memory Access via Tile-graph. OSDI 2023: 701-718 - [c30]Weihao Cui, Zhenhua Han, Lingji Ouyang, Yichuan Wang, Ningxin Zheng, Lingxiao Ma, Yuqing Yang, Fan Yang, Jilong Xue, Lili Qiu, Lidong Zhou, Quan Chen, Haisheng Tan, Minyi Guo:
Optimizing Dynamic Neural Networks with Brainstorm. OSDI 2023: 797-815 - [c29]Ningxin Zheng, Huiqiang Jiang, Quanlu Zhang, Zhenhua Han, Lingxiao Ma, Yuqing Yang, Fan Yang, Chengruidong Zhang, Lili Qiu, Mao Yang, Lidong Zhou:
PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation. SOSP 2023: 331-347 - [c28]Yuming Xu, Hengyu Liang, Jin Li, Shuotao Xu, Qi Chen, Qianxi Zhang, Cheng Li, Ziyue Yang, Fan Yang, Yuqing Yang, Peng Cheng, Mao Yang:
SPFresh: Incremental In-Place Update for Billion-Scale Vector Search. SOSP 2023: 545-561 - [i20]Zhiqi Lin, Youshan Miao, Guodong Liu, Xiaoxiang Shi, Quanlu Zhang, Fan Yang, Saeed Maleki, Yi Zhu, Xu Cao, Cheng Li, Mao Yang, Lintao Zhang, Lidong Zhou:
SuperScaler: Supporting Flexible DNN Parallelization via a Unified Abstraction. CoRR abs/2301.08984 (2023) - [i19]Ningxin Zheng, Huiqiang Jiang, Quanlu Zhang, Zhenhua Han, Yuqing Yang, Lingxiao Ma, Fan Yang, Lili Qiu, Mao Yang, Lidong Zhou:
SparDA: Accelerating Dynamic Sparse Deep Neural Networks via Sparse-Dense Transformation. CoRR abs/2301.10936 (2023) - [i18]Xiaodong Wang, Chenfei Wu, Shengming Yin, Minheng Ni, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Fan Yang, Lijuan Wang, Zicheng Liu, Yuejian Fang, Nan Duan:
Learning 3D Photography Videos via Self-supervised Diffusion on Single Images. CoRR abs/2302.10781 (2023) - [i17]Yidan Zhang, Ting Zhang, Dong Chen, Yujing Wang, Qi Chen, Xing Xie, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Mao Yang, Qingmin Liao, Baining Guo:
IRGen: Generative Modeling for Image Retrieval. CoRR abs/2303.10126 (2023) - [i16]Shengming Yin, Chenfei Wu, Huan Yang, Jianfeng Wang, Xiaodong Wang, Minheng Ni, Zhengyuan Yang, Linjie Li, Shuguang Liu, Fan Yang, Jianlong Fu, Gong Ming, Lijuan Wang, Zicheng Liu, Houqiang Li, Nan Duan:
NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation. CoRR abs/2303.12346 (2023) - [i15]Cong Guo, Jiaming Tang, Weiming Hu, Jingwen Leng, Chen Zhang, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
OliVe: Accelerating Large Language Models via Hardware-friendly Outlier-Victim Pair Quantization. CoRR abs/2304.07493 (2023) - [i14]Yijia Zhang, Lingran Zhao, Shijie Cao, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. CoRR abs/2305.12356 (2023) - [i13]Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training. CoRR abs/2305.19982 (2023) - [i12]Hailin Zhang, Yujing Wang, Qi Chen, Ruiheng Chang, Ting Zhang, Ziming Miao, Yingyan Hou, Yang Ding, Xupeng Miao, Haonan Wang, Bochen Pang, Yuefeng Zhan, Hao Sun, Weiwei Deng, Qi Zhang, Fan Yang, Xing Xie, Mao Yang, Bin Cui:
Model-enhanced Vector Index. CoRR abs/2309.13335 (2023) - [i11]Hongyu Wang, Shuming Ma, Li Dong, Shaohan Huang, Huaijie Wang, Lingxiao Ma, Fan Yang, Ruiping Wang, Yi Wu, Furu Wei:
BitNet: Scaling 1-bit Transformers for Large Language Models. CoRR abs/2310.11453 (2023) - [i10]Zhiqi Lin, Youshan Miao, Guanbin Xu, Cheng Li, Olli Saarikivi, Saeed Maleki, Fan Yang:
Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule Search. CoRR abs/2311.15269 (2023) - 2022
- [c27]Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan:
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion. ECCV (16) 2022: 720-736 - [c26]Cong Guo, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo:
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training. ICCD 2022: 738-745 - [c25]Cong Guo, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu, Minyi Guo:
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation. ICLR 2022 - [c24]Cong Guo, Chen Zhang, Jingwen Leng, Zihan Liu, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization. MICRO 2022: 1414-1433 - [c23]Ningxin Zheng, Bin Lin, Quanlu Zhang, Lingxiao Ma, Yuqing Yang, Fan Yang, Yang Wang, Mao Yang, Lidong Zhou:
SparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute. OSDI 2022: 213-232 - [c22]Hongyu Zhu, Ruofan Wu, Yijia Diao, Shanbin Ke, Haoyu Li, Chen Zhang, Jilong Xue, Lingxiao Ma, Yuqing Xia, Wei Cui, Fan Yang, Mao Yang, Lidong Zhou, Asaf Cidon, Gennady Pekhimenko:
ROLLER: Fast and Efficient Tensor Compilation for Deep Learning. OSDI 2022: 233-248 - [c21]Shitao Xiao, Zheng Liu, Weihao Han, Jianjin Zhang, Defu Lian, Yeyun Gong, Qi Chen, Fan Yang, Hao Sun, Yingxia Shao, Xing Xie:
Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings. SIGIR 2022: 1513-1523 - [c20]Wei Zhang, Binghao Chen, Zhenhua Han, Quan Chen, Peng Cheng, Fan Yang, Ran Shu, Yuqing Yang, Minyi Guo:
PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training. USENIX ATC 2022: 217-232 - [i9]Cong Guo, Yuxian Qiu, Jingwen Leng, Xiaotian Gao, Chen Zhang, Yunxin Liu, Fan Yang, Yuhao Zhu, Minyi Guo:
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation. CoRR abs/2202.07471 (2022) - [i8]Shitao Xiao, Zheng Liu, Weihao Han, Jianjin Zhang, Defu Lian, Yeyun Gong, Qi Chen, Fan Yang, Hao Sun, Yingxia Shao, Denvy Deng, Qi Zhang, Xing Xie:
Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings. CoRR abs/2204.00185 (2022) - [i7]Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong:
Tutel: Adaptive Mixture-of-Experts at Scale. CoRR abs/2206.03382 (2022) - [i6]Cong Guo, Chen Zhang, Jingwen Leng, Zihan Liu, Fan Yang, Yunxin Liu, Minyi Guo, Yuhao Zhu:
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization. CoRR abs/2208.14286 (2022) - [i5]Cong Guo, Yuxian Qiu, Jingwen Leng, Chen Zhang, Ying Cao, Quanlu Zhang, Yunxin Liu, Fan Yang, Minyi Guo:
Nesting Forward Automatic Differentiation for Memory-Efficient Deep Neural Network Training. CoRR abs/2209.10778 (2022) - 2021
- [i4]Chenfei Wu, Lun Huang, Qianxi Zhang, Binyang Li, Lei Ji, Fan Yang, Guillermo Sapiro, Nan Duan:
GODIVA: Generating Open-DomaIn Videos from nAtural Descriptions. CoRR abs/2104.14806 (2021) - [i3]Chenfei Wu, Jian Liang, Lei Ji, Fan Yang, Yuejian Fang, Daxin Jiang, Nan Duan:
NÜWA: Visual Synthesis Pre-training for Neural visUal World creAtion. CoRR abs/2111.12417 (2021) - 2020
- [c19]Xuan Peng, Xuanhua Shi, Hulin Dai, Hai Jin, Weiliang Ma, Qian Xiong, Fan Yang, Xuehai Qian:
Capuchin: Tensor-based GPU Memory Management for Deep Learning. ASPLOS 2020: 891-905 - [c18]Yaobo Liang, Nan Duan, Yeyun Gong, Ning Wu, Fenfei Guo, Weizhen Qi, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Xiaodong Fan, Ruofei Zhang, Rahul Agrawal, Edward Cui, Sining Wei, Taroon Bharti, Ying Qiao, Jiun-Hung Chen, Winnie Wu, Shuguang Liu, Fan Yang, Daniel Campos, Rangan Majumder, Ming Zhou:
XGLUE: A New Benchmark Datasetfor Cross-lingual Pre-training, Understanding and Generation. EMNLP (1) 2020: 6008-6018 - [c17]Qiushi Li, Wenwu Zhu, Chao Wu, Xinglin Pan, Fan Yang, Yuezhi Zhou, Yaoxue Zhang:
InvisibleFL: Federated Learning over Non-Informative Intermediate Updates against Multimedia Privacy Leakages. ACM Multimedia 2020: 753-762 - [c16]Hanyu Zhao, Zhenhua Han, Zhi Yang, Quanlu Zhang, Fan Yang, Lidong Zhou, Mao Yang, Francis C. M. Lau, Yuqi Wang, Yifan Xiong, Bin Wang:
HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees. OSDI 2020: 515-532 - [c15]Lingxiao Ma, Zhiqiang Xie, Zhi Yang, Jilong Xue, Youshan Miao, Wei Cui, Wenxiang Hu, Fan Yang, Lintao Zhang, Lidong Zhou:
Rammer: Enabling Holistic Deep Learning Compiler Optimizations with rTasks. OSDI 2020: 881-897 - [c14]Quanlu Zhang, Zhenhua Han, Fan Yang, Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou:
Retiarii: A Deep Learning Exploratory-Training Framework. OSDI 2020: 919-936 - [i2]Yaobo Liang, Nan Duan, Yeyun Gong, Ning Wu, Fenfei Guo, Weizhen Qi, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Xiaodong Fan, Bruce Zhang, Rahul Agrawal, Edward Cui, Sining Wei, Taroon Bharti, Ying Qiao, Jiun-Hung Chen, Winnie Wu, Shuguang Liu, Fan Yang, Rangan Majumder, Ming Zhou:
XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation. CoRR abs/2004.01401 (2020)
2010 – 2019
- 2019
- [c13]Myeongjae Jeon, Shivaram Venkataraman, Amar Phanishayee, Junjie Qian, Wencong Xiao, Fan Yang:
Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. USENIX ATC 2019: 947-960 - [i1]Myeongjae Jeon, Shivaram Venkataraman, Amar Phanishayee, Junjie Qian, Wencong Xiao, Fan Yang:
Analysis of Large-Scale Multi-Tenant GPU Clusters for DNN Training Workloads. CoRR abs/1901.05758 (2019) - 2018
- [c12]Wencong Xiao, Zhenhua Han, Hanyu Zhao, Xuan Peng, Quanlu Zhang, Fan Yang, Lidong Zhou:
Scheduling CPU for GPU-based Deep Learning Jobs. SoCC 2018: 503 - [c11]Wencong Xiao, Romil Bhardwaj, Ramachandran Ramjee, Muthian Sivathanu, Nipun Kwatra, Zhenhua Han, Pratyush Patel, Xuan Peng, Hanyu Zhao, Quanlu Zhang, Fan Yang, Lidong Zhou:
Gandiva: Introspective Cluster Scheduling for Deep Learning. OSDI 2018: 595-610 - 2015
- [j8]Youshan Miao, Wentao Han, Kaiwei Li, Ming Wu, Fan Yang, Lidong Zhou, Vijayan Prabhakaran, Enhong Chen, Wenguang Chen:
ImmortalGraph: A System for Storage and Analysis of Temporal Graphs. ACM Trans. Storage 11(3): 14:1-14:34 (2015) - [c10]Ming Wu, Fan Yang, Jilong Xue, Wencong Xiao, Youshan Miao, Lan Wei, Haoxiang Lin, Yafei Dai, Lidong Zhou:
GraM: scaling graph computation to the trillions. SoCC 2015: 408-421 - 2014
- [c9]Wentao Han, Youshan Miao, Kaiwei Li, Ming Wu, Fan Yang, Lidong Zhou, Vijayan Prabhakaran, Wenguang Chen, Enhong Chen:
Chronos: a graph engine for temporal graph analysis. EuroSys 2014: 1:1-1:14 - 2012
- [c8]Raymond Cheng, Ji Hong, Aapo Kyrola, Youshan Miao, Xuetian Weng, Ming Wu, Fan Yang, Lidong Zhou, Feng Zhao, Enhong Chen:
Kineograph: taking the pulse of a fast-changing and connected world. EuroSys 2012: 85-98
2000 – 2009
- 2007
- [j7]Qian Zhang, Qing Chen, Fan Yang, Xuemin Shen, Zhisheng Niu:
Cooperative and opportunistic transmission for wireless ad hoc networks. IEEE Netw. 21(1): 14-20 (2007) - [j6]Kun Wang, Fan Yang, Qian Zhang, Dapeng Oliver Wu, Yinlong Xu:
Distributed Cooperative Rate Adaptation for Energy Efficiency in IEEE 802.11-Based Multihop Networks. IEEE Trans. Veh. Technol. 56(2): 888-898 (2007) - [j5]Kun Wang, Fan Yang, Qian Zhang, Yinlong Xu:
Modeling path capacity in multi-hop IEEE 802.11 networks for QoS services. IEEE Trans. Wirel. Commun. 6(2): 738-749 (2007) - 2006
- [j4]Haitao Wu, Fan Yang, Kun Tan, Jie Chen, Qian Zhang, Zhensheng Zhang:
Distributed Channel Assignment and Routing in Multiradio Multichannel Multihop Wireless Networks. IEEE J. Sel. Areas Commun. 24(11): 1972-1983 (2006) - [j3]Jin Zhao, Fan Yang, Qian Zhang, Zhensheng Zhang, Fuyan Zhang:
LION: Layered Overlay Multicast With Network Coding. IEEE Trans. Multim. 8(5): 1021-1032 (2006) - [c7]Jin Zhao, Fan Yang, Qian Zhang, Zhensheng Zhang:
On Improving the Throughput of Media Delivery Applications in Heterogeneous Overlay Network. GLOBECOM 2006 - [c6]Cong Peng, Fan Yang, Qian Zhang, Dapeng Wu, Ming Zhao, Yan Yao:
Impact of Power and Rate Selection on the Throughput of Ad Hoc Networks. ICC 2006: 3897-3902 - [c5]Kun Wang, Fan Yang, Qian Zhang, Yinlong Xu, Feng Wang:
Modeling Path Capacity in Multi-hop IEEE 802.11 Networks for QoS Services. MASS 2006: 771-776 - [c4]Kun Wang, Fan Yang, Qian Zhang, Dapeng Oliver Wu, Yinlong Xu:
Distributed cooperative rate adaptation for energy efficiency in IEEE 802.11-based multi-hop networks. QSHINE 2006: 1 - 2005
- [j2]Qian Zhang, Fan Yang, Wenwu Zhu:
Cross-layer QoS Support for Multimedia Delivery over Wireless Internet. EURASIP J. Adv. Signal Process. 2005(2): 207-219 (2005) - [c3]Kultida Rojviboonchai, Fan Yang, Qian Zhang, Hitoshi Aida, Wenwu Zhu:
AMTP: a multipath multimedia streaming protocol for mobile ad hoc networks. ICC 2005: 1246-1250 - 2004
- [j1]Fan Yang, Qian Zhang, Wenwu Zhu, Ya-Qin Zhang:
End-to-end TCP-friendly streaming protocol and bit allocation for scalable video over wireless Internet. IEEE J. Sel. Areas Commun. 22(4): 777-790 (2004) - [c2]Fan Yang, Qian Zhang, Wenwu Zhu, Ya-Qin Zhang:
Streaming and Bit Allocation for Scalable Video over Mobile Wireless Internet. INFOCOM 2004: 2142-2151 - 2003
- [c1]Fan Yang, Qian Zhang, Wenwu Zhu, Ya-Qin Zhang:
An end-to-end TCP-friendly streaming protocol for multimedia over wireless Internet. ICME 2003: 429-432
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 20:38 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint