default search action
Guoping Long
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i12]Mingcong Song, Xinru Tang, Fengfan Hou, Jing Li, Wei Wei, Yipeng Ma, Runqiu Xiao, Hongjie Si, Dingcheng Jiang, Shouyi Yin, Yang Hu, Guoping Long:
Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels. CoRR abs/2412.18106 (2024) - 2022
- [c34]Zhen Zheng, Xuanda Yang, Pengzhan Zhao, Guoping Long, Kai Zhu, Feiwen Zhu, Wenyi Zhao, Xiaoyong Liu, Jun Yang, Jidong Zhai, Shuaiwen Leon Song, Wei Lin:
AStitch: enabling a new multi-dimensional optimization space for memory-intensive ML training and inference on modern SIMT architectures. ASPLOS 2022: 359-373 - [c33]Ziyue Luo, Xiaodong Yi, Guoping Long, Shiqing Fan, Chuan Wu, Jun Yang, Wei Lin:
Efficient Pipeline Planning for Expedited Distributed DNN Training. INFOCOM 2022: 340-349 - [i11]Ziyue Luo, Xiaodong Yi, Guoping Long, Shiqing Fan, Chuan Wu, Jun Yang, Wei Lin:
Efficient Pipeline Planning for Expedited Distributed DNN Training. CoRR abs/2204.10562 (2022) - 2021
- [c32]Shiqing Fan, Yi Rong, Chen Meng, Zongyan Cao, Siyu Wang, Zhen Zheng, Chuan Wu, Guoping Long, Jun Yang, Lixue Xia, Lansong Diao, Xiaoyong Liu, Wei Lin:
DAPPLE: a pipelined data parallel approach for training large models. PPoPP 2021: 431-445 - 2020
- [j4]Jia He, Changying Du, Fuzhen Zhuang, Xin Yin, Qing He, Guoping Long:
Online Bayesian max-margin subspace learning for multi-view classification and regression. Mach. Learn. 109(2): 219-249 (2020) - [c31]Xiaodong Yi, Shiwei Zhang, Ziyue Luo, Guoping Long, Lansong Diao, Chuan Wu, Zhen Zheng, Jun Yang, Wei Lin:
Optimizing distributed training deployment in heterogeneous GPU clusters. CoNEXT 2020: 93-107 - [c30]Xiaodong Yi, Ziyue Luo, Chen Meng, Mengdi Wang, Guoping Long, Chuan Wu, Jun Yang, Wei Lin:
Fast Training of Deep Learning Models over Multiple GPUs. Middleware 2020: 105-118 - [i10]Shiqing Fan, Yi Rong, Chen Meng, Zongyan Cao, Siyu Wang, Zhen Zheng, Chuan Wu, Guoping Long, Jun Yang, Lixue Xia, Lansong Diao, Xiaoyong Liu, Wei Lin:
DAPPLE: A Pipelined Data Parallel Approach for Training Large Models. CoRR abs/2007.01045 (2020) - [i9]Siyu Wang, Yi Rong, Shiqing Fan, Zhen Zheng, Lansong Diao, Guoping Long, Jun Yang, Xiaoyong Liu, Wei Lin:
Auto-MAP: A DQN Framework for Exploring Distributed Execution Plans for DNN Workloads. CoRR abs/2007.04069 (2020) - [i8]Zhen Zheng, Pengzhan Zhao, Guoping Long, Feiwen Zhu, Kai Zhu, Wenyi Zhao, Lansong Diao, Jun Yang, Wei Lin:
FusionStitching: Boosting Memory Intensive Computations for Deep Learning Workloads. CoRR abs/2009.10924 (2020)
2010 – 2019
- 2019
- [c29]Mengdi Wang, Chen Meng, Guoping Long, Chuan Wu, Jun Yang, Wei Lin, Yangqing Jia:
Characterizing Deep Learning Training Workloads on Alibaba-PAI. IISWC 2019: 189-202 - [i7]Changying Du, Fuzhen Zhuang, Jia He, Qing He, Guoping Long:
Learning beyond Predefined Label Space via Bayesian Nonparametric Topic Modelling. CoRR abs/1910.04420 (2019) - [i6]Changying Du, Jia He, Changde Du, Fuzhen Zhuang, Qing He, Guoping Long:
Efficient and Adaptive Kernelization for Nonlinear Max-margin Multi-view Learning. CoRR abs/1910.05250 (2019) - [i5]Mengdi Wang, Chen Meng, Guoping Long, Chuan Wu, Jun Yang, Wei Lin, Yangqing Jia:
Characterizing Deep Learning Training Workloads on Alibaba-PAI. CoRR abs/1910.05930 (2019) - [i4]Guoping Long, Jun Yang, Wei Lin:
FusionStitching: Boosting Execution Efficiency of Memory Intensive Computations for DL Workloads. CoRR abs/1911.11576 (2019) - 2018
- [i3]Guoping Long, Jun Yang, Kai Zhu, Wei Lin:
FusionStitching: Deep Fusion and Code Generation for Tensorflow Computations on GPUs. CoRR abs/1811.05213 (2018) - 2017
- [c28]Xiaoyu Shen, Hui Su, Yanran Li, Wenjie Li, Shuzi Niu, Yang Zhao, Akiko Aizawa, Guoping Long:
A Conditional Variational Framework for Dialog Generation. ACL (2) 2017: 504-509 - [c27]Jia He, Changying Du, Changde Du, Fuzhen Zhuang, Qing He, Guoping Long:
Nonlinear Maximum Margin Multi-View Learning with Adaptive Kernel. IJCAI 2017: 1830-1836 - [i2]Xiaoyu Shen, Hui Su, Yanran Li, Wenjie Li, Shuzi Niu, Yang Zhao, Akiko Aizawa, Guoping Long:
A Conditional Variational Framework for Dialog Generation. CoRR abs/1705.00316 (2017) - 2016
- [j3]Wenjing Ma, Kan Gao, Guoping Long:
Highly Optimized Code Generation for Stencil Codes with Computation Reuse for GPUs. J. Comput. Sci. Technol. 31(6): 1262-1274 (2016) - [c26]Jia He, Changying Du, Fuzhen Zhuang, Xin Yin, Qing He, Guoping Long:
Online Bayesian Max-Margin Subspace Multi-View Learning. IJCAI 2016: 1555-1561 - [c25]Siqi Deng, Kan Gao, Changying Du, Wenjing Ma, Guoping Long, Yucheng Li:
Online variational Bayesian Support Vector Regression. IJCNN 2016: 3950-3957 - [c24]Wenjing Ma, Liangliang Cao, Lei Yu, Guoping Long, Yucheng Li:
GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring. ICMR 2016: 39-46 - [c23]Changde Du, Changying Du, Shandian Zhe, Ali Luo, Qing He, Guoping Long:
Bayesian Group Feature Selection for Support Vector Learning Machines. PAKDD (1) 2016: 239-252 - [c22]Changying Du, Fuzhen Zhuang, Jia He, Qing He, Guoping Long:
Learning Beyond Predefined Label Space via Bayesian Nonparametric Topic Modelling. ECML/PKDD (1) 2016: 148-164 - [c21]Changying Du, Changde Du, Guoping Long, Xin Jin, Yucheng Li:
Efficient Bayesian Maximum Margin Multiple Kernel Learning. ECML/PKDD (1) 2016: 165-181 - [c20]Changying Du, Changde Du, Guoping Long, Qing He, Yucheng Li:
Online Bayesian Multiple Kernel Bipartite Ranking. UAI 2016 - [c19]Ning Bu, Shuzi Niu, Lei Yu, Wenjing Ma, Guoping Long:
Bridging Semantic Gap Between App Names: Collective Matrix Factorization for Similar Mobile App Recommendation. WISE (2) 2016: 324-339 - [i1]Wenjing Ma, Liangliang Cao, Lei Yu, Guoping Long, Yucheng Li:
GPU-FV: Realtime Fisher Vector and Its Applications in Video Monitoring. CoRR abs/1604.03498 (2016) - 2015
- [c18]Chenggang Zhou, Qiankun Dong, Wenjing Ma, Guoping Long, Tao Li:
PE-TLD: Parallel Extended Tracking-Learning-Detection for Multi-target Tracking. ICA3PP (2) 2015: 665-677 - [c17]Ning Bu, Lei Yu, Wenjing Ma, Changying Du, Shuzi Niu, Guoping Long:
Detect Similar Mobile Applications with Transfer Learning. SmartCity 2015: 856-859 - [c16]Shuzi Niu, Yanyan Lan, Jiafeng Guo, Xueqi Cheng, Lei Yu, Guoping Long:
Listwise Approach for Rank Aggregation in Crowdsourcing. WSDM 2015: 253-262 - 2014
- [c15]Zhenhua Wu, Wenjing Ma, Guoping Long, Yucheng Li, Qiuyan Tang, Zhongjie Wang:
High performance two-dimensional phase unwrapping on GPUs. Conf. Computing Frontiers 2014: 35:1-35:10 - 2013
- [j2]Yan Li, Yunquan Zhang, Yiqung Liu, Guoping Long, Haipeng Jia:
MPFFT: An Auto-Tuning FFT Library for OpenCL GPUs. J. Comput. Sci. Technol. 28(1): 90-105 (2013) - [c14]Hebatallah Saadeldeen, Diana Franklin, Guoping Long, Charlotte Hill, Aisha Browne, Dmitri B. Strukov, Timothy Sherwood, Frederic T. Chong:
Memristors for neural branch prediction: a case study in strict latency and write endurance challenges. Conf. Computing Frontiers 2013: 26:1-26:10 - [c13]Weiyan Wang, Yunquan Zhang, Guoping Long, Shengen Yan, Haipeng Jia:
CLSIFT: An Optimization Study of the Scale Invariance Feature Transform on GPUs. HPCC/EUC 2013: 93-100 - [c12]Shengen Yan, Guoping Long, Yunquan Zhang:
StreamScan: fast scan algorithms for GPUs without global barrier synchronization. PPoPP 2013: 229-238 - 2012
- [c11]Haipeng Jia, Yunquan Zhang, Guoping Long, Jianliang Xu, Shengen Yan, Yan Li:
GPURoofline: A Model for Guiding Performance Optimizations on GPUs. Euro-Par 2012: 920-932 - [c10]Haipeng Jia, Yunquan Zhang, Guoping Long, Shengen Yan:
An Insightful Program Performance Tuning Chain for GPU Computing. ICA3PP (1) 2012: 502-516 - 2011
- [c9]Xiangzheng Sun, Yunquan Zhang, Ting Wang, Guoping Long, Xianyi Zhang, Yan Li:
CRSD: Application Specific Auto-tuning of SpMV for Diagonal Sparse Matrices. Euro-Par (2) 2011: 316-327 - [c8]Yan Li, Yunquan Zhang, Haipeng Jia, Guoping Long, Ke Wang:
Automatic FFT Performance Tuning on OpenCL GPUs. ICPADS 2011: 228-235 - 2010
- [c7]Guoping Long, Diana Franklin, Susmit Biswas, Pablo J. Ortiz, Jason Oberg, Dongrui Fan, Frederic T. Chong:
Minimal Multi-threading: Finding and Removing Redundant Instructions in Multi-threaded Processors. MICRO 2010: 337-348
2000 – 2009
- 2009
- [j1]Dongrui Fan, Nan Yuan, Junchao Zhang, Yongbin Zhou, Wei Lin, Fenglong Song, Xiaochun Ye, He Huang, Lei Yu, Guoping Long, Hao Zhang, Lei Liu:
Godson-T: An Efficient Many-Core Architecture for Parallel Program Executions. J. Comput. Sci. Technol. 24(6): 1061-1073 (2009) - [c6]Guoping Long, Dongrui Fan, Junchao Zhang:
Characterizing and Understanding the Bandwidth Behavior of Workloads on Multi-core Processors. Euro-Par 2009: 110-121 - [c5]Guoping Long, Dongrui Fan, Junchao Zhang:
Architectural support for cilk computations on many-core architectures. PPoPP 2009: 285-286 - 2008
- [c4]Guoping Long, Dongrui Fan, Junchao Zhang, Fenglong Song, Nan Yuan, Wei Lin:
A Performance Model of Dense Matrix Operations on Many-Core Architectures. Euro-Par 2008: 120-129 - [c3]Guoping Long, Nan Yuan, Dongrui Fan:
Location Consistency Model Revisited: Problem, Solution and Prospects. PDCAT 2008: 91-98 - 2007
- [c2]Xuehai Qian, He Huang, Hao Zhang, Guoping Long, Junchao Zhang, Dongrui Fan:
Design and Implementation of Floating Point Stack on General RISC Architecture. PDP 2007: 238-245 - 2004
- [c1]Zhigang Chen, Anfeng Liu, Guoping Long:
A Resource Organizing Protocol for Grid Based on Bounded Two-Level Broadcasting Technique. GCC 2004: 472-478
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-27 00:43 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint