default search action
Xiaoteng Ma
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c24]Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao:
Learning Diverse Risk Preferences in Population-Based Self-Play. AAAI 2024: 12910-12918 - [c23]Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang:
Efficient Multi-agent Reinforcement Learning by Planning. ICLR 2024 - [c22]Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Li Xiu, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. ICLR 2024 - [c21]Zhipeng Liang, Xiaoteng Ma, José H. Blanchet, Jun Yang, Jiheng Zhang, Zhengyuan Zhou:
Single-Trajectory Distributionally Robust Reinforcement Learning. ICML 2024 - [c20]Xiaoteng Ma, Qing Li, Junkun Peng, Gareth Tyson, Ziwen Ye, Shisong Tang, Qian Ma, Shengbin Meng, Gabriel-Miro Muntean:
Smart Data-Driven Proactive Push to Edge Network for User-Generated Videos. INFOCOM 2024: 511-520 - [c19]Ziwen Ye, Qing Li, Chunyu Qiao, Xiaoteng Ma, Yong Jiang, Qian Ma, Shengbin Meng, Zhenhui Yuan, Zili Meng:
KEPC-Push: A Knowledge-Enhanced Proactive Content Push Strategy for Edge-Assisted Video Feed Streaming. USENIX ATC 2024: 321-338 - [i22]Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. CoRR abs/2402.03807 (2024) - [i21]Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang:
Efficient Multi-agent Reinforcement Learning by Planning. CoRR abs/2405.11778 (2024) - 2023
- [j7]Shuai Ma, Xiaoteng Ma, Li Xia:
A unified algorithm framework for mean-variance optimization in discounted Markov decision processes. Eur. J. Oper. Res. 311(3): 1057-1067 (2023) - [j6]Ziwen Ye, Qing Li, Xiaoteng Ma, Dan Zhao, Yong Jiang, Lianbo Ma, Bo Yi, Gabriel-Miro Muntean:
VRCT: A Viewport Reconstruction-Based 360° Video Caching Solution for Tile-Adaptive Streaming. IEEE Trans. Broadcast. 69(3): 691-703 (2023) - [c18]Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li:
Uncertainty-Driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning. ECAI 2023: 3018-3025 - [c17]Rui Yang, Lin Yong, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? ICML 2023: 39543-39571 - [c16]Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao:
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning (Extended Abstract). IJCAI 2023: 6925-6930 - [c15]Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. NeurIPS 2023 - [i20]Zhipeng Liang, Xiaoteng Ma, Jose H. Blanchet, Jiheng Zhang, Zhengyuan Zhou:
Single-Trajectory Distributionally Robust Reinforcement Learning. CoRR abs/2301.11721 (2023) - [i19]Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li:
Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning. CoRR abs/2304.04660 (2023) - [i18]Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao:
Learning Diverse Risk Preferences in Population-based Self-play. CoRR abs/2305.11476 (2023) - [i17]Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. CoRR abs/2305.17625 (2023) - [i16]Rui Yang, Yong Lin, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? CoRR abs/2305.18882 (2023) - 2022
- [j5]Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao:
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning. J. Artif. Intell. Res. 75: 569-595 (2022) - [j4]Xiaoteng Ma, Qing Li, Longhao Zou, Junkun Peng, Jianer Zhou, Jimeng Chai, Yong Jiang, Gabriel-Miro Muntean:
QAVA: QoE-Aware Adaptive Video Bitrate Aggregation for HTTP Live Streaming Based on Smart Edge Computing. IEEE Trans. Broadcast. 68(3): 661-676 (2022) - [j3]Xiaoteng Ma, Qing Li, Yong Jiang, Gabriel-Miro Muntean, Longhao Zou:
Learning-Based Joint QoE Optimization for Adaptive Video Streaming Based on Smart Edge. IEEE Trans. Netw. Serv. Manag. 19(2): 1789-1806 (2022) - [c14]Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Xiu Li:
Efficient Continuous Control with Double Actors and Regularized Critics. AAAI 2022: 7655-7663 - [c13]Xiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu:
Offline Reinforcement Learning with Value-based Episodic Memory. ICLR 2022 - [c12]Jiafei Lyu, Xiaoteng Ma, Xiu Li, Zongqing Lu:
Mildly Conservative Q-Learning for Offline Reinforcement Learning. NeurIPS 2022 - [c11]Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou:
Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping. NeurIPS 2022 - [c10]Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. NeurIPS 2022 - [c9]Shisong Tang, Qing Li, Xiaoteng Ma, Ci Gao, Dingmin Wang, Yong Jiang, Qian Ma, Aoyang Zhang, Hechang Chen:
Knowledge-based Temporal Fusion Network for Interpretable Online Video Popularity Prediction. WWW 2022: 2879-2887 - [c8]Junkun Peng, Qing Li, Xiaoteng Ma, Yong Jiang, Yutao Dong, Chuang Hu, Meng Chen:
MagNet: Cooperative Edge Caching by Automatic Content Congregating. WWW 2022: 3280-3288 - [i15]Shuai Ma, Xiaoteng Ma, Li Xia:
A unified algorithm framework for mean-variance optimization in discounted Markov decision processes. CoRR abs/2201.05737 (2022) - [i14]Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. CoRR abs/2206.02829 (2022) - [i13]Jiafei Lyu, Xiaoteng Ma, Xiu Li, Zongqing Lu:
Mildly Conservative Q-Learning for Offline Reinforcement Learning. CoRR abs/2206.04745 (2022) - [i12]Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao:
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning. CoRR abs/2206.07376 (2022) - [i11]Xiaoteng Ma, Zhipeng Liang, Jose H. Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou:
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation. CoRR abs/2209.06620 (2022) - [i10]Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou:
Exploiting Reward Shifting in Value-Based Deep RL. CoRR abs/2209.07288 (2022) - 2021
- [j2]Qiyuan Zhang, Xiaoteng Ma, Yiqin Yang, Chenghao Li, Jun Yang, Yu Liu, Bin Liang:
Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning. IEEE Robotics Autom. Lett. 6(4): 6601-6607 (2021) - [j1]Aoyang Zhang, Qing Li, Ying Chen, Xiaoteng Ma, Longhao Zou, Yong Jiang, Zhimin Xu, Gabriel-Miro Muntean:
Video Super-Resolution and Caching - An Edge-Assisted Adaptive Video Streaming Solution. IEEE Trans. Broadcast. 67(4): 799-812 (2021) - [c7]Xiaoteng Ma, Yiqin Yang, Chenghao Li, Yiwen Lu, Qianchuan Zhao, Jun Yang:
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning. AAMAS 2021: 853-861 - [c6]Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang, Qianchuan Zhao:
Average-Reward Reinforcement Learning with Trust Region Methods. IJCAI 2021: 2797-2803 - [c5]Yiqin Yang, Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao:
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning. NeurIPS 2021: 10299-10312 - [i9]Xiaoteng Ma, Yiqin Yang, Chenghao Li, Yiwen Lu, Qianchuan Zhao, Yang Jun:
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2102.06042 (2021) - [i8]Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Xiu Li:
Efficient Continuous Control with Double Actors and Regularized Critics. CoRR abs/2106.03050 (2021) - [i7]Yiqin Yang, Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao:
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning. CoRR abs/2106.03400 (2021) - [i6]Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang, Qianchuan Zhao:
Average-Reward Reinforcement Learning with Trust Region Methods. CoRR abs/2106.03442 (2021) - [i5]Kailai Sun, Xiaoteng Ma, Qianchuan Zhao, Peng Liu:
MGPSN: Motion-Guided Pseudo Siamese Network for Indoor Video Head Detection. CoRR abs/2110.03302 (2021) - [i4]Xiaoteng Ma, Yiqin Yang, Hao Hu, Qihan Liu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang:
Offline Reinforcement Learning with Value-based Episodic Memory. CoRR abs/2110.09796 (2021) - 2020
- [c4]Chenghao Li, Xiaoteng Ma, Li Xia, Qianchuan Zhao, Jun Yang:
Fairness Control of Traffic Light via Deep Reinforcement Learning. CASE 2020: 652-658 - [i3]Xiaoteng Ma, Qiyuan Zhang, Li Xia, Zhengyuan Zhou, Jun Yang, Qianchuan Zhao:
Distributional Soft Actor Critic for Risk Sensitive Learning. CoRR abs/2004.14547 (2020) - [i2]Ming Zhang, Yawei Wang, Xiaoteng Ma, Li Xia, Jun Yang, Zhiheng Li, Xiu Li:
Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration. CoRR abs/2006.03503 (2020) - [i1]Chenghao Li, Xiaoteng Ma, Chongjie Zhang, Jun Yang, Li Xia, Qianchuan Zhao:
SOAC: The Soft Option Actor-Critic Architecture. CoRR abs/2006.14363 (2020)
2010 – 2019
- 2019
- [c3]Teng Long, Xiaoteng Ma, Qing-Shan Jia:
Bi-level Proximal Policy optimization for Stochastic Coordination of EV Charging Load with Uncertain Wind Power. CCTA 2019: 302-307 - [c2]Xiaoteng Ma, Qing Li, Jimeng Chai, Xi Xiao, Shu-Tao Xia, Yong Jiang:
Steward: smart edge based joint QoE optimization for adaptive video streaming. NOSSDAV 2019: 31-36 - 2018
- [c1]Kailai Sun, Qianchuan Zhao, Jianhong Zou, Xiaoteng Ma:
Attendance and Security System Based on Building Video Surveillance. ICSCIB 2018: 153-162
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-21 20:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint