default search action

combined dblp search
author search
venue search
publication search

ask others

Xiaoteng Ma

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/JiangLMLYYLZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/JiangLMLYYLZ24
Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao:
Learning Diverse Risk Preferences in Population-Based Self-Play. AAAI 2024: 12910-12918
[c23]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/LiuYM00Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiuYM00Z24
Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang:
Efficient Multi-agent Reinforcement Learning by Planning. ICLR 2024
[c22]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/LyuMWL0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LyuMWL0L24
Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Li Xiu, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. ICLR 2024
[c21]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/LiangMB0ZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiangMB0ZZ24
Zhipeng Liang, Xiaoteng Ma, José H. Blanchet, Jun Yang, Jiheng Zhang, Zhengyuan Zhou:
Single-Trajectory Distributionally Robust Reinforcement Learning. ICML 2024
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/infocom/MaLPTYTMMM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/infocom/MaLPTYTMMM24
Xiaoteng Ma, Qing Li, Junkun Peng, Gareth Tyson, Ziwen Ye, Shisong Tang, Qian Ma, Shengbin Meng, Gabriel-Miro Muntean:
Smart Data-Driven Proactive Push to Edge Network for User-Generated Videos. INFOCOM 2024: 511-520
[c19]
- view
  - electronic edition @ usenix.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/usenix/Ye0QM0MMYM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/usenix/Ye0QM0MMYM24
Ziwen Ye, Qing Li, Chunyu Qiao, Xiaoteng Ma, Yong Jiang, Qian Ma, Shengbin Meng, Zhenhui Yuan, Zili Meng:
KEPC-Push: A Knowledge-Enhanced Proactive Content Push Strategy for Edge-Assisted Video Feed Streaming. USENIX ATC 2024: 321-338
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-03807
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-03807
Jiafei Lyu, Xiaoteng Ma, Le Wan, Runze Liu, Xiu Li, Zongqing Lu:
SEABO: A Simple Search-Based Method for Offline Imitation Learning. CoRR abs/2402.03807 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-11778
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-11778
Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang:
Efficient Multi-agent Reinforcement Learning by Planning. CoRR abs/2405.11778 (2024)
2023
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/eor/MaMX23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/eor/MaMX23
Shuai Ma, Xiaoteng Ma, Li Xia:
A unified algorithm framework for mean-variance optimization in discounted Markov decision processes. Eur. J. Oper. Res. 311(3): 1057-1067 (2023)
[j6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tbc/YeLMZJMYM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tbc/YeLMZJMYM23
Ziwen Ye, Qing Li, Xiaoteng Ma, Dan Zhao, Yong Jiang, Lianbo Ma, Bo Yi, Gabriel-Miro Muntean:
VRCT: A Viewport Reconstruction-Based 360° Video Caching Solution for Tile-Adaptive Streaming. IEEE Trans. Broadcast. 69(3): 691-703 (2023)
[c18]
- view
  authority control:
- export record
  dblp key:
  - conf/ecai/ZhangLMY0WL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ecai/ZhangLMY0WL23
Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li:
Uncertainty-Driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning. ECAI 2023: 3018-3025
[c17]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/YangYM0Z023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/YangYM0Z023
Rui Yang, Lin Yong, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? ICML 2023: 39543-39571
[c16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/MaMXZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/MaMXZ23
Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao:
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning (Extended Abstract). IJCAI 2023: 6925-6930
[c15]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/XuBMWZW0L23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/XuBMWZW0L23
Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. NeurIPS 2023
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-11721
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-11721
Zhipeng Liang, Xiaoteng Ma, Jose H. Blanchet, Jiheng Zhang, Zhengyuan Zhou:
Single-Trajectory Distributionally Robust Reinforcement Learning. CoRR abs/2301.11721 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-04660
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-04660
Junjie Zhang, Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Jun Yang, Le Wan, Xiu Li:
Uncertainty-driven Trajectory Truncation for Model-based Offline Reinforcement Learning. CoRR abs/2304.04660 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-11476
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-11476
Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao:
Learning Diverse Risk Preferences in Population-based Self-play. CoRR abs/2305.11476 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17625
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17625
Kang Xu, Chenjia Bai, Xiaoteng Ma, Dong Wang, Bin Zhao, Zhen Wang, Xuelong Li, Wei Li:
Cross-Domain Policy Adaptation via Value-Guided Data Filtering. CoRR abs/2305.17625 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18882
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18882
Rui Yang, Yong Lin, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? CoRR abs/2305.18882 (2023)
2022
[j5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/jair/MaMXZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jair/MaMXZ22
Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao:
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning. J. Artif. Intell. Res. 75: 569-595 (2022)
[j4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tbc/MaLZPZCJM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tbc/MaLZPZCJM22
Xiaoteng Ma, Qing Li, Longhao Zou, Junkun Peng, Jianer Zhou, Jimeng Chai, Yong Jiang, Gabriel-Miro Muntean:
QAVA: QoE-Aware Adaptive Video Bitrate Aggregation for HTTP Live Streaming Based on Smart Edge Computing. IEEE Trans. Broadcast. 68(3): 661-676 (2022)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tnsm/MaLJMZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tnsm/MaLJMZ22
Xiaoteng Ma, Qing Li, Yong Jiang, Gabriel-Miro Muntean, Longhao Zou:
Learning-Based Joint QoE Optimization for Adaptive Video Streaming Based on Smart Edge. IEEE Trans. Netw. Serv. Manag. 19(2): 1789-1806 (2022)
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LyuMYL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LyuMYL22
Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Xiu Li:
Efficient Continuous Control with Double Actors and Regularized Critics. AAAI 2022: 7655-7663
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iclr/MaYH0ZZLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MaYH0ZZLL22
Xiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu:
Offline Reinforcement Learning with Value-based Episodic Memory. ICLR 2022
[c12]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/LyuMLL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LyuMLL22
Jiafei Lyu, Xiaoteng Ma, Xiu Li, Zongqing Lu:
Mildly Conservative Q-Learning for Offline Reinforcement Learning. NeurIPS 2022
[c11]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/SunH0MGZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/SunH0MGZ22
Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou:
Exploit Reward Shifting in Value-Based Deep-RL: Optimistic Curiosity-Based Exploration and Conservative Exploitation via Linear Reward Shaping. NeurIPS 2022
[c10]
- view
  - electronic edition @ nips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/YangBMWZH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangBMWZH22
Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. NeurIPS 2022
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/www/TangLMGWJMZC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/TangLMGWJMZC22
Shisong Tang, Qing Li, Xiaoteng Ma, Ci Gao, Dingmin Wang, Yong Jiang, Qian Ma, Aoyang Zhang, Hechang Chen:
Knowledge-based Temporal Fusion Network for Interpretable Online Video Popularity Prediction. WWW 2022: 2879-2887
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/www/PengLMJDHC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/www/PengLMJDHC22
Junkun Peng, Qing Li, Xiaoteng Ma, Yong Jiang, Yutao Dong, Chuang Hu, Meng Chen:
MagNet: Cooperative Edge Caching by Automatic Content Congregating. WWW 2022: 3280-3288
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2201-05737
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-05737
Shuai Ma, Xiaoteng Ma, Li Xia:
A unified algorithm framework for mean-variance optimization in discounted Markov decision processes. CoRR abs/2201.05737 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-02829
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-02829
Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. CoRR abs/2206.02829 (2022)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-04745
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-04745
Jiafei Lyu, Xiaoteng Ma, Xiu Li, Zongqing Lu:
Mildly Conservative Q-Learning for Offline Reinforcement Learning. CoRR abs/2206.04745 (2022)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-07376
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-07376
Xiaoteng Ma, Shuai Ma, Li Xia, Qianchuan Zhao:
Mean-Semivariance Policy Optimization via Risk-Averse Reinforcement Learning. CoRR abs/2206.07376 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-06620
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-06620
Xiaoteng Ma, Zhipeng Liang, Jose H. Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou:
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation. CoRR abs/2209.06620 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-07288
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-07288
Hao Sun, Lei Han, Rui Yang, Xiaoteng Ma, Jian Guo, Bolei Zhou:
Exploiting Reward Shifting in Value-Based Deep RL. CoRR abs/2209.07288 (2022)
2021
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/ral/ZhangMYLYLL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ral/ZhangMYLYLL21
Qiyuan Zhang, Xiaoteng Ma, Yiqin Yang, Chenghao Li, Jun Yang, Yu Liu, Bin Liang:
Learning to Discover Task-Relevant Features for Interpretable Reinforcement Learning. IEEE Robotics Autom. Lett. 6(4): 6601-6607 (2021)
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/tbc/ZhangLCMZJXM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tbc/ZhangLCMZJXM21
Aoyang Zhang, Qing Li, Ying Chen, Xiaoteng Ma, Longhao Zou, Yong Jiang, Zhimin Xu, Gabriel-Miro Muntean:
Video Super-Resolution and Caching - An Edge-Assisted Adaptive Video Streaming Solution. IEEE Trans. Broadcast. 67(4): 799-812 (2021)
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/atal/MaY0LZY21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/atal/MaY0LZY21
Xiaoteng Ma, Yiqin Yang, Chenghao Li, Yiwen Lu, Qianchuan Zhao, Jun Yang:
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning. AAMAS 2021: 853-861
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/ijcai/MaTX0Z21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/MaTX0Z21
Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang, Qianchuan Zhao:
Average-Reward Reinforcement Learning with Trust Region Methods. IJCAI 2021: 2797-2803
[c5]
- view
  - electronic edition @ neurips.cc (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/nips/YangMLZZHYZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangMLZZHYZ21
Yiqin Yang, Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao:
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning. NeurIPS 2021: 10299-10312
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-06042
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-06042
Xiaoteng Ma, Yiqin Yang, Chenghao Li, Yiwen Lu, Qianchuan Zhao, Yang Jun:
Modeling the Interaction between Agents in Cooperative Multi-Agent Reinforcement Learning. CoRR abs/2102.06042 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-03050
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-03050
Jiafei Lyu, Xiaoteng Ma, Jiangpeng Yan, Xiu Li:
Efficient Continuous Control with Double Actors and Regularized Critics. CoRR abs/2106.03050 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-03400
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-03400
Yiqin Yang, Xiaoteng Ma, Chenghao Li, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao:
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning. CoRR abs/2106.03400 (2021)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-03442
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-03442
Xiaoteng Ma, Xiaohang Tang, Li Xia, Jun Yang, Qianchuan Zhao:
Average-Reward Reinforcement Learning with Trust Region Methods. CoRR abs/2106.03442 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-03302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-03302
Kailai Sun, Xiaoteng Ma, Qianchuan Zhao, Peng Liu:
MGPSN: Motion-Guided Pseudo Siamese Network for Indoor Video Head Detection. CoRR abs/2110.03302 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-09796
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-09796
Xiaoteng Ma, Yiqin Yang, Hao Hu, Qihan Liu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang:
Offline Reinforcement Learning with Value-based Episodic Memory. CoRR abs/2110.09796 (2021)
2020
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/case/0002MXZY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/case/0002MXZY20
Chenghao Li, Xiaoteng Ma, Li Xia, Qianchuan Zhao, Jun Yang:
Fairness Control of Traffic Light via Deep Reinforcement Learning. CASE 2020: 652-658
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2004-14547
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-14547
Xiaoteng Ma, Qiyuan Zhang, Li Xia, Zhengyuan Zhou, Jun Yang, Qianchuan Zhao:
Distributional Soft Actor Critic for Risk Sensitive Learning. CoRR abs/2004.14547 (2020)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-03503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-03503
Ming Zhang, Yawei Wang, Xiaoteng Ma, Li Xia, Jun Yang, Zhiheng Li, Xiu Li:
Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration. CoRR abs/2006.03503 (2020)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2006-14363
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-14363
Chenghao Li, Xiaoteng Ma, Chongjie Zhang, Jun Yang, Li Xia, Qianchuan Zhao:
SOAC: The Soft Option Actor-Critic Architecture. CoRR abs/2006.14363 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/ccta/LongMJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ccta/LongMJ19
Teng Long, Xiaoteng Ma, Qing-Shan Jia:
Bi-level Proximal Policy optimization for Stochastic Coordination of EV Charging Load with Uncertain Wind Power. CCTA 2019: 302-307
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/nossdav/MaLCXXJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nossdav/MaLCXXJ19
Xiaoteng Ma, Qing Li, Jimeng Chai, Xi Xiao, Shu-Tao Xia, Yong Jiang:
Steward: smart edge based joint QoE optimization for adaptive video streaming. NOSSDAV 2019: 31-36
2018
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icscib/SunZZM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icscib/SunZZM18
Kailai Sun, Qianchuan Zhao, Jianhong Zou, Xiaoteng Ma:
Attendance and Security System Based on Building Video Surveillance. ICSCIB 2018: 153-162

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.