Yali Du 0001
Person information
- affiliation: King's College London, London, UK
- affiliation (PhD 2019): University of Technology Sydney, Faculty of Engineering and Information Technology, Ultimo, NSW, Australia
Other persons with the same name
- Yali Du 0002 — Nanjing University, China
- Yali Du 0003 — Peking University Third Hospital, Peking University, Beijing, China
- Yali Du 0004 — Xi'an Jiaotong University, Shaanxi Engineering Research Center of Nondestructive Testing and Structural Integrity Evaluation, China
- Yali Du 0005 — Hebei University of Technology, Tianjin, China
- Yali Du 0006 — Xi'an Polytechnic University, Xi'an, China
- Yali Du 0008 — Shandong University, Qingdao, China
2020 – today
- 2024
- [j12] Ming Yang, Kaiyan Zhao, Yiming Wang, Renzhi Dong, Yali Du, Furui Liu, Mingliang Zhou, Leong Hou U: Team-wise effective communication in multi-agent reinforcement learning. Auton. Agents Multi Agent Syst. 38(2): 36 (2024)
- [j11] Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan: Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination. J. Artif. Intell. Res. 80: 1139-1185 (2024)
- [j10] Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Alois Knoll: A Review of Safe Reinforcement Learning: Methods, Theories, and Applications. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11216-11235 (2024)
- [j9] Xingzhou Lou, Junge Zhang, Yali Du, Chao Yu, Zhaofeng He, Kaiqi Huang: Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games. IEEE Trans. Games 16(2): 470-482 (2024)
- [c47] Sirui Chen, Zhaowei Zhang, Yaodong Yang, Yali Du: STAS: Spatial-Temporal Return Decomposition for Solving Sparse Rewards Problems in Multi-agent Reinforcement Learning. AAAI 2024: 17337-17345
- [c46] Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du: TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient. AAAI 2024: 17496-17504
- [c45] Zijing Shi, Meng Fang, Ling Chen, Yali Du, Jun Wang: Human-Guided Moral Decision Making in Text-Based Games. AAAI 2024: 21574-21582
- [c44] Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du: Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models. AAMAS 2024: 1274-1282
- [c43] Stefan Roesch, Stefanos Leonardos, Yali Du: The Selfishness Level of Social Dilemmas. AAMAS 2024: 2441-2443
- [c42] Mark Towers, Yali Du, Christopher T. Freeman, Timothy J. Norman: Explaining an Agent's Future Beliefs Through Temporally Decomposing Future Reward Estimators. ECAI 2024: 2790-2797
- [c41] Mark Towers, Yali Du, Christopher T. Freeman, Timothy J. Norman: Temporal Explanations of Deep Reinforcement Learning Agents. EXTRAAMAS 2024: 99-115
- [c40] Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li: PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation. ICML 2024
- [c39] Wenxi Wu, Fabio Pierazzi, Yali Du, Martim Brandão: Characterizing Physical Adversarial Attacks on Robot Motion Planners. ICRA 2024: 14319-14325
- [c38] Jinyu Cai, Yunhe Zhang, Jicong Fan, Yali Du, Wenzhong Guo: Dual Contrastive Graph-Level Clustering with Multiple Cluster Perspectives Alignment. IJCAI 2024: 3770-3779
- [c37] Ruiqing Chen, Xiaoyuan Zhang, Yali Du, Yifan Zhong, Zheng Tian, Fanglei Sun, Yaodong Yang: Off-Agent Trust Region Policy Optimization. IJCAI 2024: 3798-3806
- [i44] Xingzhou Lou, Junge Zhang, Ziyan Wang, Kaiqi Huang, Yali Du: Safe Reinforcement Learning with Free-form Natural Language Constraints and Pre-Trained Language Models. CoRR abs/2401.07553 (2024)
- [i43] Nam Phuong Tran, The-Anh Ta, Shuqing Shi, Debmalya Mandal, Yali Du, Long Tran-Thanh: Learning the Expected Core of Strictly Convex Stochastic Cooperative Games. CoRR abs/2402.07067 (2024)
- [i42] Xidong Feng, Ziyu Wan, Mengyue Yang, Ziyan Wang, Girish A. Koushik, Yali Du, Ying Wen, Jun Wang: Natural Language Reinforcement Learning. CoRR abs/2402.07157 (2024)
- [i41] Zhixun Chen, Yali Du, David Mguni: All Language Models Large and Small. CoRR abs/2402.12061 (2024)
- [i40] Yang Li, Wenhao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan: Aligning Individual and Collective Objectives in Multi-Agent Cooperation. CoRR abs/2402.12416 (2024)
- [i39] Zangir Iklassov, Yali Du, Farkhad Akimov, Martin Takác: Self-Guiding Exploration for Combinatorial Problems. CoRR abs/2405.17950 (2024)
- [i38] Xuanfa Jin, Ziyan Wang, Yali Du, Meng Fang, Haifeng Zhang, Jun Wang: Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf. CoRR abs/2405.19946 (2024)
- [i37] Ziyan Wang, Meng Fang, Tristan Tomilin, Fei Fang, Yali Du: Safe Multi-agent Reinforcement Learning with Natural Language Constraints. CoRR abs/2405.20018 (2024)
- [i36] Mark Towers, Yali Du, Christopher T. Freeman, Timothy J. Norman: Explaining an Agent's Future Beliefs through Temporally Decomposing Future Reward Estimators. CoRR abs/2408.08230 (2024)
- [i35] Ruiqi Zhang, Jing Hou, Florian Walter, Shangding Gu, Jiayi Guan, Florian Röhrbein, Yali Du, Panpan Cai, Guang Chen, Alois Knoll: Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey. CoRR abs/2408.09675 (2024)
- 2023
- [j8] Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Gangyan Xu, Chengqi Zhang: Shared dynamics learning for large-scale traveling salesman problem. Adv. Eng. Informatics 56: 102005 (2023)
- [j7] Shangding Gu, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois C. Knoll, Yaodong Yang: Safe multi-agent reinforcement learning for multi-robot control. Artif. Intell. 319: 103905 (2023)
- [j6] Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Jan Peters, Alois Knoll: A human-centered safe robot reinforcement learning framework with interactive behaviors. Frontiers Neurorobotics 17 (2023)
- [c36] Yali Du: Cooperative Multi-Agent Learning in a Complex World: Challenges and Solutions. AAAI 2023: 15436
- [c35] Zhijian Duan, Wenhan Huang, Dinghuai Zhang, Yali Du, Jun Wang, Yaodong Yang, Xiaotie Deng: Is Nash Equilibrium Approximator Learnable? AAMAS 2023: 233-241
- [c34] Xingzhou Lou, Jiaxian Guo, Junge Zhang, Jun Wang, Kaiqi Huang, Yali Du: PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination. AAMAS 2023: 679-688
- [c33] Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang: Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank. CIKM 2023: 1004-1013
- [c32] Ming Yang, Renzhi Dong, Yiming Wang, Furui Liu, Yali Du, Mingliang Zhou, Leong Hou U: TieComm: Learning a Hierarchical Communication Topology Based on Tie Theory. DASFAA (1) 2023: 604-613
- [c31] Zijing Shi, Meng Fang, Yunqiu Xu, Ling Chen, Yali Du: Stay Moral and Explore: Learn to Behave Morally in Text-based Games. ICLR 2023
- [c30] Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan: Cooperative Open-ended Learning Framework for Zero-Shot Coordination. ICML 2023: 20470-20484
- [c29] Yabin Zhang, Weiqi Shao, Xu Chen, Yali Du, Xiaoxiao Xu, Dong Zheng, Changhua Pei, Shuai Zhang, Peng Jiang, Kun Gai: A Multi-Agent Framework for Recommendation with Heterogeneous Sources. IJCNN 2023: 1-8
- [c28] Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy: Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach. NeurIPS 2023
- [c27] Shutong Ding, Jingya Wang, Yali Du, Ye Shi: Reduced Policy Optimization for Continuous Control with Hard Constraints. NeurIPS 2023
- [c26] Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang: ChessGPT: Bridging Policy Learning and Language Modeling. NeurIPS 2023
- [c25] Xue Yan, Jiaxian Guo, Xingzhou Lou, Jun Wang, Haifeng Zhang, Yali Du: An Efficient End-to-End Training Approach for Zero-Shot Human-AI Coordination. NeurIPS 2023
- [c24] Mengyue Yang, Yonggang Zhang, Zhen Fang, Yali Du, Furui Liu, Jean-Francois Ton, Jianhong Wang, Jun Wang: Invariant Learning via Probability of Sufficient and Necessary Causes. NeurIPS 2023
- [i34] Xingzhou Lou, Jiaxian Guo, Junge Zhang, Jun Wang, Kaiqi Huang, Yali Du: PECAN: Leveraging Policy Ensemble for Context-Aware Zero-Shot Human-AI Coordination. CoRR abs/2301.06387 (2023)
- [i33] Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni: Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning. CoRR abs/2302.03439 (2023)
- [i32] Yang Li, Shao Zhang, Jichen Sun, Yali Du, Ying Wen, Xinbing Wang, Wei Pan: Cooperative Open-ended Learning Framework for Zero-shot Coordination. CoRR abs/2302.04831 (2023)
- [i31] Shangding Gu, Alap Kshirsagar, Yali Du, Guang Chen, Yaodong Yang, Jan Peters, Alois C. Knoll: A Human-Centered Safe Robot Reinforcement Learning Framework with Interactive Behaviors. CoRR abs/2302.13137 (2023)
- [i30] Sirui Chen, Zhaowei Zhang, Yali Du, Yaodong Yang: STAS: Spatial-Temporal Return Decomposition for Multi-agent Reinforcement Learning. CoRR abs/2304.07520 (2023)
- [i29] Liting Chen, Lu Wang, Hang Dong, Yali Du, Jie Yan, Fangkai Yang, Shuang Li, Pu Zhao, Si Qin, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang: Introspective Tips: Large Language Model for In-Context Decision Making. CoRR abs/2305.11598 (2023)
- [i28] Yudi Zhang, Yali Du, Biwei Huang, Ziyan Wang, Jun Wang, Meng Fang, Mykola Pechenizkiy: GRD: A Generative Approach for Interpretable Reward Redistribution in Reinforcement Learning. CoRR abs/2305.18427 (2023)
- [i27] Yang Li, Shao Zhang, Jichen Sun, Wenhao Zhang, Yali Du, Ying Wen, Xinbing Wang, Wei Pan: Tackling Cooperative Incompatibility for Zero-Shot Human-AI Coordination. CoRR abs/2306.03034 (2023)
- [i26] Runze Liu, Yali Du, Fengshuo Bai, Jiafei Lyu, Xiu Li: Zero-shot Preference Learning for Offline RL via Optimal Transport. CoRR abs/2306.03615 (2023)
- [i25] Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang: ChessGPT: Bridging Policy Learning and Language Modeling. CoRR abs/2306.09200 (2023)
- [i24] Jiarui Jin, Xianyu Chen, Weinan Zhang, Mengyue Yang, Yang Wang, Yali Du, Yong Yu, Jun Wang: Replace Scoring with Arrangement: A Contextual Set-to-Arrangement Framework for Learning-to-Rank. CoRR abs/2308.02860 (2023)
- [i23] Mengyue Yang, Zhen Fang, Yonggang Zhang, Yali Du, Furui Liu, Jean-Francois Ton, Jun Wang: Invariant Learning via Probability of Sufficient and Necessary Causes. CoRR abs/2309.12559 (2023)
- [i22] Shutong Ding, Jingya Wang, Yali Du, Ye Shi: Reduced Policy Optimization for Continuous Control with Hard Constraints. CoRR abs/2310.09574 (2023)
- [i21] Richard Willis, Yali Du, Joel Z. Leibo, Michael Luck: Resolving social dilemmas with minimal reward transfer. CoRR abs/2310.12928 (2023)
- [i20] Ziyan Wang, Yali Du, Yudi Zhang, Meng Fang, Biwei Huang: MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment. CoRR abs/2312.03644 (2023)
- [i19] Yali Du, Joel Z. Leibo, Usman Islam, Richard Willis, Peter Sunehag: A Review of Cooperation in Multi-agent Learning. CoRR abs/2312.05162 (2023)
- [i18] Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du: TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient. CoRR abs/2312.15667 (2023)
- [i17] Zijing Shi, Meng Fang, Shunfeng Zheng, Shilong Deng, Ling Chen, Yali Du: Cooperation on the Fly: Exploring Language Agents for Ad Hoc Teamwork in the Avalon Game. CoRR abs/2312.17515 (2023)
- 2022
- [j5] Elizabeth Black, Martim Brandão, Oana Cocarascu, Bart de Keijzer, Yali Du, Derek Long, Michael Luck, Peter McBurney, Albert Meroño-Peñuela, Simon Miles, Sanjay Modgil, Luc Moreau, Maria Polukarov, Odinaldo Rodrigues, Carmine Ventre: Reasoning and interaction for social artificial intelligence. AI Commun. 35(4): 309-325 (2022)
- [j4] Tianhong Dai, Yali Du, Meng Fang, Anil Anthony Bharath: Diversity-augmented intrinsic motivation for deep reinforcement learning. Neurocomputing 468: 396-406 (2022)
- [j3] Yunqiu Xu, Meng Fang, Ling Chen, Gangyan Xu, Yali Du, Chengqi Zhang: Reinforcement Learning With Multiple Relational Attention for Solving Vehicle Routing Problems. IEEE Trans. Cybern. 52(10): 11107-11120 (2022)
- [c23] Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen: Learning to Identify Top Elo Ratings: A Dueling Bandits Approach. AAAI 2022: 8797-8805
- [c22] Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang: Perceiving the World: Question-guided Reinforcement Learning for Text-based Games. ACL (1) 2022: 538-560
- [c21] Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu: GCS: Graph-Based Coordination Strategy for Multi-Agent Reinforcement Learning. AAMAS 2022: 1128-1136
- [c20] Ilias Kazantzidis, Timothy J. Norman, Yali Du, Christopher T. Freeman: How to Train Your Agent: Active Learning from Human Preferences and Justifications in Safety-critical Environments. AAMAS 2022: 1654-1656
- [c19] Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang: Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. ICLR 2022
- [c18] Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang: Scalable Model-based Policy Optimization for Decentralized Networked Systems. IROS 2022: 9019-9026
- [c17] Runze Liu, Fengshuo Bai, Yali Du, Yaodong Yang: Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning. NeurIPS 2022
- [i16] Xue Yan, Yali Du, Binxin Ru, Jun Wang, Haifeng Zhang, Xu Chen: Learning to Identify Top Elo Ratings: A Dueling Bandits Approach. CoRR abs/2201.04480 (2022)
- [i15] Jingqing Ruan, Yali Du, Xuantang Xiong, Dengpeng Xing, Xiyun Li, Linghui Meng, Haifeng Zhang, Jun Wang, Bo Xu: GCS: Graph-based Coordination Strategy for Multi-Agent Reinforcement Learning. CoRR abs/2201.06257 (2022)
- [i14] Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang: Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL. CoRR abs/2202.04478 (2022)
- [i13] Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang: Perceiving the World: Question-guided Reinforcement Learning for Text-based Games. CoRR abs/2204.09597 (2022)
- [i12] Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois C. Knoll: A Review of Safe Reinforcement Learning: Methods, Theory and Applications. CoRR abs/2205.10330 (2022)
- [i11] Yali Du, Chengdong Ma, Yuchen Liu, Runji Lin, Hao Dong, Jun Wang, Yaodong Yang: Fully Decentralized Model-based Policy Optimization for Networked Systems. CoRR abs/2207.06559 (2022)
- [i10] Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang: Contextual Transformer for Offline Meta Reinforcement Learning. CoRR abs/2211.08016 (2022)
- [i9] Yang Yang, Hongjian Sun, Jialei Gong, Yali Du, Di Yu: Interpretable Dimensionality Reduction by Feature Preserving Manifold Approximation and Projection. CoRR abs/2211.09321 (2022)
- 2021
- [c16] Yali Du, Bo Liu, Vincent Moens, Ziqi Liu, Zhicheng Ren, Jun Wang, Xu Chen, Haifeng Zhang: Learning Correlated Communication Topology in Multi-Agent Reinforcement learning. AAMAS 2021: 456-464
- [c15] Liheng Chen, Hongyi Guo, Yali Du, Fei Fang, Haifeng Zhang, Weinan Zhang, Yong Yu: Signal Instructed Coordination in Cooperative Multi-agent Reinforcement Learning. DAI 2021: 185-205
- [c14] Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Chengqi Zhang: Generalization in Text-based Games via Hierarchical Reinforcement Learning. EMNLP (Findings) 2021: 1343-1353
- [c13] Yali Du, Xue Yan, Xu Chen, Jun Wang, Haifeng Zhang: Estimating α-Rank from A Few Entries with Low Rank Matrix Completion. ICML 2021: 2870-2879
- [c12] David Henry Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang: Learning in Nonzero-Sum Stochastic Games with Potentials. ICML 2021: 7688-7699
- [c11] Xiaoqiang Wang, Yali Du, Shengyu Zhu, Liangjun Ke, Zhitang Chen, Jianye Hao, Jun Wang: Ordering-Based Causal Discovery with Reinforcement Learning. IJCAI 2021: 3566-3573
- [c10] Xu Chen, Yali Du, Long Xia, Jun Wang: Reinforcement Recommendation with User Multi-aspect Preference. WWW 2021: 425-435
- [i8] David Mguni, Jianhong Wang, Taher Jafferjee, Nicolas Perez Nieves, Wenbin Song, Yaodong Yang, Feifei Tong, Hui Chen, Jiangcheng Zhu, Yali Du, Jun Wang: Learning to Shape Rewards using a Game of Switching Controls. CoRR abs/2103.09159 (2021)
- [i7] David Mguni, Yutong Wu, Yali Du, Yaodong Yang, Ziyi Wang, Minne Li, Ying Wen, Joel Jennings, Jun Wang: Learning in Nonzero-Sum Stochastic Games with Potentials. CoRR abs/2103.09284 (2021)
- [i6] Xiaoqiang Wang, Yali Du, Shengyu Zhu, Liangjun Ke, Zhitang Chen, Jianye Hao, Jun Wang: Ordering-Based Causal Discovery with Reinforcement Learning. CoRR abs/2105.06631 (2021)
- [i5] Rui Yang, Meng Fang, Lei Han, Yali Du, Feng Luo, Xiu Li: MHER: Model-based Hindsight Experience Replay. CoRR abs/2107.00306 (2021)
- [i4] Zhijian Duan, Yali Du, Jun Wang, Xiaotie Deng: Learning to Compute Approximate Nash Equilibrium for Normal-form Games. CoRR abs/2108.07472 (2021)
- [i3] Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Chengqi Zhang: Generalization in Text-based Games via Hierarchical Reinforcement Learning. CoRR abs/2109.09968 (2021)
- 2020
- [c9] Yifan Zhao, Gangyan Xu, Yali Du, Meng Fang: Learning Multi-Agent Communication with Policy Fingerprints for Adaptive Traffic Signal Control. CASE 2020: 266-273
- [c8] Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang: Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games. NeurIPS 2020
- [i2] Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou, Chengqi Zhang: Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games. CoRR abs/2010.11655 (2020)
2010 – 2019
- 2019
- [b1] Yali Du: Design and evaluation of factorization-based algorithms for user preference analysis. University of Technology Sydney, Australia, 2019
- [j2] Yali Du, Meng Fang, Jinfeng Yi, Chang Xu, Jun Cheng, Dacheng Tao: Enhancing the Robustness of Neural Collaborative Filtering Systems Under Malicious Attacks. IEEE Trans. Multim. 21(3): 555-565 (2019)
- [c7] Lei Han, Peng Sun, Yali Du, Jiechao Xiong, Qing Wang, Xinghai Sun, Han Liu, Tong Zhang: Grid-Wise Control for Multi-Agent Reinforcement Learning in Video Game AI. ICML 2019: 2576-2585
- [c6] Xun Wang, Yali Du, Leimin Zhang, Xirong Li, Miao Zhang, Jianfeng Dong: Exploring Content-based Video Relevance for Video Click-Through Rate Prediction. ACM Multimedia 2019: 2602-2606
- [c5] Yali Du, Lei Han, Meng Fang, Ji Liu, Tianhong Dai, Dacheng Tao: LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning. NeurIPS 2019: 4405-4416
- [c4] Meng Fang, Tianyi Zhou, Yali Du, Lei Han, Zhengyou Zhang: Curriculum-guided Hindsight Experience Replay. NeurIPS 2019: 12602-12613
- 2018
- [j1] Yali Du, Chang Xu, Dacheng Tao: Matrix Factorization for Collaborative Budget Allocation. IEEE Trans Autom. Sci. Eng. 15(4): 1471-1482 (2018)
- [c3] Yali Du, Meng Fang, Jinfeng Yi, Jun Cheng, Dacheng Tao: Towards Query Efficient Black-box Attacks: An Input-free Perspective. AISec@CCS 2018: 13-24
- [i1] Yali Du, Meng Fang, Jinfeng Yi, Jun Cheng, Dacheng Tao: Towards Query Efficient Black-box Attacks: An Input-free Perspective. CoRR abs/1809.02918 (2018)
- 2017
- [c2] Yali Du, Chang Xu, Dacheng Tao: Privileged Matrix Factorization for Collaborative Filtering. IJCAI 2017: 1610-1616
- [c1] Yali Du, Chang Xu, Dacheng Tao: Collaborative Rating Allocation. IJCAI 2017: 1617-1623
last updated on 2024-11-22 20:40 CET by the dblp team
all metadata released as open data under CC0 1.0 license