default search action
Chongjie Zhang
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j6]Siyuan Li, Hao Li, Jin Zhang, Zhen Wang, Peng Liu, Chongjie Zhang:
IOB: integrating optimization transfer and behavior transfer for multi-policy reuse. Auton. Agents Multi Agent Syst. 38(1): 3 (2024) - [c77]Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang:
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption. ICLR 2024 - [c76]Heng Dong, Junyu Zhang, Chongjie Zhang:
Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design. ICLR 2024 - [c75]Yuyang Liu, Weijun Dong, Yingdong Hu, Chuan Wen, Zhao-Heng Yin, Chongjie Zhang, Yang Gao:
Imitation Learning from Observation with Automatic Discount Scheduling. ICLR 2024 - [c74]Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang:
Efficient Multi-agent Reinforcement Learning by Planning. ICLR 2024 - [c73]Yihuan Mao, Chengjie Wu, Xi Chen, Hao Hu, Ji Jiang, Tianze Zhou, Tangjie Lv, Changjie Fan, Zhipeng Hu, Yi Wu, Yujing Hu, Chongjie Zhang:
Stylized Offline Reinforcement Learning: Extracting Diverse High-Quality Behaviors from Heterogeneous Datasets. ICLR 2024 - [c72]Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang:
Bayesian Design Principles for Offline-to-Online Reinforcement Learning. ICML 2024 - [c71]Chengjie Wu, Hao Hu, Yiqin Yang, Ning Zhang, Chongjie Zhang:
Planning, Fast and Slow: Online Reinforcement Learning with Action-Free Offline Data via Multiscale Planners. ICML 2024 - [c70]Chao Li, Yujing Hu, Shangdong Yang, Tangjie Lv, Changjie Fan, Wenbin Li, Chongjie Zhang, Yang Gao:
STAR: Spatio-Temporal State Compression for Multi-Agent Tasks with Rich Observations. IJCAI 2024: 120-128 - [i59]Michael Lanier, Ying Xu, Nathan Jacobs, Chongjie Zhang, Yevgeniy Vorobeychik:
Learning Interpretable Policies in Hindsight-Observable POMDPs through Partially Supervised Reinforcement Learning. CoRR abs/2402.09290 (2024) - [i58]Qihan Liu, Jianing Ye, Xiaoteng Ma, Jun Yang, Bin Liang, Chongjie Zhang:
Efficient Multi-agent Reinforcement Learning by Planning. CoRR abs/2405.11778 (2024) - [i57]Hao Hu, Yiqin Yang, Jianing Ye, Chengjie Wu, Ziqing Mai, Yujing Hu, Tangjie Lv, Changjie Fan, Qianchuan Zhao, Chongjie Zhang:
Bayesian Design Principles for Offline-to-Online Reinforcement Learning. CoRR abs/2405.20984 (2024) - [i56]Anindya Sarkar, Srikumar Sastry, Aleksis Pirinen, Chongjie Zhang, Nathan Jacobs, Yevgeniy Vorobeychik:
GOMAA-Geo: GOal Modality Agnostic Active Geo-localization. CoRR abs/2406.01917 (2024) - 2023
- [j5]Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye:
A Survey on Transformers in Reinforcement Learning. Trans. Mach. Learn. Res. 2023 (2023) - [c69]Yiqin Yang, Hao Hu, Wenzhe Li, Siyuan Li, Jun Yang, Qianchuan Zhao, Chongjie Zhang:
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery. AAAI 2023: 10843-10851 - [c68]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
The Provable Benefit of Unsupervised Data Sharing for Offline Reinforcement Learning. ICLR 2023 - [c67]Heng Dong, Junyu Zhang, Tonghan Wang, Chongjie Zhang:
Symmetry-Aware Robot Design with Structured Subgroups. ICML 2023: 8334-8355 - [c66]Jianhao Wang, Jin Zhang, Haozhe Jiang, Junyu Zhang, Liwei Wang, Chongjie Zhang:
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation. ICML 2023: 36626-36669 - [c65]Rui Yang, Lin Yong, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? ICML 2023: 39543-39571 - [c64]Ruiqi Zhu, Siyuan Li, Tianhong Dai, Chongjie Zhang, Oya Çeliktutan:
Learning to Solve Tasks with Exploring Prior Behaviours. IROS 2023: 7501-7507 - [c63]Hao Hu, Yiqin Yang, Jianing Ye, Ziqing Mai, Chongjie Zhang:
Unsupervised Behavior Extraction via Random Intent Priors. NeurIPS 2023 - [c62]Chengjie Wu, Pingzhong Tang, Jun Yang, Yujing Hu, Tangjie Lv, Changjie Fan, Chongjie Zhang:
Conservative Offline Policy Adaptation in Multi-Agent Games. NeurIPS 2023 - [i55]Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye:
A Survey on Transformers in Reinforcement Learning. CoRR abs/2301.03044 (2023) - [i54]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning. CoRR abs/2302.13493 (2023) - [i53]Rui Yang, Yong Lin, Xiaoteng Ma, Hao Hu, Chongjie Zhang, Tong Zhang:
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? CoRR abs/2305.18882 (2023) - [i52]Jianhao Wang, Jin Zhang, Haozhe Jiang, Junyu Zhang, Liwei Wang, Chongjie Zhang:
Offline Meta Reinforcement Learning with In-Distribution Online Adaptation. CoRR abs/2305.19529 (2023) - [i51]Heng Dong, Junyu Zhang, Tonghan Wang, Chongjie Zhang:
Symmetry-Aware Robot Design with Structured Subgroups. CoRR abs/2306.00036 (2023) - [i50]Ruiqi Zhu, Siyuan Li, Tianhong Dai, Chongjie Zhang, Oya Çeliktutan:
Learning to Solve Tasks with Exploring Prior Behaviours. CoRR abs/2307.02889 (2023) - [i49]Siyuan Li, Hao Li, Jin Zhang, Zhen Wang, Peng Liu, Chongjie Zhang:
IOB: Integrating Optimization Transfer and Behavior Transfer for Multi-Policy Reuse. CoRR abs/2308.07351 (2023) - [i48]Chenghao Li, Tonghan Wang, Chongjie Zhang, Qianchuan Zhao:
Never Explore Repeatedly in Multi-Agent Reinforcement Learning. CoRR abs/2308.09909 (2023) - [i47]Yuyang Liu, Weijun Dong, Yingdong Hu, Chuan Wen, Zhao-Heng Yin, Chongjie Zhang, Yang Gao:
Imitation Learning from Observation with Automatic Discount Scheduling. CoRR abs/2310.07433 (2023) - [i46]Rui Yang, Han Zhong, Jiawei Xu, Amy Zhang, Chongjie Zhang, Lei Han, Tong Zhang:
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption. CoRR abs/2310.12955 (2023) - [i45]Hao Hu, Yiqin Yang, Jianing Ye, Ziqing Mai, Chongjie Zhang:
Unsupervised Behavior Extraction via Random Intent Priors. CoRR abs/2310.18687 (2023) - [i44]Heng Dong, Junyu Zhang, Chongjie Zhang:
Leveraging Hyperbolic Embeddings for Coarse-to-Fine Robot Design. CoRR abs/2311.00462 (2023) - 2022
- [c61]Lei Yuan, Jianhao Wang, Fuxiang Zhang, Chenghe Wang, Zongzhang Zhang, Yang Yu, Chongjie Zhang:
Multi-Agent Incentive Communication via Decentralized Teammate Modeling. AAAI 2022: 9466-9474 - [c60]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. ICLR 2022 - [c59]Siyuan Li, Jin Zhang, Jianhao Wang, Yang Yu, Chongjie Zhang:
Active Hierarchical Exploration with Stable Subgoal Representation Learning. ICLR 2022 - [c58]Xiaoteng Ma, Yiqin Yang, Hao Hu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang, Qihan Liu:
Offline Reinforcement Learning with Value-based Episodic Memory. ICLR 2022 - [c57]Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang:
Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL. ICLR 2022 - [c56]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
On the Role of Discount Factor in Offline Reinforcement Learning. ICML 2022: 9072-9098 - [c55]Li Wang, Yupeng Zhang, Yujing Hu, Weixun Wang, Chongjie Zhang, Yang Gao, Jianye Hao, Tangjie Lv, Changjie Fan:
Individual Reward Assisted Multi-Agent Reinforcement Learning. ICML 2022: 23417-23432 - [c54]Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang:
Self-Organized Polynomial-Time Coordination Graphs. ICML 2022: 24963-24979 - [c53]Lei Yuan, Chenghe Wang, Jianhao Wang, Fuxiang Zhang, Feng Chen, Cong Guan, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Concentrative Coordination with Decentralized Task Representation. IJCAI 2022: 599-605 - [c52]Jin Zhang, Siyuan Li, Chongjie Zhang:
CUP: Critic-Guided Policy Reuse. NeurIPS 2022 - [c51]Xi Chen, Ali Ghadirzadeh, Tianhe Yu, Jianhao Wang, Alex Yuan Gao, Wenzhe Li, Liang Bin, Chelsea Finn, Chongjie Zhang:
LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning. NeurIPS 2022 - [c50]Heng Dong, Tonghan Wang, Jiayuan Liu, Chongjie Zhang:
Low-Rank Modular Reinforcement Learning via Muscle Synergy. NeurIPS 2022 - [c49]Yipeng Kang, Tonghan Wang, Qianlan Yang, Xiaoran Wu, Chongjie Zhang:
Non-Linear Coordination Graphs. NeurIPS 2022 - [c48]Mingyang Liu, Chengjie Wu, Qihan Liu, Yansen Jing, Jun Yang, Pingzhong Tang, Chongjie Zhang:
Safe Opponent-Exploitation Subgame Refinement. NeurIPS 2022 - [c47]Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. NeurIPS 2022 - [i43]Yihuan Mao, Chao Wang, Bin Wang, Chongjie Zhang:
MOORe: Model-based Offline-to-Online Reinforcement Learning. CoRR abs/2201.10070 (2022) - [i42]Rui Yang, Yiming Lu, Wenzhe Li, Hao Sun, Meng Fang, Yali Du, Xiu Li, Lei Han, Chongjie Zhang:
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL. CoRR abs/2202.04478 (2022) - [i41]Rongjun Qin, Feng Chen, Tonghan Wang, Lei Yuan, Xiaoran Wu, Zongzhang Zhang, Chongjie Zhang, Yang Yu:
Multi-Agent Policy Transfer via Task Relationship Modeling. CoRR abs/2203.04482 (2022) - [i40]Xi Chen, Ali Ghadirzadeh, Tianhe Yu, Yuan Gao, Jianhao Wang, Wenzhe Li, Bin Liang, Chelsea Finn, Chongjie Zhang:
Latent-Variable Advantage-Weighted Policy Optimization for Offline RL. CoRR abs/2203.08949 (2022) - [i39]Rui Yang, Chenjia Bai, Xiaoteng Ma, Zhaoran Wang, Chongjie Zhang, Lei Han:
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing. CoRR abs/2206.02829 (2022) - [i38]Hao Hu, Yiqin Yang, Qianchuan Zhao, Chongjie Zhang:
On the Role of Discount Factor in Offline Reinforcement Learning. CoRR abs/2206.03383 (2022) - [i37]Jianing Ye, Chenghao Li, Jianhao Wang, Chongjie Zhang:
Towards Global Optimality in Cooperative MARL with Sequential Transformation. CoRR abs/2207.11143 (2022) - [i36]Jin Zhang, Siyuan Li, Chongjie Zhang:
CUP: Critic-Guided Policy Reuse. CoRR abs/2210.08153 (2022) - [i35]Heng Dong, Tonghan Wang, Jiayuan Liu, Chongjie Zhang:
Low-Rank Modular Reinforcement Learning via Muscle Synergy. CoRR abs/2210.15479 (2022) - [i34]Yipeng Kang, Tonghan Wang, Xiaoran Wu, Qianlan Yang, Chongjie Zhang:
Non-Linear Coordination Graphs. CoRR abs/2211.08404 (2022) - [i33]Xiaoran Wu, Zihan Yan, Chongjie Zhang, Tongshuang Wu:
Decisions that Explain Themselves: A User-Centric Deep Reinforcement Learning Explanation System. CoRR abs/2212.00888 (2022) - [i32]Yiqin Yang, Hao Hu, Wenzhe Li, Siyuan Li, Jun Yang, Qianchuan Zhao, Chongjie Zhang:
Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery. CoRR abs/2212.01105 (2022) - 2021
- [c46]Tonghan Wang, Tarun Gupta, Anuj Mahajan, Bei Peng, Shimon Whiteson, Chongjie Zhang:
RODE: Learning Roles to Decompose Multi-Agent Tasks. ICLR 2021 - [c45]Siyuan Li, Lulu Zheng, Jianhao Wang, Chongjie Zhang:
Learning Subgoal Representations with Slow Dynamics. ICLR 2021 - [c44]Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang:
DOP: Off-Policy Multi-Agent Decomposed Policy Gradients. ICLR 2021 - [c43]Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. ICLR 2021 - [c42]Hao Hu, Jianing Ye, Guangxiang Zhu, Zhizhou Ren, Chongjie Zhang:
Generalizable Episodic Memory for Deep Reinforcement Learning. ICML 2021: 4380-4390 - [c41]Jin Zhang, Jianhao Wang, Hao Hu, Tong Chen, Yingfeng Chen, Changjie Fan, Chongjie Zhang:
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration. ICML 2021: 12600-12610 - [c40]Zhaorong Wang, Meng Wang, Jingqi Zhang, Yingfeng Chen, Chongjie Zhang:
Reward-Constrained Behavior Cloning. IJCAI 2021: 3169-3175 - [c39]Lulu Zheng, Jiarui Chen, Jianhao Wang, Jiamin He, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao, Chongjie Zhang:
Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration. NeurIPS 2021: 3757-3769 - [c38]Chenghao Li, Tonghan Wang, Chengjie Wu, Qianchuan Zhao, Jun Yang, Chongjie Zhang:
Celebrating Diversity in Shared Multi-Agent Reinforcement Learning. NeurIPS 2021: 3991-4002 - [c37]Yao Mu, Yuzheng Zhuang, Bin Wang, Guangxiang Zhu, Wulong Liu, Jianyu Chen, Ping Luo, Shengbo Li, Chongjie Zhang, Jianye Hao:
Model-Based Reinforcement Learning via Imagination with Derived Memory. NeurIPS 2021: 9493-9505 - [c36]Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang:
On the Estimation Bias in Double Q-Learning. NeurIPS 2021: 10246-10259 - [c35]Jianhao Wang, Zhizhou Ren, Beining Han, Jianing Ye, Chongjie Zhang:
Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization. NeurIPS 2021: 29142-29155 - [c34]Jianhao Wang, Wenzhe Li, Haozhe Jiang, Guangxiang Zhu, Siyuan Li, Chongjie Zhang:
Offline Reinforcement Learning with Reverse Model-based Imagination. NeurIPS 2021: 29420-29432 - [i31]Hao Hu, Jianing Ye, Zhizhou Ren, Guangxiang Zhu, Chongjie Zhang:
Generalizable Episodic Memory for Deep Reinforcement Learning. CoRR abs/2103.06469 (2021) - [i30]Heng Dong, Tonghan Wang, Jiayuan Liu, Chongjie Zhang:
Birds of a Feather Flock Together: A Close Look at Cooperation Emergence via Multi-Agent RL. CoRR abs/2104.11455 (2021) - [i29]Siyuan Li, Jin Zhang, Jianhao Wang, Chongjie Zhang:
Efficient Hierarchical Exploration with Stable Subgoal Representation Learning. CoRR abs/2105.14750 (2021) - [i28]Chenghao Li, Chengjie Wu, Tonghan Wang, Jun Yang, Qianchuan Zhao, Chongjie Zhang:
Celebrating Diversity in Shared Multi-Agent Reinforcement Learning. CoRR abs/2106.02195 (2021) - [i27]Tonghan Wang, Liang Zeng, Weijun Dong, Qianlan Yang, Yang Yu, Chongjie Zhang:
Context-Aware Sparse Deep Coordination Graphs. CoRR abs/2106.02886 (2021) - [i26]Jiahan Cao, Lei Yuan, Jianhao Wang, Shaowei Zhang, Chongjie Zhang, Yang Yu, De-Chuan Zhan:
LINDA: Multi-Agent Local Information Decomposition for Awareness of Teammates. CoRR abs/2109.12508 (2021) - [i25]Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang:
On the Estimation Bias in Double Q-Learning. CoRR abs/2109.14419 (2021) - [i24]Jianhao Wang, Wenzhe Li, Haozhe Jiang, Guangxiang Zhu, Siyuan Li, Chongjie Zhang:
Offline Reinforcement Learning with Reverse Model-based Imagination. CoRR abs/2110.00188 (2021) - [i23]Siyang Wu, Tonghan Wang, Chenghao Li, Chongjie Zhang:
Containerized Distributed Value-Based Multi-Agent Reinforcement Learning. CoRR abs/2110.08169 (2021) - [i22]Xiaoteng Ma, Yiqin Yang, Hao Hu, Qihan Liu, Jun Yang, Chongjie Zhang, Qianchuan Zhao, Bin Liang:
Offline Reinforcement Learning with Value-based Episodic Memory. CoRR abs/2110.09796 (2021) - [i21]Lulu Zheng, Jiarui Chen, Jianhao Wang, Jiamin He, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao, Chongjie Zhang:
Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration. CoRR abs/2111.11032 (2021) - [i20]Qianlan Yang, Weijun Dong, Zhizhou Ren, Jianhao Wang, Tonghan Wang, Chongjie Zhang:
Self-Organized Polynomial-Time Coordination Graphs. CoRR abs/2112.03547 (2021) - 2020
- [c33]Guangxiang Zhu, Jianhao Wang, Zhizhou Ren, Zichuan Lin, Chongjie Zhang:
Object-Oriented Dynamics Learning through Multi-Level Abstraction. AAAI 2020: 6989-6998 - [c32]Yaohui Guo, Chongjie Zhang, X. Jessie Yang:
Modeling Trust Dynamics in Human-robot Teaming: A Bayesian Inference Approach. CHI Extended Abstracts 2020: 1-7 - [c31]Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang:
Influence-Based Multi-Agent Exploration. ICLR 2020 - [c30]Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang:
Learning Nearly Decomposable Value Functions Via Communication Minimization. ICLR 2020 - [c29]Guangxiang Zhu, Zichuan Lin, Guangwen Yang, Chongjie Zhang:
Episodic Reinforcement Learning with Associative Memory. ICLR 2020 - [c28]Tonghan Wang, Heng Dong, Victor R. Lesser, Chongjie Zhang:
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles. ICML 2020: 9876-9886 - [c27]Guangxiang Zhu, Minghao Zhang, Honglak Lee, Chongjie Zhang:
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning. NeurIPS 2020 - [i19]Tonghan Wang, Heng Dong, Victor R. Lesser, Chongjie Zhang:
ROMA: Multi-Agent Reinforcement Learning with Emergent Roles. CoRR abs/2003.08039 (2020) - [i18]Jianhao Wang, Zhizhou Ren, Beining Han, Chongjie Zhang:
Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning. CoRR abs/2006.00587 (2020) - [i17]Jin Zhang, Jianhao Wang, Hao Hu, Yingfeng Chen, Changjie Fan, Chongjie Zhang:
Learn to Effectively Explore in Context-Based Meta-RL. CoRR abs/2006.08170 (2020) - [i16]Chenghao Li, Xiaoteng Ma, Chongjie Zhang, Jun Yang, Li Xia, Qianchuan Zhao:
SOAC: The Soft Option Actor-Critic Architecture. CoRR abs/2006.14363 (2020) - [i15]Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang:
Off-Policy Multi-Agent Decomposed Policy Gradients. CoRR abs/2007.12322 (2020) - [i14]Jianhao Wang, Zhizhou Ren, Terry Liu, Yang Yu, Chongjie Zhang:
QPLEX: Duplex Dueling Multi-Agent Q-Learning. CoRR abs/2008.01062 (2020) - [i13]Tonghan Wang, Tarun Gupta, Anuj Mahajan, Bei Peng, Shimon Whiteson, Chongjie Zhang:
RODE: Learning Roles to Decompose Multi-Agent Tasks. CoRR abs/2010.01523 (2020) - [i12]Guangxiang Zhu, Minghao Zhang, Honglak Lee, Chongjie Zhang:
Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning. CoRR abs/2010.12142 (2020) - [i11]Hangtian Jia, Yujing Hu, Yingfeng Chen, Chunxu Ren, Tangjie Lv, Changjie Fan, Chongjie Zhang:
Fever Basketball: A Complex, Flexible, and Asynchronized Sports Game Environment for Multi-agent Reinforcement Learning. CoRR abs/2012.03204 (2020)
2010 – 2019
- 2019
- [c26]Xinliang Song, Tonghan Wang, Chongjie Zhang:
Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games. AAMAS 2019: 935-943 - [c25]Siyuan Li, Fangda Gu, Guangxiang Zhu, Chongjie Zhang:
Context-Aware Policy Reuse. AAMAS 2019: 989-997 - [c24]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Yan Zheng, Chongjie Zhang, Ze Zheng:
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents. AAMAS 2019: 2282-2284 - [c23]Tianpei Yang, Jianye Hao, Zhaopeng Meng, Chongjie Zhang, Yan Zheng, Ze Zheng:
Towards Efficient Detection and Optimal Response against Sophisticated Opponents. IJCAI 2019: 623-629 - [c22]Siyuan Li, Rui Wang, Minxue Tang, Chongjie Zhang:
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards. NeurIPS 2019: 1407-1417 - [i10]Xinliang Song, Tonghan Wang, Chongjie Zhang:
Convergence of Multi-Agent Learning with a Finite Step Size in General-Sum Games. CoRR abs/1903.02868 (2019) - [i9]Guangxiang Zhu, Jianhao Wang, Zhizhou Ren, Chongjie Zhang:
Object-Oriented Dynamics Learning through Multi-Level Abstraction. CoRR abs/1904.07482 (2019) - [i8]Siyuan Li, Rui Wang, Minxue Tang, Chongjie Zhang:
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards. CoRR abs/1910.04450 (2019) - [i7]Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang:
Learning Nearly Decomposable Value Functions Via Communication Minimization. CoRR abs/1910.05366 (2019) - [i6]Tonghan Wang, Jianhao Wang, Yi Wu, Chongjie Zhang:
Influence-Based Multi-Agent Exploration. CoRR abs/1910.05512 (2019) - 2018
- [c21]Siyuan Li, Chongjie Zhang:
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning. AAAI 2018: 3562-3570 - [c20]Guangxiang Zhu, Zhiao Huang, Chongjie Zhang:
Object-Oriented Dynamics Predictor. NeurIPS 2018: 9826-9837 - [i5]Siyuan Li, Fangda Gu, Guangxiang Zhu, Chongjie Zhang:
Context-Aware Policy Reuse. CoRR abs/1806.03793 (2018) - [i4]Guangxiang Zhu, Chongjie Zhang:
Object-Oriented Dynamics Predictor. CoRR abs/1806.07371 (2018) - [i3]Tianpei Yang, Zhaopeng Meng, Jianye Hao, Chongjie Zhang, Yan Zheng:
Bayes-ToMoP: A Fast Detection and Best Response Algorithm Towards Sophisticated Opponents. CoRR abs/1809.04240 (2018) - 2017
- [j4]Ramya Ramakrishnan, Chongjie Zhang, Julie A. Shah:
Perturbation Training for Human-Robot Teams. J. Artif. Intell. Res. 59: 495-541 (2017) - [c19]Daniel Garant, Bruno Castro da Silva, Victor R. Lesser, Chongjie Zhang:
Context-Based Concurrent Experience Sharing in Multiagent Systems. AAMAS 2017: 1544-1546 - [i2]Daniel Garant, Bruno Castro da Silva, Victor R. Lesser, Chongjie Zhang:
Context-Based Concurrent Experience Sharing in Multiagent Systems. CoRR abs/1703.01931 (2017) - [i1]Siyuan Li, Chongjie Zhang:
An Optimal Online Method of Selecting Source Policies for Reinforcement Learning. CoRR abs/1709.08201 (2017) - 2016
- [c18]Chongjie Zhang, Julie A. Shah:
Co-Optimizating Multi-Agent Placement with Task Assignment and Scheduling. IJCAI 2016: 3308-3314 - [c17]Chongjie Zhang, Julie A. Shah:
Co-optimizing task and motion planning. IROS 2016: 4750-4756 - 2015
- [c16]Chongjie Zhang, Julie A. Shah:
On Fairness in Decision-Making under Uncertainty: Definitions, Computation, and Comparison. AAAI 2015: 3642-3648 - 2014
- [c15]Duc Thien Nguyen, William Yeoh, Hoong Chuin Lau, Shlomo Zilberstein, Chongjie Zhang:
Decentralized Multi-Agent Reinforcement Learning in Average-Reward Dynamic DCOPs. AAAI 2014: 1447-1455 - [c14]Duc Thien Nguyen, William Yeoh, Hoong Chuin Lau, Shlomo Zilberstein, Chongjie Zhang:
Decentralized multi-agent reinforcement learning in average-reward dynamic DCOPs. AAMAS 2014: 1341-1342 - [c13]Chongjie Zhang, Julie A. Shah:
Fairness in Multi-Agent Sequential Decision-Making. NIPS 2014: 2636-2644 - 2013
- [c12]Chongjie Zhang, Victor R. Lesser:
Coordinating multi-agent reinforcement learning with limited communication. AAMAS 2013: 1101-1108 - [c11]Daniel D. Corkill, Chongjie Zhang, Bruno Castro da Silva, Yoonheui Kim, Daniel Garant, Victor R. Lesser, Xiaoqin Zhang:
Biasing the behavior of organizationally adept agents: (extended abstract). AAMAS 2013: 1309-1310 - [c10]Xiangbin Zhu, Chongjie Zhang, Victor R. Lesser:
Combining Dynamic Reward Shaping and Action Shaping for Coordinating Multi-agent Learning. IAT 2013: 321-328 - 2012
- [j3]Xiaoqin Zhang, Bhavesh Shrestha, Sung Wook Yoon, Subbarao Kambhampati, Phillip DiBona, Jinhong K. Guo, Daniel McFarlane, Martin O. Hofmann, Kenneth R. Whitebread, Darren Scott Appling, Elizabeth T. Whitaker, Ethan Trewhitt, Li Ding, James Michaelis, Deborah L. McGuinness, James A. Hendler, Janardhan Rao Doppa, Charles Parker, Thomas G. Dietterich, Prasad Tadepalli, Weng-Keen Wong, Derek T. Green, Antons Rebguns, Diana F. Spears, Ugur Kuter, Geoffrey Levine, Gerald DeJong, Reid MacTavish, Santiago Ontañón, Jainarayan Radhakrishnan, Ashwin Ram, Hala Mostafa, Huzaifa Zafar, Chongjie Zhang, Daniel D. Corkill, Victor R. Lesser, Zhexuan Song:
An Ensemble Architecture for Learning Complex Problem-Solving Techniques from Demonstration. ACM Trans. Intell. Syst. Technol. 3(4): 75:1-75:38 (2012) - 2011
- [c9]Chongjie Zhang, Victor R. Lesser:
Coordinated Multi-Agent Reinforcement Learning in Networked Distributed POMDPs. AAAI 2011: 764-770 - 2010
- [c8]Chongjie Zhang, Victor R. Lesser:
Multi-Agent Learning with Policy Prediction. AAAI 2010: 927-934 - [c7]Chongjie Zhang, Victor R. Lesser, Sherief Abdallah:
Self-organization for coordinating decentralized reinforcement learning. AAMAS 2010: 739-746
2000 – 2009
- 2009
- [c6]Chongjie Zhang, Sherief Abdallah, Victor R. Lesser:
Integrating organizational control into multi-agent learning. AAMAS (2) 2009: 757-764 - [c5]Xiaoqin Zhang, Sung Wook Yoon, Phillip DiBona, Darren Scott Appling, Li Ding, Janardhan Rao Doppa, Derek T. Green, Jinhong K. Guo, Ugur Kuter, Geoffrey Levine, Reid MacTavish, Daniel McFarlane, James Michaelis, Hala Mostafa, Santiago Ontañón, Charles Parker, Jainarayan Radhakrishnan, Antons Rebguns, Bhavesh Shrestha, Zhexuan Song, Ethan Trewhitt, Huzaifa Zafar, Chongjie Zhang, Daniel D. Corkill, Gerald DeJong, Thomas G. Dietterich, Subbarao Kambhampati, Victor R. Lesser, Deborah L. McGuinness, Ashwin Ram, Diana F. Spears, Prasad Tadepalli, Elizabeth T. Whitaker, Weng-Keen Wong, James A. Hendler, Martin O. Hofmann, Kenneth R. Whitebread:
An Ensemble Learning and Problem Solving Architecture for Airspace Management. IAAI 2009 - [c4]Chongjie Zhang, Victor R. Lesser, Prashant J. Shenoy:
A Multi-Agent Learning Approach to Online Distributed Resource Allocation. IJCAI 2009: 361-366 - 2008
- [c3]Chongjie Zhang, Sherief Abdallah, Victor R. Lesser:
Efficient multi-agent reinforcement learning through automated supervision. AAMAS (3) 2008: 1365-1370 - 2007
- [j2]Chongjie Zhang, Chirag Dekate, Gabrielle Allen, Ian Kelley, Jon MacLaren:
An application portal for collaborative coastal modeling. Concurr. Comput. Pract. Exp. 19(12): 1571-1581 (2007) - [j1]Chongjie Zhang, Ian Kelley, Gabrielle Allen:
Grid portal solutions: a comparison of GridPortlets and OGCE. Concurr. Comput. Pract. Exp. 19(12): 1739-1748 (2007) - [c2]Gabrielle Allen, Promita Chakraborty, Dayong Huang, Zhou Lei, John Lewis, Xin Li, Christopher D. White, Xiaoxi Xu, Chongjie Zhang:
A workflow approach to designed reservoir study. WORKS@HPDC 2007: 75-79 - 2005
- [c1]Jon MacLaren, Gabrielle Allen, Chirag Dekate, Dayong Huang, Andrei Hutanu, Chongjie Zhang:
Shelter from the Storm: Building a Safe Archive in a Hostile World. OTM Workshops 2005: 294-303
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-21 21:31 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint