Cited By
- Srinivasan S, Lanctot M, Zambaldi V, Pérolat J, Tuyls K, Munos R, Bowling M (2018). Actor-critic policy optimization in partially observable multiagent environments. Proceedings of the 32nd International Conference on Neural Information Processing Systems, pp. 3426-3439. doi:10.5555/3327144.3327261. Online publication date: 3-Dec-2018.
- Foerster J, Chen R, Al-Shedivat M, Whiteson S, Abbeel P, Mordatch I, Andre E, Koenig S, Dastani M, Sukthankar G (2018). Learning with Opponent-Learning Awareness. Proceedings of the 17th International Conference on Autonomous Agents and MultiAgent Systems, pp. 122-130. doi:10.5555/3237383.3237408. Online publication date: 9-Jul-2018.
- Sun F, Chang Y, Wu Y, Lin S, Furman J, Marchant G, Price H, Rossi F (2018). Designing Non-greedy Reinforcement Learning Agents with Diminishing Reward Shaping. Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, pp. 297-302. doi:10.1145/3278721.3278759. Online publication date: 27-Dec-2018.