Cited By
View all- Shang WLi QQin ZYu YMeng YYe J(2021)Partially observable environment estimation with uplift inference for reinforcement learning based recommendationMachine Learning10.1007/s10994-021-05969-wOnline publication date: 14-Apr-2021
We study the problem of self-interested planning under uncertainty in settings shared with more than a thousand other agents, each of which plans at its own individual level. We refer to such large numbers of agents as an agent population. The decision-...
Modeling other agents' behaviors plays an important role in decision models for interactions among multiple agents. To optimize its own decisions, a subject agent needs to model what other agents act simultaneously in an uncertain environment. ...
Association for Computing Machinery
New York, NY, United States
Check if you have access through your login credentials or your institution to get full access on this article.
Sign in