Export Citations
1 Results for: Keyword: experience replay
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
Searched The ACM Guide to Computing Literature (3,790,161 records)|Limit your search to The ACM Full-Text Collection (766,446 records)
Showing 1 - 1of1 Results
- research-articleMay 2024
Foresight Distribution Adjustment for Off-policy Reinforcement Learning
AAMAS '24: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent SystemsPages 317–325Off-policy reinforcement learning algorithms maintain a replay buffer to utilize samples obtained from earlier policies. The sampling strategy that prioritizes certain data in a buffer to train the value function or the policy, has been shown to ...