Cited By
- Kim J, Min S (2024). Risk-sensitive policy optimization via predictive CVaR policy gradient. In: Salakhutdinov R, Kolter Z, Heller K, Weller A, Oliver N, Scarlett J, Berkenkamp F (eds.), Proceedings of the 41st International Conference on Machine Learning, pp. 24354-24369. doi:10.5555/3692070.3693046. Online publication date: 21-Jul-2024.
- Chen Y, Zhang X, Wang S, Huang L (2024). Provable risk-sensitive distributional reinforcement learning with general function approximation. In: Salakhutdinov R, Kolter Z, Heller K, Weller A, Oliver N, Scarlett J, Berkenkamp F (eds.), Proceedings of the 41st International Conference on Machine Learning, pp. 7748-7791. doi:10.5555/3692070.3692374. Online publication date: 21-Jul-2024.
- Hau J, Delage E, Ghavamzadeh M, Petrik M (2023). On dynamic programming decompositions of static risk measures in Markov decision processes. In: Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (eds.), Proceedings of the 37th International Conference on Neural Information Processing Systems, pp. 51734-51757. doi:10.5555/3666122.3668376. Online publication date: 10-Dec-2023.