Cited By
- Hau J, Delage E, Ghavamzadeh M, Petrik M (2023). On dynamic programming decompositions of static risk measures in Markov decision processes. In: Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (eds.), Proceedings of the 37th International Conference on Neural Information Processing Systems, pp. 51734-51757. doi:10.5555/3666122.3668376. Online publication date: 10-Dec-2023.
- Lobo E, Cousins C, Zick Y, Petrik M (2023). Percentile criterion optimization in offline reinforcement learning. In: Oh A, Naumann T, Globerson A, Saenko K, Hardt M, Levine S (eds.), Proceedings of the 37th International Conference on Neural Information Processing Systems, pp. 9322-9352. doi:10.5555/3666122.3666531. Online publication date: 10-Dec-2023.
- Lim S, Malik I (2022). Distributional reinforcement learning for risk-sensitive policies. In: Koyejo S, Mohamed S, Agarwal A, Belgrave D, Cho K, Oh A (eds.), Proceedings of the 36th International Conference on Neural Information Processing Systems, pp. 30977-30989. doi:10.5555/3600270.3602516. Online publication date: 28-Nov-2022.