default search action
Neurocomputing, Volume 263
Volume 263, November 2017
- Madalina M. Drugan, Marco A. Wiering, Peter Vamplew, Madhu Chetty:
Special issue on multi-objective reinforcement learning. 1-2
- Simone Parisi, Matteo Pirotta, Jan Peters:
Manifold-based multi-objective policy search with sample reuse. 3-14 - Manuela Ruiz-Montiel, Lawrence Mandow, José-Luis Pérez-de-la-Cruz:
A temporal difference method for multi-objective reinforcement learning. 15-25 - Peter Vamplew, Rustam Issabekov, Richard Dazeley, Cameron Foale, Adam Berry, Tim Moore, Douglas C. Creighton:
Steering approaches to Pareto-optimal multiobjective reinforcement learning. 26-38
- Thommen George Karimpanal, Erik Wilhelm:
Identification and off-policy learning of multiple objectives using adaptive clustering. 39-47 - Tim Brys, Anna Harutyunyan, Peter Vrancx, Ann Nowé, Matthew E. Taylor:
Multi-objectivization and ensembles of shapings in reinforcement learning. 48-59
- Patrick Mannion, Sam Devlin, Karl Mason, Jim Duggan, Enda Howley:
Policy invariance under reward transformations for multi-objective reinforcement learning. 60-73 - Peter Vamplew, Richard Dazeley, Cameron Foale:
Softmax exploration strategies for multiobjective reinforcement learning. 74-86
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.