firstbacksecondback
316 Results
Poster
|
Wed 7:30 |
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis |
|
Oral
|
Wed 6:10 |
Efficiently Computing Nash Equilibria in Adversarial Team Markov Games Fivos Kalogiannis · Ioannis Anagnostides · Ioannis Panageas · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Vaggos Chatziafratis · Stelios Stavroulakis |
|
Poster
|
Tue 7:30 |
Extreme Q-Learning: MaxEnt RL without Entropy Divyansh Garg · Joey Hejna · Matthieu Geist · Stefano Ermon |
|
Oral
|
Tue 7:10 |
Extreme Q-Learning: MaxEnt RL without Entropy Divyansh Garg · Joey Hejna · Matthieu Geist · Stefano Ermon |
|
Poster
|
Solving Continuous Control via Q-learning Tim Seyde · Peter Werner · Wilko Schwarting · Igor Gilitschenski · Martin Riedmiller · Daniela Rus · Markus Wulfmeier |
||
Oral
|
Mon 1:50 |
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar · Rishabh Agarwal · Xinyang Geng · George Tucker · Sergey Levine |
|
Poster
|
Mon 2:30 |
Offline Q-learning on Diverse Multi-Task Data Both Scales And Generalizes Aviral Kumar · Rishabh Agarwal · Xinyang Geng · George Tucker · Sergey Levine |
|
Poster
|
Critic Sequential Monte Carlo Vasileios Lioutas · Jonathan Lavington · Justice Sefas · Matthew Niedoba · Yunpeng Liu · Berend Zwartsenberg · Setareh Dabiri · Frank Wood · Adam Scibior |
||
Poster
|
Wed 7:30 |
Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning Zhendong Wang · Jonathan J Hunt · Mingyuan Zhou |
|
Oral Session
|
Mon 1:00 |
Oral 1 Track 5: Reinforcement Learning |
|
Oral Session
|
Tue 6:00 |
Oral 4 Track 3: Reinforcement Learning I |
|
Oral Session
|
Tue 1:00 |
Oral 3 Track 1: Reinforcement Learning |