Computer Science > Robotics

arXiv:2307.15801 (cs)

[Submitted on 28 Jul 2023 (v1), last revised 2 Aug 2023 (this version, v2)]

Title: Primitive Skill-based Robot Learning from Human Evaluative Feedback

Authors: Ayano Hiranaka, Minjune Hwang, Sharon Lee, Chen Wang, Li Fei-Fei, Jiajun Wu, Ruohan Zhang

Abstract: Reinforcement learning (RL) algorithms face significant challenges when dealing with long-horizon robot manipulation tasks in real-world environments due to sample inefficiency and safety issues. To overcome these challenges, we propose a novel framework, SEED, which leverages two approaches: reinforcement learning from human feedback (RLHF) and primitive skill-based reinforcement learning. Both approaches are particularly effective in addressing sparse reward issues and the complexities involved in long-horizon tasks. By combining them, SEED reduces the human effort required in RLHF and increases safety in training robot manipulation with RL in real-world settings. Additionally, parameterized skills provide a clear view of the agent's high-level intentions, allowing humans to evaluate skill choices before they are executed. This feature makes the training process even safer and more efficient. To evaluate the performance of SEED, we conducted extensive experiments on five manipulation tasks with varying levels of complexity. Our results show that SEED significantly outperforms state-of-the-art RL algorithms in sample efficiency and safety. SEED also exhibits a substantial reduction of human effort compared to other RLHF methods. Further details and video results can be found at this https URL.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2307.15801 [cs.RO]
	(or arXiv:2307.15801v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2307.15801

Submission history

From: Minjune Hwang
[v1] Fri, 28 Jul 2023 20:48:30 UTC (9,978 KB)
[v2] Wed, 2 Aug 2023 06:22:24 UTC (11,820 KB)
