Computer Science > Machine Learning

arXiv:2205.15752 (cs)

[Submitted on 31 May 2022 (v1), last revised 4 Jun 2023 (this version, v2)]

Title: Hierarchies of Reward Machines

Authors: Daniel Furelos-Blanco, Mark Law, Anders Jonsson, Krysia Broda, Alessandra Russo

Abstract: Reward machines (RMs) are a recent formalism for representing the reward function of a reinforcement learning task through a finite-state machine whose edges encode subgoals of the task using high-level events. The structure of RMs enables the decomposition of a task into simpler and independently solvable subtasks that help tackle long-horizon and/or sparse reward tasks. We propose a formalism for further abstracting the subtask structure by endowing an RM with the ability to call other RMs, thus composing a hierarchy of RMs (HRM). We exploit HRMs by treating each call to an RM as an independently solvable subtask using the options framework, and describe a curriculum-based method to learn HRMs from traces observed by the agent. Our experiments reveal that exploiting a handcrafted HRM leads to faster convergence than with a flat HRM, and that learning an HRM is feasible in cases where its equivalent flat representation is not.
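To make the idea of an RM that calls other RMs concrete, the following is a minimal illustrative sketch and not the paper's implementation. It assumes a simplified semantics: each edge is labelled either with a single high-level event or with the name of another machine (a call), event names and machine names are disjoint, and a reward of 1.0 is given only when the root machine reaches its accepting state. All identifiers (`RewardMachine`, `step`, `unwind`, `episode_reward`) and the example task (`make_tool`, `get_wood`, `get_iron`) are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RewardMachine:
    """A simplified reward machine (hypothetical structure, not the paper's)."""
    initial: str
    accepting: str
    # edges[state][label] = next_state; a label is either an event name
    # or the name of another machine, in which case the edge is a call.
    edges: dict


def unwind(machines, stack):
    """Pop every called machine that has reached its accepting state and
    advance its caller along the corresponding call edge."""
    while len(stack) > 1:
        name, state = stack[-1]
        if state != machines[name].accepting:
            break
        caller, caller_state = stack[-2]
        next_state = machines[caller].edges[caller_state][name]
        stack = stack[:-2] + [(caller, next_state)]
    return stack


def step(machines, stack, event):
    """Advance the call stack by one event; return it unchanged if no edge fires."""
    name, state = stack[-1]
    for label, next_state in machines[name].edges.get(state, {}).items():
        if label == event:
            # Ordinary edge: the observed event satisfies this subgoal.
            return unwind(machines, stack[:-1] + [(name, next_state)])
        if label in machines:
            # Call edge: try the event inside the called machine.
            pushed = stack + [(label, machines[label].initial)]
            result = step(machines, pushed, event)
            if result != pushed:
                return result
    return stack


def episode_reward(machines, root, trace):
    """Assumed reward scheme: 1.0 if the trace drives the root machine to its
    accepting state, 0.0 otherwise."""
    stack = [(root, machines[root].initial)]
    for event in trace:
        stack = step(machines, stack, event)
    return 1.0 if stack == [(root, machines[root].accepting)] else 0.0


# Hypothetical task: collect wood, then iron, then visit a workbench.
machines = {
    "get_wood": RewardMachine("u0", "uA", {"u0": {"wood": "uA"}}),
    "get_iron": RewardMachine("u0", "uA", {"u0": {"iron": "uA"}}),
    "make_tool": RewardMachine(
        "u0", "uA",
        {"u0": {"get_wood": "u1"},
         "u1": {"get_iron": "u2"},
         "u2": {"workbench": "uA"}},
    ),
}

print(episode_reward(machines, "make_tool", ["wood", "iron", "workbench"]))
print(episode_reward(machines, "make_tool", ["iron", "wood", "workbench"]))
```

In this sketch the root machine never mentions the events `wood` or `iron` directly; it only refers to the two called machines, which is the kind of subtask abstraction the abstract describes. Treating each call as an option and learning the hierarchy from traces are outside the scope of this example.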

Comments:	Preprint accepted for publication to the 40th International Conference on Machine Learning (ICML-23)
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2205.15752 [cs.LG]
	(or arXiv:2205.15752v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.15752

Submission history

From: Daniel Furelos-Blanco
[v1] Tue, 31 May 2022 12:39:24 UTC (1,255 KB)
[v2] Sun, 4 Jun 2023 09:07:56 UTC (3,553 KB)
