Statistics > Machine Learning

arXiv:2306.10983 (stat)

[Submitted on 19 Jun 2023 (v1), last revised 27 Jun 2023 (this version, v2)]

Title: Effect-Invariant Mechanisms for Policy Generalization

Authors: Sorawit Saengkyongam, Niklas Pfister, Predrag Klasnja, Susan Murphy, Jonas Peters

Abstract: Policy learning is an important component of many real-world learning systems. A major challenge in policy learning is how to adapt efficiently to unseen environments or tasks. Recently, it has been suggested to exploit invariant conditional distributions to learn models that generalize better to unseen environments. However, assuming invariance of entire conditional distributions (which we call full invariance) may be too strong an assumption in practice. In this paper, we introduce a relaxation of full invariance called effect-invariance (e-invariance for short) and prove that it is sufficient, under suitable assumptions, for zero-shot policy generalization. We also discuss an extension that exploits e-invariance when we have a small sample from the test environment, enabling few-shot policy generalization. Our work does not assume an underlying causal graph or that the data are generated by a structural causal model; instead, we develop testing procedures to test e-invariance directly from data. We present empirical results using simulated data and a mobile health intervention dataset to demonstrate the effectiveness of our approach.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2306.10983 [stat.ML]
	(or arXiv:2306.10983v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2306.10983

Submission history

From: Sorawit Saengkyongam
[v1] Mon, 19 Jun 2023 14:50:24 UTC (87 KB)
[v2] Tue, 27 Jun 2023 16:09:11 UTC (89 KB)
