Computer Science > Machine Learning

arXiv:2309.12032 (cs)

[Submitted on 21 Sep 2023 (v1), last revised 1 Nov 2024 (this version, v2)]

Title:Human-in-the-Loop Causal Discovery under Latent Confounding using Ancestral GFlowNets

Authors:Tiago da Silva, Eliezer Silva, António Góis, Dominik Heider, Samuel Kaski, Diego Mesquita, Adèle Ribeiro

Abstract:Structure learning is the crux of causal inference. Notably, causal discovery (CD) algorithms are brittle when data is scarce, possibly inferring imprecise causal relations that contradict expert knowledge -- especially when considering latent confounders. To aggravate the issue, most CD methods do not provide uncertainty estimates, making it hard for users to interpret results and improve the inference process. Surprisingly, while CD is a human-centered affair, no works have focused on building methods that both 1) output uncertainty estimates that can be verified by experts and 2) interact with those experts to iteratively refine CD. To solve these issues, we start by proposing to sample (causal) ancestral graphs proportionally to a belief distribution based on a score function, such as the Bayesian information criterion (BIC), using generative flow networks. Then, we leverage the diversity in candidate graphs and introduce an optimal experimental design to iteratively probe the expert about the relations among variables, effectively reducing the uncertainty of our belief over ancestral graphs. Finally, we update our samples to incorporate human feedback via importance sampling. Importantly, our method does not require causal sufficiency (i.e., unobserved confounders may exist). Experiments with synthetic observational data show that our method can accurately sample from distributions over ancestral graphs and that we can greatly improve inference quality with human aid.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2309.12032 [cs.LG]
	(or arXiv:2309.12032v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.12032

Submission history

From: Diego Mesquita [view email]
[v1] Thu, 21 Sep 2023 12:53:45 UTC (23,293 KB)
[v2] Fri, 1 Nov 2024 16:46:49 UTC (23,286 KB)

Computer Science > Machine Learning

Title:Human-in-the-Loop Causal Discovery under Latent Confounding using Ancestral GFlowNets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Human-in-the-Loop Causal Discovery under Latent Confounding using Ancestral GFlowNets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators