Computer Science > Machine Learning

arXiv:2303.05628 (cs)

[Submitted on 10 Mar 2023 (v1), last revised 3 Oct 2023 (this version, v2)]

Title:On the Unlikelihood of D-Separation

Authors:Itai Feigenbaum, Huan Wang, Shelby Heinecke, Juan Carlos Niebles, Weiran Yao, Caiming Xiong, Devansh Arpit

View PDF

Abstract:Causal discovery aims to recover a causal graph from data generated by it; constraint based methods do so by searching for a d-separating conditioning set of nodes in the graph via an oracle. In this paper, we provide analytic evidence that on large graphs, d-separation is a rare phenomenon, even when guaranteed to exist, unless the graph is extremely sparse. We then provide an analytic average case analysis of the PC Algorithm for causal discovery, as well as a variant of the SGS Algorithm we call UniformSGS. We consider a set $V=\{v_1,\ldots,v_n\}$ of nodes, and generate a random DAG $G=(V,E)$ where $(v_a, v_b) \in E$ with i.i.d. probability $p_1$ if $a<b$ and $0$ if $a > b$. We provide upper bounds on the probability that a subset of $V-\{x,y\}$ d-separates $x$ and $y$, conditional on $x$ and $y$ being d-separable; our upper bounds decay exponentially fast to $0$ as $|V| \rightarrow \infty$. For the PC Algorithm, while it is known that its worst-case guarantees fail on non-sparse graphs, we show that the same is true for the average case, and that the sparsity requirement is quite demanding: for good performance, the density must go to $0$ as $|V| \rightarrow \infty$ even in the average case. For UniformSGS, while it is known that the running time is exponential for existing edges, we show that in the average case, that is the expected running time for most non-existing edges as well.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Methodology (stat.ME)
Cite as:	arXiv:2303.05628 [cs.LG]
	(or arXiv:2303.05628v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.05628

Submission history

From: Itai Feigenbaum [view email]
[v1] Fri, 10 Mar 2023 00:11:18 UTC (14 KB)
[v2] Tue, 3 Oct 2023 14:32:11 UTC (3,292 KB)

Computer Science > Machine Learning

Title:On the Unlikelihood of D-Separation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the Unlikelihood of D-Separation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators