Computer Science > Machine Learning

arXiv:2310.18531 (cs)

[Submitted on 27 Oct 2023]

Title:Feature Selection in the Contrastive Analysis Setting

Authors:Ethan Weinberger, Ian Covert, Su-In Lee

View PDF

Abstract:Contrastive analysis (CA) refers to the exploration of variations uniquely enriched in a target dataset as compared to a corresponding background dataset generated from sources of variation that are irrelevant to a given task. For example, a biomedical data analyst may wish to find a small set of genes to use as a proxy for variations in genomic data only present among patients with a given disease (target) as opposed to healthy control subjects (background). However, as of yet the problem of feature selection in the CA setting has received little attention from the machine learning community. In this work we present contrastive feature selection (CFS), a method for performing feature selection in the CA setting. We motivate our approach with a novel information-theoretic analysis of representation learning in the CA setting, and we empirically validate CFS on a semi-synthetic dataset and four real-world biomedical datasets. We find that our method consistently outperforms previously proposed state-of-the-art supervised and fully unsupervised feature selection methods not designed for the CA setting. An open-source implementation of our method is available at this https URL.

Comments:	NeurIPS 2023
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2310.18531 [cs.LG]
	(or arXiv:2310.18531v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2310.18531

Submission history

From: Ethan Weinberger [view email]
[v1] Fri, 27 Oct 2023 23:16:03 UTC (5,550 KB)

Computer Science > Machine Learning

Title:Feature Selection in the Contrastive Analysis Setting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Feature Selection in the Contrastive Analysis Setting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators