Computer Science > Machine Learning

arXiv:2406.04421 (cs)

[Submitted on 6 Jun 2024]

Title:Enhancing Supervised Visualization through Autoencoder and Random Forest Proximities for Out-of-Sample Extension

Authors:Shuang Ni, Adrien Aumon, Guy Wolf, Kevin R. Moon, Jake S. Rhodes

Abstract:The value of supervised dimensionality reduction lies in its ability to uncover meaningful connections between data features and labels. Common dimensionality reduction methods embed a set of fixed, latent points, but are not capable of generalizing to an unseen test set. In this paper, we provide an out-of-sample extension method for the random forest-based supervised dimensionality reduction method, RF-PHATE, combining information learned from the random forest model with the function-learning capabilities of autoencoders. Through quantitative assessment of various autoencoder architectures, we identify that networks that reconstruct random forest proximities are more robust for the embedding extension problem. Furthermore, by leveraging proximity-based prototypes, we achieve a 40% reduction in training time without compromising extension quality. Our method does not require label information for out-of-sample points, thus serving as a semi-supervised method, and can achieve consistent quality using only 10% of the training data.

Comments:	7 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2406.04421 [cs.LG]
	(or arXiv:2406.04421v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.04421

Submission history

From: Jake Rhodes [view email]
[v1] Thu, 6 Jun 2024 18:06:50 UTC (328 KB)

Computer Science > Machine Learning

Title:Enhancing Supervised Visualization through Autoencoder and Random Forest Proximities for Out-of-Sample Extension

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Enhancing Supervised Visualization through Autoencoder and Random Forest Proximities for Out-of-Sample Extension

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators