Dimension reduction technique for visualization and general non-linear reduction
Top 6.7% on sourcepulse
UMAP (Uniform Manifold Approximation and Projection) is a powerful dimension reduction technique for visualization and general non-linear dimensionality reduction. It's designed for users needing to explore high-dimensional data, offering a faster and often more globally representative alternative to t-SNE, with added capabilities like density preservation and supervised learning.
How It Works
UMAP models data with a fuzzy topological structure, aiming to find a low-dimensional projection that closely matches this structure. It leverages Riemannian geometry and fuzzy simplicial sets, with performance optimized by the numba
library and the optional pynndescent
library for nearest neighbor searches. This approach allows UMAP to preserve both local and global data structure effectively.
Quick Start & Requirements
pip install umap-learn
or conda install -c conda-forge umap-learn
.pynndescent
.Highlighted Details
densMAP
for preserving local density and Parametric UMAP for neural network-based transformations.transform
for new data and supervised/semi-supervised learning.Maintenance & Community
The project is actively maintained by Leland McInnes and contributors. Community support is available via GitHub Issues.
Licensing & Compatibility
Limitations & Caveats
While UMAP is robust, the densMAP variant recommends larger n_neighbors
(e.g., 30) for reliable density estimation. Parametric UMAP is noted as experimental.
1 week ago
Inactive