umap icon indicating copy to clipboard operation
umap copied to clipboard

Alternative network architecture for Parametric UMAP

Open 16thomja opened this issue 2 years ago • 1 comments

I am projecting 10s of millions of 200-dimensional text embeddings into 3 dimensions with the intent of clustering in the reduced space. Under my memory constraints, training PUMAP on a subset of these embeddings appears to be the best option. Can anyone suggest alternative NN architectures for this purpose? I am unsure whether the default 3-layer 100-neuron is sufficient.

16thomja avatar Aug 24 '22 15:08 16thomja

maybe an auto-encoder of sorts comes to mind but not rlly sure

Mirage-deux avatar Aug 24 '22 19:08 Mirage-deux