
How to look at the loss value?

Open asanakoy opened this issue 6 years ago • 9 comments

Is there a possibility to print the loss value during the optimization process? Or at least after the last epoch?

It would make sense to add this feature, as it would help to set an appropriate number of epochs when one tunes the algorithm for specific data.

asanakoy avatar Aug 02 '18 12:08 asanakoy

Probably this won't be easy because of negative sampling; it is an endless source of gradient whose amplitude is proportional to the current learning rate.

vseledkin avatar Aug 02 '18 13:08 vseledkin

@vseledkin is correct: negative sampling is a method used to avoid ever actually computing the full loss value, which is remarkably expensive. I did have a function that could be used to compute loss, but in practice it only scaled to datasets of a few thousand points, so I never included it.

lmcinnes avatar Aug 02 '18 16:08 lmcinnes
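
For concreteness, a minimal sketch of what such a full-loss computation could look like (this is not the function referred to above). It assumes a fitted model's documented `graph_` (the fuzzy simplicial set, a sparse matrix of high-dimensional memberships) and `embedding_` attributes; the fitted curve parameters live in the private attributes `_a` and `_b`, so those names may differ between versions:

```python
import numpy as np

def umap_loss(graph, embedding, a=1.577, b=0.895, eps=1e-12):
    """Fuzzy-set cross entropy between the high-dimensional memberships
    v_ij (stored in the sparse matrix `graph`) and the low-dimensional
    memberships w_ij = 1 / (1 + a * d_ij^(2b)).

    O(n^2) in time and memory, which is why it only scales to a few
    thousand points.
    """
    # Pairwise squared Euclidean distances in the embedding space.
    sq_dists = np.sum((embedding[:, None, :] - embedding[None, :, :]) ** 2, axis=-1)
    # Low-dimensional membership strengths, clipped for numerical stability.
    w = np.clip(1.0 / (1.0 + a * sq_dists ** b), eps, 1.0 - eps)
    v = np.asarray(graph.todense())
    # Cross entropy, dropping additive terms that depend only on v.
    ce = -v * np.log(w) - (1.0 - v) * np.log(1.0 - w)
    np.fill_diagonal(ce, 0.0)  # self-pairs carry no information
    return ce.sum()

# Usage after fitting, e.g.:
#   reducer = umap.UMAP().fit(X)
#   print(umap_loss(reducer.graph_, reducer.embedding_, reducer._a, reducer._b))
```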

@lmcinnes but how can we estimate that the method has converged?

asanakoy avatar Aug 02 '18 16:08 asanakoy

Convergence is not checked; instead the optimization is run for a specified number of epochs.

lmcinnes avatar Aug 02 '18 17:08 lmcinnes

But how can I make sure that 300 epochs is not better than 200 epochs? If we had a loss function, or at least the gradient norms, then we could infer how many epochs are enough.

asanakoy avatar Oct 03 '18 16:10 asanakoy
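
One way to answer this empirically, at least at small scale, is to fit with several epoch budgets and compare the resulting full loss, e.g. with the `umap_loss` sketch above. A hypothetical experiment (`X` stands in for your data; `n_epochs` and `random_state` are standard UMAP parameters):

```python
import umap

# Does 300 epochs buy anything over 200? Compare the (expensive) full loss.
for n_epochs in (100, 200, 300):
    reducer = umap.UMAP(n_epochs=n_epochs, random_state=42).fit(X)
    loss = umap_loss(reducer.graph_, reducer.embedding_,
                     a=reducer._a, b=reducer._b)
    print(f"{n_epochs:>4} epochs -> loss {loss:.2f}")
```

If the loss has flattened between two budgets, the extra epochs are not buying much.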

Contributions welcome :)

sleighsoft avatar Sep 17 '19 11:09 sleighsoft

> @vseledkin is correct: negative sampling is a method used to avoid ever actually computing the full loss value, which is remarkably expensive. I did have a function that could be used to compute loss, but in practice it only scaled to datasets of a few thousand points, so I never included it.

I don't suppose you would be willing to share it? I'm looking at applications of UMAP on reasonably small data, where having the loss function explicitly would be very useful.

AndLen avatar Jan 23 '21 07:01 AndLen

+1

csinva avatar Nov 05 '21 21:11 csinva

The Parametric UMAP submodule has a non-parametric mode that saves the loss. See the bottom figure in this notebook.

https://github.com/lmcinnes/umap/blob/master/notebooks/Parametric_UMAP/06.0-nonparametric-umap.ipynb

timsainb avatar Nov 05 '21 22:11 timsainb
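
For reference, the pattern that notebook uses, sketched minimally: with `parametric_embedding=False`, ParametricUMAP optimizes an ordinary (non-parametric) embedding through Keras and records the per-epoch loss. The `_history` attribute is taken from the notebook but is private, so it (and the non-parametric mode itself) may not exist in other versions:

```python
from umap.parametric_umap import ParametricUMAP
import matplotlib.pyplot as plt

# Non-parametric embedding trained through the Keras optimizer,
# which records the loss at every epoch as it goes.
embedder = ParametricUMAP(parametric_embedding=False)
embedding = embedder.fit_transform(X)

# Plot the recorded loss curve (private attribute; name may vary).
plt.plot(embedder._history["loss"])
plt.xlabel("epoch")
plt.ylabel("cross entropy loss")
plt.show()
```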