pygraphistry icon indicating copy to clipboard operation
pygraphistry copied to clipboard

[FEA] UMAP umap_learn integration metaissue

Open lmeyerov opened this issue 3 years ago • 2 comments

Is your feature request related to a problem? Please describe.

Using umap_learn with graphistry should be more visible and easier!

See: https://github.com/graphistry/pygraphistry/blob/master/demos/demos_databases_apis/umap_learn/umap_learn.ipynb

Describe the solution you'd like

  • [ ] umap_learn: embedding.plot() <- maintainers already signaled interest
    • [ ] method
    • [ ] docs on their side
  • [x] pygraphistry: g = g.nodes(...).umap(engine='umap_learn', params=...); print(g._embeddings); g.plot()
    • Unclear what input params + outputs for typical cases... maybe just raw parent objs until we understand what to standardize?
  • [ ] attributes: options not just for embedding, but original properties + potentially vectorized features
  • [x] option to autovectorize, nodes & maybe label propagated edges <-- feels like a separate issue
  • [ ] maybe some recommended settings or enrichments, like:
    • compact layout
    • color edges by similarity

Implementation notes

May be easiest if mostly implemented as a shim to our side, so we can update with less friction, and they just version bump whenever ready

Try to keep varargs in most places to be more robust against changes on either side

lmeyerov avatar Dec 18 '21 18:12 lmeyerov

@lmcinnes FYI :)

lmeyerov avatar Dec 18 '21 18:12 lmeyerov

Moving discussion to https://github.com/graphistry/pygraphistry/discussions/300

lmeyerov avatar Jan 14 '22 19:01 lmeyerov