planning icon indicating copy to clipboard operation
planning copied to clipboard

Tidyclust

Open kbodwin opened this issue 4 years ago • 3 comments
trafficstars

This is my writeup with some details about clustering (unsupervised learning) and how I envision that fitting into the tidymodels framework.

kbodwin avatar Feb 26 '21 00:02 kbodwin

Just wanted to comment that for unsupervised methods there is often both a forward and a backward transformation. Less so with clustering, but for many PCA-like tools. I previously brought this up at little in tidymodels/recipes#264.

Another possible consideration before starting to prototype things is the difference between inductive/transductive models, or methods that can be applied to a new dataset versus those that cannot.

alexpghayes avatar Mar 17 '21 18:03 alexpghayes

Heyo! really excited to see this developing. I've recently fallen into the world of unsupervised clustering (via some gnar text projects) and have been having a hard time understanding the literature and not being able to find certain methods in tidymodels.

michaelgaunt404 avatar Mar 25 '21 05:03 michaelgaunt404

Very cool! Looking forward to developments here and this functionality coming to tidymodels!

I posted a toy solution on SO for validating kmeans cluster partition stability on a holdout set: https://stackoverflow.com/a/68845111/9059865 . (For anyone stumbling onto this thread and looking for something simple in the interim before {celery} 😊 gets implemented in tidymodels.)

brshallo avatar Sep 20 '21 23:09 brshallo

Closing this as we already this this :)

EmilHvitfeldt avatar Sep 06 '24 18:09 EmilHvitfeldt