Nico de Vos

26 comments by Nico de Vos

Also pinging @Jauntbox (we know you're out there!) for question above.

@hedibejaoui, we are running internal forks of TransmogrifAI and MLeap on Spark 3.1.1, so the bulk of the work has been done. For public release, the MLeap dependency needs...

I've followed the papers by Huang (https://github.com/nicodv/kmodes#huang98), which do the sampling from a normal distribution. Feel free to make a PR for this. It makes sense to open up the...

Can you be more specific, @hajarhomayouni?

No, there's no `transform` method currently. `KPrototypes` would have to return both a dissimilarity matrix and a distance matrix then. Wouldn't be too hard to implement, I think. What's your...

Details on category utility: https://msdn.microsoft.com/en-us/magazine/dn198247.aspx

The API of the `kmodes.predict` method indeed needs a `categorical` argument, but `sklearn` does not allow for extra arguments to the `predict` method. This causes `kmodes` to be incompatible with...

Have a look at https://github.com/nicodv/kmodes/blob/master/kmodes/util/tests/test_dissim.py for how the other dissimilarity functions work. It looks like you need to adapt your function to accept 2D vectors, whereas right now it assumes 1D...
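To illustrate the 2D-versus-1D point, here is a minimal sketch of a custom dissimilarity function. It assumes the convention that the first argument is a 2D array with one row per centroid or point, the second is a single 1D point, and the result is one value per row. The name `weighted_matching_dissim` and the `weights` parameter are hypothetical and not part of `kmodes`.

```python
import numpy as np


def weighted_matching_dissim(a, b, weights=None, **_):
    """Hypothetical example: weighted count of mismatching attributes.

    a: array of shape (n_rows, n_attrs), or a single 1D row
    b: array of shape (n_attrs,)
    Returns an array of shape (n_rows,).
    """
    a = np.atleast_2d(a)      # promote a single 1D row to shape (1, n_attrs)
    mismatches = a != b       # b is broadcast across every row of a
    if weights is None:
        return mismatches.sum(axis=1)
    return (mismatches * np.asarray(weights)).sum(axis=1)


centroids = np.array([["a", "b"], ["a", "c"]])
point = np.array(["a", "b"])
result = weighted_matching_dissim(centroids, point)
```

The extra `**_` absorbs any keyword arguments a caller might pass along; whether a given caller does so is an assumption to check against the linked tests.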

Simply by running the clustering for multiple k values, as there currently is no wrapper that does this for you automatically. It would be nice to combine this with the...
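Running the clustering for several k values can be sketched as a small loop. This is a minimal, library-agnostic sketch: `sweep_k` and `fit_cost` are hypothetical names, and `fit_cost` stands in for whatever fits a model with `k` clusters and returns its final cost.

```python
def sweep_k(fit_cost, data, k_values):
    """Fit one model per k and collect the resulting costs.

    fit_cost: callable (data, k) -> float, supplied by the caller
    Returns a dict mapping each k to its cost.
    """
    costs = {}
    for k in k_values:
        costs[k] = fit_cost(data, k)
    return costs


# Toy stand-in for a real clustering run, used only to show the call shape.
def toy_fit_cost(data, k):
    return len(data) / k


costs = sweep_k(toy_fit_cost, [1, 2, 3, 4], [1, 2, 4])

# With kmodes installed, fit_cost could look like this (an assumption; check
# the constructor arguments against your installed version):
#   from kmodes.kmodes import KModes
#   def fit_cost(data, k):
#       return KModes(n_clusters=k, init="Huang", n_init=5).fit(data).cost_
```

Plotting the returned costs against k and looking for the point where the curve flattens is one common way to choose k.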

I added tests for this scenario (https://github.com/nicodv/kmodes/commit/f5532e0064207aab4edcb53be509153aa2cf00ac), but I can't reproduce this. Please provide a fully reproducible example, @RoddyJaques.