pytorch_kmeans

Implementation of the k-means algorithm in PyTorch that works for large datasets

This code works for any dataset that fits in GPU memory. Tested with Python 3 and PyTorch 1.0.0.

For simplicity, the procedure stops only when the cluster assignments stop changing between iterations. In practice, this criterion may be too strict and should be relaxed.
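One possible relaxation, given here as a minimal sketch rather than as what this repository does, is to stop once only a small fraction of points change cluster in an iteration. The helper name `should_stop` and the tolerance `tol` are hypothetical and not part of the original code.

```python
def should_stop(num_changed, num_points, tol=1e-3):
    """Return True when at most a fraction `tol` of the points switched cluster.

    Hypothetical helper: with tol=0.0 it reduces to the strict rule of
    stopping only when no assignment changes at all.
    """
    if num_points <= 0:
        raise ValueError("num_points must be positive")
    return num_changed / num_points <= tol


# Inside a PyTorch k-means loop, the count could be obtained with, e.g.:
#   num_changed = (new_assignments != old_assignments).sum().item()
# where both tensors hold one cluster index per data point (assumed layout).
```

A cap on the number of iterations is a common complementary safeguard, since the relaxed rule alone does not bound the running time.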

There is a magic constant (search for chunk_size) that should ideally be determined automatically from the amount of free memory on the GPU.
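A rough way to derive such a value is sketched below. It assumes that the dominant temporary buffer is a distance matrix of shape (chunk_size, number of centroids) in 32-bit floats, which may not match how this code actually uses chunk_size. The names `estimate_chunk_size`, `free_gpu_bytes`, and the `safety` factor are hypothetical. `torch.cuda.mem_get_info` is available in recent PyTorch releases and may not exist in older ones such as 1.0.0.

```python
def estimate_chunk_size(free_bytes, num_centroids, bytes_per_element=4, safety=0.5):
    """Rows per chunk so that one (rows x num_centroids) buffer fits in memory.

    Assumption: the largest temporary is a distance matrix with one row per
    point in the chunk and one column per centroid. Only a fraction `safety`
    of the free memory is used, leaving headroom for other allocations.
    """
    budget = int(free_bytes * safety)
    bytes_per_row = num_centroids * bytes_per_element
    return max(1, budget // bytes_per_row)


def free_gpu_bytes(device=None):
    """Query free GPU memory in bytes (needs a recent PyTorch and a CUDA device)."""
    import torch  # imported lazily so estimate_chunk_size works without PyTorch

    free, _total = torch.cuda.mem_get_info(device)
    return free
```

With a GPU available, the two pieces combine as `chunk_size = estimate_chunk_size(free_gpu_bytes(), num_centroids)`; the result is only a starting point and may still need tuning.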

pytorch

python3

big-data

clustering

k-means
