RecursiveHierarchicalClustering icon indicating copy to clipboard operation
RecursiveHierarchicalClustering copied to clipboard

Cluster quality results?

Open gmobile15 opened this issue 5 years ago • 0 comments

Hi,

Massive thanks for this great tool and it works absolutely fine with my data. In the paper, you mentioned that you experimented with different values of k to create the k-gram sequences. What metric would you recommend to evaluate these clusters?

For e.g. if I experiment with k -> [1,2,3,4,5] I would have 5 set of results (assuming I dont include time gaps at this stage, as that would double the number of results). How would I decide which clustering is the best? Is it simply the modularity score? If yes, each cluster has a modularity value but is there a way to amalgamate that for an entire set of results?

gmobile15 avatar Aug 27 '20 13:08 gmobile15