Kilosort icon indicating copy to clipboard operation
Kilosort copied to clipboard

max_cluster_subset recommended values

Open nikhilchandra opened this issue 7 months ago • 3 comments

Hi Jacob,

We noticed the addition of the new parameter "max_cluster_subset" which places an upper limit on the number of spikes used for building the clustering graph. Do you have any intuition yet as to what value(s) to set this parameter to?

Thanks, Nikhil

nikhilchandra avatar May 28 '25 20:05 nikhilchandra

You do not need to set it to anything. The current default, None, uses the existing clustering strategy.

We will likely set the default to be 25,000 in the future. That is unlikely to have a noticeable impact on most 2-3 hour (or shorter) recordings, since it would only have an effect if there is more than 500,000 spikes for a single grouping center (~40um probe section for Neuropixels) assuming cluster_downsampling=20 (the default). For longer recordings, this should significantly reduce runtimes.

jacobpennington avatar May 28 '25 20:05 jacobpennington

When you say "2-3 recordings" do you mean "2-3 hour recordings"?

nikhilchandra avatar May 29 '25 17:05 nikhilchandra

Yes, that was a typo.

jacobpennington avatar May 29 '25 19:05 jacobpennington