rainette icon indicating copy to clipboard operation
rainette copied to clipboard

Unclassified segments when doing simple clustering?

Open gabrielparriaux opened this issue 11 months ago • 2 comments

Hello @juba,

I thought that it was only when doing double clustering that there was this option not to force the classification of some segments and that some segments got a NA value as classification.

But I was surprised to do a simple clustering, get back the clusters with cutree_rainette() and have some of them being NA.

Did I do something wrong? Is it normal that some segments are not classified when performing a simple rainette clustering?

Thanks a lot for your opinion about that and all the best!

Gabriel

gabrielparriaux avatar Mar 17 '24 17:03 gabrielparriaux

In general when a document gets NA as cluster in a simple classification it doesn't have any content (it is very short and/or only consisted of terms that have been filtered out).

juba avatar Mar 19 '24 11:03 juba

Oh, ok! So, it must be the reason… I will check the content of those segments!

gabrielparriaux avatar Mar 19 '24 13:03 gabrielparriaux