contentvec icon indicating copy to clipboard operation
contentvec copied to clipboard

What are the pseudo labels?

Open Lukysoon opened this issue 11 months ago • 1 comments

Hi, I thought that ContentVec (as well as HuBert) use k-means algorithm for creating labels. So for what reason we need {train,valid}.km and what exactly they are?

Thank you :-)

Lukysoon avatar Mar 21 '24 15:03 Lukysoon

{train,valid}.km are the labels clustered by k-means

auspicious3000 avatar Mar 21 '24 17:03 auspicious3000