multilingual-data-stats
Maybe add vocab size column?
For extra feelgoodness, vocab size based on various frequency thresholds ("unique words with >=100 appearances" and such).
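
A rough sketch of what that column could be computed from, assuming whitespace tokenization and one text per line. The function name `vocab_sizes` and the default thresholds are made up for illustration; nothing here comes from the existing stats script.

```python
from collections import Counter


def vocab_sizes(lines, thresholds=(1, 10, 100)):
    """Return {threshold: number of unique words seen at least that many times}.

    Assumption: words are whatever str.split() yields (whitespace tokens),
    which is a poor fit for languages written without spaces.
    """
    counts = Counter()
    for line in lines:
        counts.update(line.split())
    return {t: sum(1 for c in counts.values() if c >= t) for t in thresholds}


# Tiny toy corpus: "a" appears 3 times, "b" twice, "c" once.
sample = ["a b a", "a c b"]
sizes = vocab_sizes(sample, thresholds=(1, 2, 3))
print(sizes)
```

The threshold of 1 gives the plain vocab size; higher thresholds show how much of that vocab is long-tail noise. For languages without whitespace word boundaries, a language-specific tokenizer would need to replace `str.split()`.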