indicnlp_corpus
indicnlp_corpus copied to clipboard
No Frequency Files in Data Download
The Readme text-corpora section mentions
Note
The vocabulary frequency files contain the frequency of all unique tokens in the corpus. Each line contains one word along with frequency delimited by tab.
However, the download links only contain the .txt files with paragraphs and not the frequency files.