indicnlp_corpus icon indicating copy to clipboard operation
indicnlp_corpus copied to clipboard

No Frequency Files in Data Download

Open sumeet-iitg opened this issue 2 years ago • 0 comments

The Readme text-corpora section mentions

Note

The vocabulary frequency files contain the frequency of all unique tokens in the corpus. Each line contains one word along with frequency delimited by tab.

However, the download links only contain the .txt files with paragraphs and not the frequency files.

sumeet-iitg avatar Sep 07 '23 18:09 sumeet-iitg