nltk_data
Tatoeba corpora
Tatoeba provides several corpora: text, audio, and translations. It is released under a public licence. Could we add it to the NLTK corpora data download? By the way, I'm interested in the Kabyle corpus, as I'm using NLTK for some processing tasks.