indicnlp_catalog icon indicating copy to clipboard operation
indicnlp_catalog copied to clipboard

CEnTam- Corpus

Open sanjanasri opened this issue 2 years ago • 1 comments

Hi,

You can include cEnTam corpus - English & Tamil parallel and monolingual corpus , Corressponding paper: https://aclanthology.org/2020.bucc-1.10.pdf

sanjanasri avatar Aug 04 '22 14:08 sanjanasri

Thanks Sanjanasri. I have a few questions:

  • What are the sources for the corpus? Does it include books and literary sources? A list of sources will be valuable documentation.
  • Where can I get the corpus from?
  • What is the license under which the corpus is released?

anoopkunchukuttan avatar Aug 06 '22 13:08 anoopkunchukuttan