indicnlp_catalog icon indicating copy to clipboard operation
indicnlp_catalog copied to clipboard

Resources mentioned in LREC 2020 papers

Open anoopkunchukuttan opened this issue 3 years ago • 3 comments

Text

  • Bangla Fake News; http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.349.pdf
    • https://github.com/Rowan1697/FakeNews
  • Urdu Fake News: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.309.pdf
    • https://github.com/MaazAmjad/Urdu-News-Augmented-Dataset
  • Urdu Lexical text simplification: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.428.pdf
    • https://github.com/NamoosQasmi/SimplifyUR
  • Inflection corpus: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.481.pdf
    • https://github.com/lenakmeth/Wikinflection-Corpus
  • Cognate dataset: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.378.pdf
    • https://github.com/dipteshkanojia/challengeCognateFF
  • Discourse modes in Hindi: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.149.pdf
    • (already in catalog)
  • Event Extraction in Hindi: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.273.pdf
  • Bangla Discourse Connectives: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.138.pdf
  • Sindhi NER data: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.361.pdf
  • Telugu Aspect based sentiment analysis: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.617.pdf
  • Odia Sentiment Analysis: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.339.pdf
  • A seed corpus of Hindu temples in India (Could be possibly usefuole for QA in Indian English:) http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.32.pdf
  • CVIT Parallel Corpus (already in catalog)
  • CVIT Speech Corpus (already in catalog)
  • Dakshina Dataset (already in catalog)
  • WikiPron (already in catalog)

Speech

  • Bangla Speech: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.811.pdf
  • CommonVoice: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.520.pdf
  • CMU Wilderness Dataset for speech from Bible: https://github.com/festvox/datasets-CMU_Wilderness
  • Indian English Pronunciation dict: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.812.pdf
  • Google Speech Corpus (added to catalogs): http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.800.pdf.

anoopkunchukuttan avatar Aug 23 '20 14:08 anoopkunchukuttan

The Treebank of Vedic Sanskrit: https://www.aclweb.org/anthology/2020.lrec-1.632.pdf SHR++: An Interface for Morpho-syntactic annotation of Sanskrit Corpora (Code and Demo available for the tool): http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.874.pdf

@anoopkunchukuttan

krishnamrith12 avatar Sep 05 '20 06:09 krishnamrith12

Thanks Amrith, will add these as well to the catalog

anoopkunchukuttan avatar Sep 10 '20 09:09 anoopkunchukuttan

@krishnamrith12 , added to catalog. As a small acknowledgment - I have added you the list of contributors to the catalog.

anoopkunchukuttan avatar Oct 01 '20 14:10 anoopkunchukuttan