indicnlp_catalog Resources mentioned in LREC 2020 papers

Resources mentioned in LREC 2020 papers

Open anoopkunchukuttan opened this issue 3 years ago • 3 comments

Text

Bangla Fake News; http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.349.pdf
- https://github.com/Rowan1697/FakeNews
Urdu Fake News: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.309.pdf
- https://github.com/MaazAmjad/Urdu-News-Augmented-Dataset
Urdu Lexical text simplification: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.428.pdf
- https://github.com/NamoosQasmi/SimplifyUR
Inflection corpus: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.481.pdf
- https://github.com/lenakmeth/Wikinflection-Corpus
Cognate dataset: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.378.pdf
- https://github.com/dipteshkanojia/challengeCognateFF
Discourse modes in Hindi: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.149.pdf
- (already in catalog)
Event Extraction in Hindi: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.273.pdf
Bangla Discourse Connectives: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.138.pdf
Sindhi NER data: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.361.pdf
Telugu Aspect based sentiment analysis: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.617.pdf
Odia Sentiment Analysis: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.339.pdf
A seed corpus of Hindu temples in India (Could be possibly usefuole for QA in Indian English:) http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.32.pdf
CVIT Parallel Corpus (already in catalog)
CVIT Speech Corpus (already in catalog)
Dakshina Dataset (already in catalog)
WikiPron (already in catalog)

Speech

Bangla Speech: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.811.pdf
CommonVoice: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.520.pdf
CMU Wilderness Dataset for speech from Bible: https://github.com/festvox/datasets-CMU_Wilderness
Indian English Pronunciation dict: http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.812.pdf
Google Speech Corpus (added to catalogs): http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.800.pdf.

The Treebank of Vedic Sanskrit: https://www.aclweb.org/anthology/2020.lrec-1.632.pdf SHR++: An Interface for Morpho-syntactic annotation of Sanskrit Corpora (Code and Demo available for the tool): http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.874.pdf

@anoopkunchukuttan

Sep 05 '20 06:09 krishnamrith12

Thanks Amrith, will add these as well to the catalog

Sep 10 '20 09:09 anoopkunchukuttan

@krishnamrith12 , added to catalog. As a small acknowledgment - I have added you the list of contributors to the catalog.

Oct 01 '20 14:10 anoopkunchukuttan

indicnlp_catalog indicnlp_catalog copied to clipboard

Resources mentioned in LREC 2020 papers

indicnlp_catalog
indicnlp_catalog copied to clipboard