indicnlp_catalog
indicnlp_catalog copied to clipboard
UMSAB, the Unified Multilingual Sentiment Analysis Benchmark - contains hindi sentiment data
Dataset link: https://github.com/cardiffnlp/xlm-t/tree/main/data/sentiment/hindi
^ it is in Latin script and contains a mix of English and Hindi Twitter tweets.