Hinglish
Hinglish copied to clipboard
Hinglish Text Classification
Bumps [transformers](https://github.com/huggingface/transformers) from 3.3.1 to 4.30.0. Release notes Sourced from transformers's releases. v4.30.0: 100k, Agents improvements, Safetensors core dependency, Swiftformer, Autoformer, MobileViTv2, timm-as-a-backbone 100k Transformers has just reached 100k stars...
Bumps [numpy](https://github.com/numpy/numpy) from 1.19.2 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...
Reference Code: https://github.com/NirantK/Hinglish/blob/2798fda87b9b28fa1d7921203ed466c9fd23a28d/hinglishutils.py#L401-#L408 Recommended approach from Huggingface from [here](https://huggingface.co/transformers/preprocessing.html): ```python batch_sentences = ["Hello I'm a single sentence", "And another sentence", "And the very very last one"] encoded_inputs = tokenizer(batch_sentences) print(encoded_inputs)...
Break down into smaller issues and then assign Step 1: Find Sentiment data for English tweets Step 2: Translate that to Hindi using something like [Textblob](https://github.com/sloria/TextBlob/) Step 3: Transliterate that...