openhathi_instruct
openhathi_instruct copied to clipboard
This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resulting model is meant to follow instructions and chat in Hindi and H...
hey @pacman100 the translation code could take days given there are multiple SFT datasets and multiple languages to translate on. is there a way to accelerate the code by launching...
Whenever there is use of Hindi prompts, the model gives incorrect, unreliable and sometimes irrelevant information
AutoTokenizer and LlamaTokenizer (which Sarvam used) both behave differently with this model. AutoTokenizer sometimes splits words that are in vocab and LlamaTokenizer works fine. https://huggingface.co/sarvamai/OpenHathi-7B-Hi-v0.1-Base/discussions/5