Okapi
Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback
I wanted to check whether you also trained RLHF models for English. If so, could you share them?
Issue Description: I've encountered an issue while downloading the English language data. Specifically, the dataset appears to include only the 52K English instructions, omitting the multilingual-ranking-data-42k and multilingual-rl-tuning-64k datasets in...
Hi! Thank you for your awesome work! I have a few questions: - I understand you have fine-tuned on each language separately: [https://huggingface.co/uonlp](https://huggingface.co/uonlp). I was curious if you had attempted...