Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback

Results 3 Okapi issues

I wanted to check whether you also trained RLHF models for English. If so, could you share those models?

Issue Description: I've encountered an issue while downloading the English language data. Specifically, the dataset appears to include only the 52K English instructions, omitting the multilingual-ranking-data-42k and multilingual-rl-tuning-64k datasets in...

Hi! Thank you for your awesome work! I had a few doubts:
- I understand you have finetuned on all languages separately: [https://huggingface.co/uonlp](https://huggingface.co/uonlp). I was curious if you had attempted...