RL4LMs
BLOOM support
The repository uses transformers version 4.18, which does not support BLOOM. Is there any way to use BLOOM as the initial policy for training?
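To make the version gap concrete, here is a minimal sketch of the check involved. It assumes BLOOM first shipped in transformers 4.20.0, which is not stated in this thread and should be verified against the transformers release notes. The helper names (`supports_bloom`, `installed_transformers_version`) are hypothetical, and nothing here shows that RL4LMs itself works with a newer transformers release.

```python
import re
from importlib import metadata
from typing import Optional, Tuple

# Assumption (not from this thread): BLOOM was added in transformers 4.20.0.
BLOOM_MIN_VERSION = (4, 20)


def parse_major_minor(version: str) -> Tuple[int, int]:
    """Extract (major, minor) from a version string such as '4.18.0'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"Unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def supports_bloom(version: str) -> bool:
    """Return True if the given transformers version is at or above the assumed minimum."""
    return parse_major_minor(version) >= BLOOM_MIN_VERSION


def installed_transformers_version() -> Optional[str]:
    """Return the installed transformers version, or None if it is not installed."""
    try:
        return metadata.version("transformers")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    installed = installed_transformers_version()
    if installed is None:
        print("transformers is not installed")
    else:
        print(f"transformers {installed}: BLOOM supported = {supports_bloom(installed)}")
```

Under that assumption, the pinned 4.18 release fails the check, so using BLOOM would mean upgrading transformers in the environment. Whether the rest of the repository runs unchanged on a newer release is the open part of the question.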