RLAIF-V
RLAIF-V copied to clipboard
The LoRA training codes and scripts
A significant achievement in aligning Vision-Language Models!
While running the code 'RLAIF-V/muffin/train/train_llava15.py', I noticed that all model parameters are trainable. Due to hardware limitations, could you kindly provide the LoRA training codes, similar to LLaVA?