fsdp_qlora How to fine-tune a Vision Language Model (VLM)?

Open asmith26 opened this issue 5 months ago • 0 comments

Hi there,

Just wondering, does this repo support fine-tuning a Vision Language Model (VLM), e.g. https://huggingface.co/microsoft/Phi-3.5-vision-instruct?

Many thanks for any help, and for this amazing lib!

Aug 31 '24 15:08 asmith26