fsdp_qlora icon indicating copy to clipboard operation
fsdp_qlora copied to clipboard

How to fine-tune a Vision Language Model (VLM)?

Open asmith26 opened this issue 5 months ago • 0 comments

Hi there,

Just wondering, does this repo support fine-tuning a Vision Language Model (VLM), e.g https://huggingface.co/microsoft/Phi-3.5-vision-instruct?

Many thanks for any help, and for this amazing lib!

asmith26 avatar Aug 31 '24 15:08 asmith26