LLaMA-Factory icon indicating copy to clipboard operation
LLaMA-Factory copied to clipboard

Add Multimodal LLM Finetuning

Open BUAADreamer opened this issue 10 months ago • 0 comments

What does this PR do?

Add finetuning Multimodal-LLM by leveraging AutoModelForVision2Seq and AutoProcessortransformers

This PR is working in progress, need improvement in the future

TODO

  • [ ] LLaVA
  • [ ] Instruct-BLIP

Before submitting

BUAADreamer avatar Apr 23 '24 11:04 BUAADreamer