LAVIS icon indicating copy to clipboard operation
LAVIS copied to clipboard

BLIP2 fine-tuning on custom LLM+dataset

Open visheratin opened this issue 1 year ago • 5 comments

Hi!

I want to extend BLIP2 capabilities to another language. I have a pre-trained LLM (T5 family) and a dataset with image captions. Could you please help me understand my next steps to train the model? Do I need to perform pre-training, or can I use pre-trained ViT along with my T5 model and do fine-tuning?

visheratin avatar May 23 '23 15:05 visheratin

Hey,

Looking to do a similar project. Did you ever found an answer to your question? Would love to get some clarification.

joaopedrosdmm avatar Aug 17 '23 10:08 joaopedrosdmm

嘿,

想做一个类似的项目。您找到问题的答案了吗?希望得到一些澄清。

Hello! I am also working on fine-tuning the image capture task using my own dataset, BLIP2. Have you succeeded

shams2023 avatar Nov 06 '23 12:11 shams2023

Working on the same, did anyone have any luck?

simeneide avatar Nov 13 '23 10:11 simeneide

any progress?

betterftr avatar Jan 07 '24 11:01 betterftr

Can you share code?

shams2023 avatar Mar 22 '24 09:03 shams2023