Which distributed framework was used to train the BLIP-2 ViT-G FlanT5xxl model?

Open qiao1025566574 opened this issue 1 year ago • 0 comments

The "BLIP-2 ViT-G Flan-T5-XXL" model has 12.1B parameters. How do you load it into CUDA memory? Do you use a technique such as model sharding, or a framework such as DeepSpeed?
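
For context, here is a rough back-of-the-envelope sketch of why I am asking. It only estimates the memory needed to hold the weights, using the 12.1B figure above and the standard byte sizes of common dtypes. The helper name `weight_memory_gb` is my own, and the estimate ignores activations, gradients, and optimizer state, which would add considerably more during training.

```python
# Bytes needed to store one parameter in each common dtype.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Return the memory needed for the weights alone, in GB (10^9 bytes).

    Hypothetical helper for illustration; not part of LAVIS.
    """
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


NUM_PARAMS = 12.1e9  # parameter count quoted in this issue

for dtype in BYTES_PER_PARAM:
    # Weights only: no activations, gradients, or optimizer state.
    print(f"{dtype}: {weight_memory_gb(NUM_PARAMS, dtype):.1f} GB")
```

Even in half precision the weights alone come to roughly 24 GB by this estimate, which is why I am wondering whether the model was sharded across devices.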

qiao1025566574 Mar 05 '23 15:03