Feilong Chen

Results 7 comments of Feilong Chen
trafficstars

I guess you could check the pytorch version. The code is tested on v0.3.0.

I have modified the code for pretraining from scratch and do not load a pre-trained blip2 checkpoint. But the loss is still difficult to reduce. I just changed the part...

``` \\ blip2_qformer load_finetuned = cfg.get("load_finetuned", True) load_pretrained = cfg.get("load_pretrained", True) if load_finetuned or load_pretrained: model.load_checkpoint_from_config(cfg) return model ``` stage 1 logs for 3m images {"train_lr": "0.000", "train_loss": "7.367"} {"train_lr":...

Thanks. Could you tell me how much loss is a good convergence of stage 1 and stage 2? When I train the model for 10 epochs. the loss seems still...

OK. I will try again and give back the results. Thank you for your patience.

Yes. Thanks for the authors' excellent works. In the process of trying to train from scratch, I found that the language model I used at the beginning was too small...