LAVIS
About ITM loss
Thank you for your code! When I reproduce the stage 1 training, I find that the ITM loss does not converge. Is this normal, or is there any trick? (Note: I replaced BERT with an XLM-R model.)
No trick here. You may try a lower learning rate, or use cleaner datasets.
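For reference, here is a minimal sketch of lowering the pretraining learning rate by editing a LAVIS-style run config programmatically. The config path and the exact key names (`run.init_lr`, `run.warmup_lr`, `run.min_lr`) are assumptions based on the usual LAVIS pretrain YAMLs, so check them against the config you actually use:

```python
# Sketch: lower the peak LR in a LAVIS-style pretrain config.
# Path and key names are assumptions; verify against your own YAML.
from omegaconf import OmegaConf

cfg = OmegaConf.load("lavis/projects/blip/train/pretrain_14m.yaml")  # hypothetical path

# Drop the peak LR by roughly 3-5x and keep a gentle warmup,
# so the ITM/LM heads do not diverge early in stage 1.
cfg.run.init_lr = 1e-5
cfg.run.warmup_lr = 1e-6
cfg.run.min_lr = 1e-6

OmegaConf.save(cfg, "pretrain_lower_lr.yaml")
```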
When I pretrain BLIP on a Chinese dataset, I also ran into this issue: the ITM and LM losses do not converge. Have you solved this problem? @qibao77
@qibao77 We use a customised implementation for the mixture of encoder-decoder (med.py) model. It has a different architecture from that of BERT, even though it is initialized from BERT weights. If XLM-R is used, a customised implementation is needed as well.
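To illustrate the point (this is a rough sketch, not the actual med.py code): the MED-style text block interleaves cross-attention to image features inside each layer, and the ITM head classifies the resulting multimodal [CLS] state. A vanilla BERT/XLM-R stack has no such cross-attention, so image information never reaches the ITM head. The class and dimensions below are illustrative only:

```python
# Rough sketch of an MED-style text block with image cross-attention
# and an ITM head on the multimodal [CLS] token. Not the LAVIS code.
import torch
import torch.nn as nn

class MEDStyleBlock(nn.Module):
    def __init__(self, dim=768, heads=12):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)  # absent in plain BERT/XLM-R
        self.ffn = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(), nn.Linear(4 * dim, dim))
        self.norm1, self.norm2, self.norm3 = (nn.LayerNorm(dim) for _ in range(3))

    def forward(self, text, image):
        text = self.norm1(text + self.self_attn(text, text, text)[0])
        text = self.norm2(text + self.cross_attn(text, image, image)[0])  # inject image features
        return self.norm3(text + self.ffn(text))

itm_head = nn.Linear(768, 2)          # binary match / no-match classifier

text = torch.randn(4, 30, 768)        # toy text hidden states
image = torch.randn(4, 197, 768)      # toy ViT patch features
fused = MEDStyleBlock()(text, image)
logits = itm_head(fused[:, 0])        # ITM logits from the multimodal [CLS]
```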
@chenyzh28 My ITM loss has converged, but the model still doesn't seem to work well, and I'm still looking into it.
@qibao77 It seems that the difference between Chinese and English BERT did cause my problem. I lowered the learning rate, and the ITM and LM losses are currently converging normally. I tested the model with the ITM score and it outperforms ITC significantly. I hope that helps you.
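For context, the usual way this comparison is done in BLIP-style retrieval is to rank all candidates by the cheap ITC similarity and then rerank the top-k with the (more expensive) ITM head. A minimal sketch, assuming normalized ITC embeddings and a placeholder `itm_score(text_idx)` that stands in for the real multimodal forward pass:

```python
# Sketch: ITC shortlisting followed by ITM re-ranking.
# `itm_score` is a placeholder for the model's matching logit.
import torch

def rerank_with_itm(image_feat, text_feats, itm_score, k=16):
    sims = text_feats @ image_feat                   # ITC cosine similarities, shape (num_texts,)
    topk = sims.topk(k).indices                      # cheap ITC shortlist
    itm = torch.stack([itm_score(i) for i in topk])  # expensive ITM pass only on the shortlist
    return topk[itm.argsort(descending=True)]        # final ranking by ITM score

# Toy usage with random features and a dummy ITM scorer.
img = torch.nn.functional.normalize(torch.randn(256), dim=0)
txts = torch.nn.functional.normalize(torch.randn(1000, 256), dim=-1)
dummy_itm = lambda idx: torch.randn(())              # stand-in for the real multimodal head
print(rerank_with_itm(img, txts, dummy_itm, k=8))
```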
Can you please share more details here, @chenyzh28?
By Chinese BERT, do you mean that you changed the vocab, or something else? What new learning rate did you use to make it converge?