Number of pre-training parameters for BLIP2

Open Pefect96 opened this issue 2 years ago • 1 comments

In the BLIP-2 paper, Table 1 and Table 2 both report zero-shot results. Why is the number of trainable parameters different? Table 1 lists 188M and Table 2 lists 107M. Aren't all of the trainable parameters in the Q-Former?
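For anyone who wants to check these counts locally, here is a minimal sketch of how trainable parameters are usually counted. It assumes PyTorch-style parameter objects (each exposing `requires_grad` and `numel()`, as yielded by `model.parameters()`). `FakeParam` and `count_trainable` are hypothetical names used only so the snippet runs without PyTorch installed; they are not part of the BLIP-2 code, and the numbers are toy values, not the ones from the paper.

```python
from dataclasses import dataclass


@dataclass
class FakeParam:
    """Hypothetical stand-in for a torch.nn.Parameter (illustration only)."""
    size: int
    requires_grad: bool

    def numel(self) -> int:
        # Mirrors the PyTorch tensor method that returns the element count.
        return self.size


def count_trainable(params) -> int:
    """Sum element counts over parameters that require gradients."""
    return sum(p.numel() for p in params if p.requires_grad)


# Toy example: one frozen tensor and two trainable ones.
toy_params = [
    FakeParam(size=1_000, requires_grad=False),
    FakeParam(size=300, requires_grad=True),
    FakeParam(size=50, requires_grad=True),
]
trainable = count_trainable(toy_params)
print(f"trainable parameters: {trainable}")
```

With a real model, the same function could be called as `count_trainable(model.parameters())`, once for each training setup, to see which modules account for the difference.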

Aug 28 '23 12:08 Pefect96