LAVIS
Number of pre-training parameters for BLIP-2
In the BLIP-2 paper, Table 1 and Table 2 both report zero-shot results. Why does the number of trainable parameters differ between them? Table 1 lists 188M and Table 2 lists 107M. Aren't all of the trainable parameters in the Q-Former?