bkuster comments

Repositories
Issues
Comments

Results 2 comments of


                                            bkuster

[Question] Anyone could explain to me what does pretrain_mm_mlp_adapter means in lora file? 请问pretrain_mm_mlp_adapter是干什么用的啊？

(this is speculation/my understanding, not 100% accurate answer) 1) The "pretrain_mlp_adapter" is the file for the multi-layer perceptron weights. (the output tokens of the CLIP encoder are converted into "visual"...

Wondering whether CogVLM2 supports SFT for multi-image QA in a sample

As a hack, you can try "merging" several images into 1 image, but you'd probably have to finetune to model a bit.