SparK icon indicating copy to clipboard operation
SparK copied to clipboard

a small model such as Mobilenet v2 for pre-training

Open mmmz28 opened this issue 1 year ago • 3 comments

Thank you for your excellent work. Replacing the transformer with CNN does make deployment more friendly. Furthermore, I'm wondering if using a smaller model such as Mobilenet v2 for pre-training and then fine-tuning downstream would be effective?

mmmz28 avatar Apr 13 '23 02:04 mmmz28

Thank you and we agree that this could be of general interest and value. We will consider running SparK on mobilenet recently (perhaps v2 and v3), or you can try it out too. (see tutorial at https://github.com/keyu-tian/SparK/tree/main/pretrain#tutorial-for-pretraining-your-own-cnn-model).

keyu-tian avatar Apr 19 '23 12:04 keyu-tian

@keyu-tian Can I use swinv2-base as the backbone for pre-training?

xylcbd avatar May 09 '23 07:05 xylcbd

@xylcbd sorry but SparK is not suitable for this. Our SparK can pretrain any CNN model but swinv2 is a transformer. Maybe you can use MAE or SimMIM to pretrain swin transformer.

keyu-tian avatar May 12 '23 10:05 keyu-tian