SparK a small model such as Mobilenet v2 for pre-training

Open mmmz28 opened this issue 1 year ago • 3 comments

Thank you for your excellent work. Replacing the transformer with a CNN does make deployment easier. I'm also wondering whether pre-training a smaller model such as MobileNet V2 and then fine-tuning it downstream would be effective?

Apr 13 '23 02:04 mmmz28

Thank you. We agree that this could be of general interest and value. We will consider running SparK on MobileNet soon (perhaps V2 and V3), or you can try it out yourself (see the tutorial at https://github.com/keyu-tian/SparK/tree/main/pretrain#tutorial-for-pretraining-your-own-cnn-model).

Apr 19 '23 12:04 keyu-tian

@keyu-tian Can I use swinv2-base as the backbone for pre-training?

May 09 '23 07:05 xylcbd

@xylcbd Sorry, but SparK is not suitable for this. SparK can pre-train any CNN model, but SwinV2 is a transformer. You could use MAE or SimMIM to pre-train a Swin Transformer instead.

May 12 '23 10:05 keyu-tian
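
For anyone trying the MobileNet V2 route discussed above, it may help to work out the multi-scale feature map shapes and the size of the patch mask before writing any model code. The sketch below is plain Python and is not SparK's code: the helper names `feature_map_shapes` and `random_patch_mask` are hypothetical, the stage list is one common choice of MobileNet V2 taps (the last layer at each of strides 4, 8, 16, and 32, before the final 1x1 convolution), and the 0.6 mask ratio is an assumed default rather than a value stated in this thread.

```python
import random
from typing import List, Tuple

# (stride, channels) of the last MobileNet V2 layer at each resolution.
# One common choice of hierarchical taps; an assumption, not SparK's config.
MOBILENET_V2_STAGES: List[Tuple[int, int]] = [(4, 24), (8, 32), (16, 96), (32, 320)]


def feature_map_shapes(
    input_size: int, stages: List[Tuple[int, int]] = MOBILENET_V2_STAGES
) -> List[Tuple[int, int, int]]:
    """Return (channels, height, width) per stage for a square input image."""
    downsample_ratio = stages[-1][0]  # overall stride of the backbone
    if input_size % downsample_ratio != 0:
        raise ValueError(
            f"input_size {input_size} must be divisible by {downsample_ratio}"
        )
    return [(c, input_size // s, input_size // s) for s, c in stages]


def random_patch_mask(
    input_size: int,
    downsample_ratio: int = 32,
    mask_ratio: float = 0.6,  # assumed value, adjust to your setup
    seed: int = 0,
) -> List[List[bool]]:
    """Build a boolean grid at the coarsest resolution; True = patch stays visible."""
    side = input_size // downsample_ratio
    n_patches = side * side
    n_keep = round(n_patches * (1 - mask_ratio))
    rng = random.Random(seed)
    keep = set(rng.sample(range(n_patches), n_keep))
    return [[(r * side + c) in keep for c in range(side)] for r in range(side)]


if __name__ == "__main__":
    for channels, h, w in feature_map_shapes(224):
        print(f"{channels:4d} channels at {h}x{w}")
    mask = random_patch_mask(224)
    print("visible patches:", sum(sum(row) for row in mask), "of", len(mask) ** 2)
```

The mask is drawn on the coarsest grid (7x7 for a 224x224 input) so that it can be upsampled consistently to every finer stage. A transformer backbone such as SwinV2 does not fit this scheme, which matches the reply above.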
