mobilevit-pytorch icon indicating copy to clipboard operation
mobilevit-pytorch copied to clipboard

A PyTorch implementation of "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer"

Results 14 mobilevit-pytorch issues
Sort by recently updated
recently updated
newest added

@chinhsuanwu , thank you so much for uploading the code. * There are small issues regarding the structure of model that you've designed, I have updated the code accordingly in...

Hi @chinhsuanwu, Could you please update your model by adding initialisation code? And apple has released the [weights](https://github.com/apple/ml-cvnets/blob/main/docs/source/en/general/README-model-zoo.md) for MobileViTs, could you also please add option to load pre-trained weights?...

I test my code using pretrained mobilevit_xs and pretrained HRNet(Deep High-Resolution Representation Learning for Visual Recognition) by my dataset. I checked that mobilevit_xs have less x8 parameters than HRNet and...

If I want to use Mobilevit in Cifar10, how should I set the parameters? Because I changed the input size, but the parameters don't match. For example: --------------------------------------------------------------------------------------------------------------------------------------- EinopsError: Shape...

I don't think mobile just have less parameters.

Thanks for sharing your code! The inference speed of mobilevit is 40 ms/img. How to reproduce the speed in Table3?

If you have, could you share the result of the experiments? Anyway, thank you for your sharing the clear implementation!

In my test, mobilevit_xs process one image will cost 113ms, which is much larger than the value in the paper.