segmentation_models.pytorch icon indicating copy to clipboard operation
segmentation_models.pytorch copied to clipboard

issue generating models with mix vision transformer encoders

Open K-D-Gallagher opened this issue 2 years ago • 6 comments

I noticed that Mix Vision Transformer encoders are now listed as available. I was trying to generate a unet model with one of these encoders, but have been getting an error that it's not a supported encoder. I tried mit_b0, 3, and 5. Is there a particular prefix to use with these, like with the timm encoders? Thanks.

K-D-Gallagher avatar Aug 26 '22 19:08 K-D-Gallagher

Hi, it is not released to pypi, please install latest version from github repo to use these encoders

qubvel avatar Aug 26 '22 19:08 qubvel

$ pip install -U git+https://github.com/qubvel/segmentation_models.pytorch

qubvel avatar Aug 26 '22 19:08 qubvel

Ah yes it worked, thank you so much! Looking forward to seeing how these encoders perform

K-D-Gallagher avatar Aug 26 '22 20:08 K-D-Gallagher

Let me know if you find it useful for your task)

qubvel avatar Aug 26 '22 20:08 qubvel

It definitely boosted performance a bit! Maybe ~2% over my previously best optimized architecture/encoder combo. Thanks!

K-D-Gallagher avatar Aug 30 '22 03:08 K-D-Gallagher

Sounds great! Will try to move to PyPI release soon!

qubvel avatar Aug 30 '22 15:08 qubvel

This issue is stale because it has been open 60 days with no activity. Remove stale label or comment or this will be closed in 7 days.

github-actions[bot] avatar Oct 30 '22 02:10 github-actions[bot]

This issue was closed because it has been stalled for 7 days with no activity.

github-actions[bot] avatar Nov 07 '22 02:11 github-actions[bot]