pytorch-image-models
pytorch-image-models copied to clipboard
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...
PatchMix is a novel image mix strategy for contrastive learning, which mixes multiple images in patch level and optimize the ViT model. code: https://github.com/visresearch/patchmix paper: https://arxiv.org/abs/2306.12243
Update the drop path schedule adheres to the original implementation found in DINOv2. Add an efficient drop path to accelerate training. #1836 Given 40% drop rate, we can see a...
**Is your feature request related to a problem? Please describe.** While current drop path implementation in TIMM doesn't save computation resources, implementing a **true** drop path that ignores unnecessary tokens...
Recently, we introduced [Tied-Augment](https://arxiv.org/abs/2305.13520), a simple framework that combines self-supervised learning learning and supervised learning by making forward passes on two augmented views of the data with tied (shared) weights....
I encounter following error with timm 0.9.1 ```shell Traceback (most recent call last): File "download_models.py", line 63, in model = timm.create_model('eva02_enormous_patch14_clip_224.pretrain', pretrained=True) File "/home/luoxin/.local/lib/python3.8/site-packages/timm/models/_factory.py", line 114, in create_model model =...
**Is your feature request related to a problem? Please describe.** In [TorchGeo](https://github.com/microsoft/torchgeo)'s unit tests, we want to avoid any large models that require a lot of memory or time to...
## Modification 1. Add an option to fuse layerscale into last linear in Mlp. Less elementwise operations improves amp train/infer speed. 2. Reshape x for Mlp, which slightly improves speed....
Requesting the addition of CLIP pretrained weights to ResNet model. This would greatly improve the performance of ResNet by allowing it to leverage the powerful representations learned by CLIP in...
SelfPatch is a wonderful self-supervised training strategy which could be used to train a VIT, it encourages the model to learn semantic correlations among patches (better than DINO as shown...
This PR adds support for Tensorboard to the train script. solves https://github.com/huggingface/pytorch-image-models/issues/1716. It differs from #1719 by that the logs are saved each update instead of each epoch. giving a...