pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...

Results 237 pytorch-image-models issues

PatchMix is a novel image mixing strategy for contrastive learning that mixes multiple images at the patch level and optimizes the ViT model. code: https://github.com/visresearch/patchmix paper: https://arxiv.org/abs/2306.12243
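To make "mixing at the patch level" concrete, here is a minimal sketch of a generic patch shuffle across a batch. It is not the PatchMix algorithm from the paper: the function name `mix_patches`, the uniform random choice of source image per patch, and the returned `src` index map are all assumptions made for illustration.

```python
import torch


def mix_patches(images: torch.Tensor, patch_size: int):
    """Build mixed images by drawing each patch from a random image in the batch.

    Hypothetical helper (not the PatchMix reference code). Assumes the image
    height and width are divisible by patch_size.
    Returns the mixed batch and the (B, N) map of source image indices.
    """
    b, c, h, w = images.shape
    gh, gw = h // patch_size, w // patch_size
    n = gh * gw

    # (B, C, H, W) -> (B, N, C, p, p): one entry per patch position.
    patches = (
        images.reshape(b, c, gh, patch_size, gw, patch_size)
        .permute(0, 2, 4, 1, 3, 5)
        .reshape(b, n, c, patch_size, patch_size)
    )

    # For every output image and patch position, pick a source image.
    src = torch.randint(0, b, (b, n), device=images.device)
    pos = torch.arange(n, device=images.device).expand(b, -1)
    mixed = patches[src, pos]  # (B, N, C, p, p)

    # (B, N, C, p, p) -> (B, C, H, W): stitch the patches back together.
    out = (
        mixed.reshape(b, gh, gw, c, patch_size, patch_size)
        .permute(0, 3, 1, 4, 2, 5)
        .reshape(b, c, h, w)
    )
    return out, src
```

The `src` map records which image each patch came from, which is the kind of information a contrastive objective over mixed images would need as a soft target.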

Update the drop path schedule to adhere to the original implementation found in DINOv2. Add an efficient drop path to accelerate training. #1836 Given 40% drop rate, we can see a...

**Is your feature request related to a problem? Please describe.** While the current drop path implementation in TIMM doesn't save computation, implementing a **true** drop path that ignores unnecessary tokens...

enhancement
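The drop path request above can be illustrated with a minimal sketch of a compute-saving variant: instead of running the residual branch on the whole batch and zeroing some outputs, run it only on a random subset of samples and rescale. This follows the general idea used in DINOv2 as I understand it; the function name `drop_path_subset` and its signature are hypothetical and not part of timm.

```python
import torch


def drop_path_subset(x, residual_fn, drop_rate: float = 0.4, training: bool = True):
    """Return x + residual_fn(x), evaluating the branch on a subset of samples.

    Hypothetical helper (not a timm API). With drop_rate=0.4, roughly 40% of
    the samples skip the residual branch entirely, so its cost drops with them.
    """
    if not training or drop_rate == 0.0:
        return x + residual_fn(x)

    batch = x.shape[0]
    keep = max(int(batch * (1.0 - drop_rate)), 1)

    # Random subset of sample indices that will go through the branch.
    idx = torch.randperm(batch, device=x.device)[:keep]
    residual = residual_fn(x[idx])

    # Rescale so the expected residual matches the no-drop case.
    scale = batch / keep
    return torch.index_add(x, 0, idx, residual.to(x.dtype), alpha=scale)
```

In a transformer block, `residual_fn` would be something like the attention or MLP branch including its norm layer. The standard mask-based drop path still pays for the full forward pass, which is the inefficiency the request describes.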

Recently, we introduced [Tied-Augment](https://arxiv.org/abs/2305.13520), a simple framework that combines self-supervised learning and supervised learning by making forward passes on two augmented views of the data with tied (shared) weights....

enhancement

I encounter the following error with timm 0.9.1:
```shell
Traceback (most recent call last):
  File "download_models.py", line 63, in
    model = timm.create_model('eva02_enormous_patch14_clip_224.pretrain', pretrained=True)
  File "/home/luoxin/.local/lib/python3.8/site-packages/timm/models/_factory.py", line 114, in create_model
    model =...
```

bug

**Is your feature request related to a problem? Please describe.** In [TorchGeo](https://github.com/microsoft/torchgeo)'s unit tests, we want to avoid any large models that require a lot of memory or time to...

enhancement

## Modification 1. Add an option to fuse layerscale into the last linear layer in Mlp. Fewer elementwise operations improve AMP train/infer speed. 2. Reshape x for Mlp, which slightly improves speed....
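The first modification relies on the identity `gamma * (W x + b) = (gamma[:, None] * W) x + gamma * b`, so a per-channel layer scale can be folded into the weights of the preceding linear layer. A minimal sketch of that folding follows; the helper name `fuse_layerscale_into_linear` is hypothetical and this is not the PR's actual code.

```python
import torch
import torch.nn as nn


@torch.no_grad()
def fuse_layerscale_into_linear(linear: nn.Linear, gamma: torch.Tensor) -> nn.Linear:
    """Fold a per-output-channel scale `gamma` into a copy of `linear`.

    Hypothetical helper: fused(x) equals linear(x) * gamma, without the
    extra elementwise multiply at run time.
    """
    fused = nn.Linear(linear.in_features, linear.out_features, bias=True)
    fused = fused.to(device=linear.weight.device, dtype=linear.weight.dtype)

    # Scale each output row of the weight matrix by its gamma entry.
    fused.weight.copy_(linear.weight * gamma[:, None])

    # The bias is scaled the same way; a missing bias is treated as zeros.
    bias = linear.bias if linear.bias is not None else linear.weight.new_zeros(linear.out_features)
    fused.bias.copy_(bias * gamma)
    return fused
```

This sketch fuses after training, for inference. Fusing during training, as the PR describes, would instead mean learning the scaled weights directly, which changes the parameterization rather than just the arithmetic.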

Requesting the addition of CLIP pretrained weights for the ResNet model. This would greatly improve the performance of ResNet by allowing it to leverage the powerful representations learned by CLIP in...

enhancement

SelfPatch is a self-supervised training strategy that could be used to train a ViT. It encourages the model to learn semantic correlations among patches (better than DINO as shown...

enhancement

This PR adds support for TensorBoard to the train script. Solves https://github.com/huggingface/pytorch-image-models/issues/1716. It differs from #1719 in that the logs are saved on each update instead of each epoch, giving a...