pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT)...
To support a third-party NPU backend in timm, here is a PR opened for compatibility. Note: we can specify a config.yaml as the value of the 'config' variable to activate a third-party backend:...
A big WIP, pushing early to resolve masking stability issues with F.sdpa
Dear all, when trying to perform Quantization Aware Training (QAT), modules are wrapped with a [QuantWrapper](https://pytorch.org/docs/stable/generated/torch.ao.quantization.QuantWrapper.html). But, because some models implement `qkv` with biases using `torch.nn.functional`, one has...
Currently, timm supports different image sizes at test time for ViT with absolute position encoding, but ViT with relative position encoding is not supported. However, these models with relative position...
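The absolute-position case works because the learned position-embedding grid can be resized to match the new token grid. timm's own implementation does this with torch interpolation; as a rough illustration of the idea only (not timm's code, and the function name here is made up), a minimal numpy sketch of bilinearly resizing a flattened position embedding:

```python
import numpy as np

def resize_abs_pos_embed(pos_embed, old_size, new_size):
    """Bilinearly resize a flattened (old_size*old_size, dim) absolute
    position embedding to (new_size*new_size, dim).

    Illustrative sketch only -- timm's real version interpolates with
    torch and also handles class/register prefix tokens.
    """
    dim = pos_embed.shape[-1]
    grid = pos_embed.reshape(old_size, old_size, dim)
    # target sample coordinates in the source grid
    ys = np.linspace(0, old_size - 1, new_size)
    xs = np.linspace(0, old_size - 1, new_size)
    y0 = np.floor(ys).astype(int); y1 = np.minimum(y0 + 1, old_size - 1)
    x0 = np.floor(xs).astype(int); x1 = np.minimum(x0 + 1, old_size - 1)
    wy = (ys - y0)[:, None, None]   # row interpolation weights
    wx = (xs - x0)[None, :, None]   # column interpolation weights
    top = grid[y0][:, x0] * (1 - wx) + grid[y0][:, x1] * wx
    bot = grid[y1][:, x0] * (1 - wx) + grid[y1][:, x1] * wx
    out = top * (1 - wy) + bot * wy
    return out.reshape(new_size * new_size, dim)
```

Relative position encodings (e.g. relative position bias tables) index offsets between token pairs, so changing the grid size changes the set of offsets needed, which is why they need separate resizing logic.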
Hi, I found a typographical error in train.py at line 628, where 'pipeiine' should be 'pipeline': https://github.com/huggingface/pytorch-image-models/blob/b996c1a0f5068e7f5dfe69429e59e873536754c9/train.py#L628
Both are pyramid networks and can be used for multi-scale feature extraction, but to my knowledge do not support it like similar architectures such as PVT or Swin.
I've trained Vision Transformer (ViT) models, small and large, with DINOv2 pretrained weights from [Facebook](https://github.com/facebookresearch/dinov2) (vit_small_patch14_reg4_dinov2.lvd142m) and timm (dinov2_vits14_reg_lc). The timm version underperforms, as seen in feature and attention map,...
**Is your feature request related to a problem? Please describe.** Evaluating potential models is not only about performance but also licensing, e.g. whether a model can be used commercially. Therefore, it...
**Is your feature request related to a problem? Please describe.** I am building a library called [mmit](https://github.com/abcamiletto/mmit) to automatically build a decoder for any timm encoder. Due to the need...
Add Meta's ImageBind: "ImageBind: One Embedding Space To Bind Them All" https://github.com/facebookresearch/ImageBind We would implement the embeddings for the image modality.