
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT)...

Results: 196 pytorch-image-models issues

Add ViG models from the paper "Vision GNN: An Image is Worth Graph of Nodes" (NeurIPS 2022), https://arxiv.org/abs/2206.00272. Network architecture plays a key role in the deep learning-based computer vision system...

Hi, I noticed that the EMA used here is pretty slow, since it does not use in-place operations. Using in-place ops results in a ~50% faster EMA; however, it does...
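
For context, a minimal sketch of what an in-place EMA update could look like (hypothetical EmaInPlace helper shown for illustration, not timm's actual ModelEmaV2):

```python
import torch

class EmaInPlace:
    """Sketch of an exponential moving average of model weights using
    in-place ops (lerp_); assumes the EMA copy and the live model iterate
    their parameters in the same order."""

    def __init__(self, model: torch.nn.Module, decay: float = 0.9998):
        self.shadow = [p.detach().clone() for p in model.parameters()]
        self.decay = decay

    @torch.no_grad()
    def update(self, model: torch.nn.Module):
        for s, p in zip(self.shadow, model.parameters()):
            # s = decay * s + (1 - decay) * p, computed in place
            s.lerp_(p, 1.0 - self.decay)
```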

I'm trying to apply Swin V2 as a backbone for dense prediction tasks such as depth estimation or semantic segmentation. However, I found that the features_only option is unavailable on vision...

enhancement
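
For reference, a minimal sketch of how features_only behaves on a backbone that already supports it (resnet50 used as a stand-in here; Swin V2 support is what this issue asks for):

```python
import timm
import torch

# create a feature-extraction backbone that returns intermediate feature maps
backbone = timm.create_model('resnet50', pretrained=False, features_only=True,
                             out_indices=(1, 2, 3, 4))

feats = backbone(torch.randn(1, 3, 224, 224))
for f in feats:
    print(f.shape)                           # one pyramid level per out_index
print(backbone.feature_info.channels())      # channel counts of returned stages
print(backbone.feature_info.reduction())     # strides of returned stages
```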

Hello, how are you? Thanks for contributing to this project. Did you implement FocalNet in this repo? If not, could you support FocalNet in the repo ASAP? Thanks

enhancement

>>> import timm Traceback (most recent call last): File "<stdin>", line 1, in <module> File "/home/ubuntu/.local/lib/python3.11/site-packages/timm/__init__.py", line 2, in <module> from .models import create_model, list_models, is_model, list_modules, model_entrypoint, \ File "/home/ubuntu/.local/lib/python3.11/site-packages/timm/models/__init__.py...

bug

Is there a way to convert timm models for 1D inputs? I realize that a 1D tensor with shape [B,C,S] can be reshaped to [B,C,1,S] or [B,C,S,1], but then the...

enhancement
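
A minimal sketch of one common workaround, assuming a CNN backbone that tolerates a height-1 input (hypothetical Timm1dWrapper, resnet18 chosen purely for illustration):

```python
import timm
import torch
import torch.nn as nn

class Timm1dWrapper(nn.Module):
    """Feed a [B, C, S] sequence into a 2D timm model by inserting a dummy
    spatial dimension, i.e. [B, C, 1, S], as the issue describes."""

    def __init__(self, model_name: str = 'resnet18', in_chans: int = 1, num_classes: int = 10):
        super().__init__()
        self.net = timm.create_model(model_name, in_chans=in_chans, num_classes=num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # [B, C, S] -> [B, C, 1, S]; models with square patch/window assumptions
        # (ViT, Swin, ...) will generally reject this shape
        return self.net(x.unsqueeze(2))

model = Timm1dWrapper()
print(model(torch.randn(2, 1, 224)).shape)   # torch.Size([2, 10])
```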

Adding the recipe used to train each model would be a step forward in the documentation.

enhancement

Will it be possible in the future to support variable input sizes for maxvit and coatnet? I am experimenting with adapting various timm models to self-supervised learning such as...

enhancement
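
A hedged sketch of the kind of override being asked about; whether maxvit/coatnet accept an img_size argument at creation time (and how pretrained weights adapt to it) is exactly what this issue raises, so treat the call below as an assumption to verify against your timm version:

```python
import timm
import torch

# assumed: create_model forwards img_size to the model constructor and the
# window / relative-position logic is rebuilt for the new resolution
model = timm.create_model('maxvit_tiny_tf_224', pretrained=False, img_size=288)
print(model(torch.randn(1, 3, 288, 288)).shape)   # expected: torch.Size([1, 1000])
```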

I see a huge discrepancy between HuggingFace and timm in terms of the initialization of ViT. Timm's implementation uses trunc_normal, whereas HuggingFace uses "module.weight.data.normal_(mean=0.0, std=self.config.initializer_range)". I noticed this causes a...

enhancement
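
For context, a small sketch contrasting the two initialization schemes (shapes, std, and truncation bounds are illustrative, not the exact code of either library):

```python
import torch
import torch.nn as nn

std = 0.02

w_trunc = torch.empty(768, 768)
# truncated normal: values outside [a, b] are re-drawn (timm-style init)
nn.init.trunc_normal_(w_trunc, mean=0.0, std=std, a=-2 * std, b=2 * std)

w_plain = torch.empty(768, 768)
w_plain.normal_(mean=0.0, std=std)   # plain normal (HF-style initializer_range init)

# the truncated variant has no extreme outliers; the plain normal occasionally does
print(w_trunc.abs().max().item())    # <= 0.04
print(w_plain.abs().max().item())    # can exceed 0.04
```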