Metalhead.jl
Computer vision models for Flux
This is an implementation of [EfficientNetv2](https://arxiv.org/abs/2104.00298). There's clearly some code duplication between the EffNets and the MobileNets, but figuring out how to unify that API and its design is perhaps...
Xref #187
> **Note**
> This PR does not improve the existing textual documentation in any way; instead, it migrates the current documentation to `Documenter.jl`. See the new documentation in...
These changes will make it easier to port over pretrained weights for the models from PyTorch. - [ ] Right now, GoogLeNet matches the implementation in the paper, which does...
Convolution and BatchNorm layers have been fused during inference in many models, most notably [LeViT](https://github.com/facebookresearch/LeViT). It would be a good idea to have this as an option in the `conv_norm`...
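The algebra behind the fusion can be sketched as follows. This is a minimal illustration of folding a BatchNorm's statistics into the preceding convolution's weights and bias at inference time; the function name and array layout are illustrative, not Metalhead's actual `conv_norm` internals.

```julia
# Sketch of conv + BatchNorm fusion at inference time (illustrative names,
# not Metalhead's actual `conv_norm` internals).
#
# For y = γ .* (conv(x; W, b) .- μ) ./ sqrt.(σ² .+ ε) .+ β, the BN can be
# folded into the conv: W′ = s .* W and b′ = s .* (b .- μ) .+ β, where
# s = γ ./ sqrt.(σ² .+ ε) is a per-output-channel scale.
function fuse_conv_bn(W, b, γ, β, μ, σ², ε = 1f-5)
    s = γ ./ sqrt.(σ² .+ ε)
    # W has shape (k, k, cin, cout); scale each output-channel slice of W
    W′ = W .* reshape(s, 1, 1, 1, :)
    b′ = s .* (b .- μ) .+ β
    return W′, b′
end

# Tiny check on a 1×1 "convolution", which reduces to a per-channel affine map:
W = reshape(Float32[2.0], 1, 1, 1, 1); b = Float32[1.0]
γ = Float32[3.0]; β = Float32[0.5]; μ = Float32[0.2]; σ² = Float32[4.0]
W′, b′ = fuse_conv_bn(W, b, γ, β, μ, σ²)
x = 1.5f0
y_unfused = γ[1] * ((W[1] * x + b[1]) - μ[1]) / sqrt(σ²[1] + 1f-5) + β[1]
y_fused   = W′[1] * x + b′[1]
@assert isapprox(y_unfused, y_fused; atol = 1f-5)
```

The fused form saves the separate normalisation pass at inference, which is why the LeViT authors fuse these pairs before benchmarking.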
`DropBlock` is a type of regularisation that tries to replace dropout. The [original paper](https://arxiv.org/abs/1810.12890) describes it as best used with a linear scaling rate across blocks in a model, as...
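The linear scaling the paper suggests can be sketched as a helper that ramps the drop probability from (near) zero at the first block up to the target rate at the last; the function name below is hypothetical, not an existing Metalhead API.

```julia
# Hypothetical helper: linearly scale DropBlock's drop probability across the
# blocks of a model, ramping up to `p_max` at the final block, following the
# linear scheme suggested in the DropBlock paper.
dropblock_probs(p_max, nblocks) = [p_max * i / nblocks for i in 1:nblocks]

probs = dropblock_probs(0.1, 4)  # roughly [0.025, 0.05, 0.075, 0.1]
```

Each block `i` would then construct its `DropBlock` layer with `probs[i]` instead of sharing a single global rate.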
The current documentation format is a little weird, and Publish also exposes a lot of private functions in the API reference. There are several steps that need to be...
This modifies the current ViT API to add more options - notably, there is now an optional `prenorm/postnorm` toggle. There's also an option to remove class tokens entirely, and...
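The two orderings such a toggle would select can be sketched with placeholder functions; `norm` and `sublayer` below are toy stand-ins, not the real ViT layers.

```julia
# Illustrative contrast between the two block orderings a `prenorm/postnorm`
# toggle would select (placeholder `norm`/`sublayer` functions, not the
# actual ViT layers):
prenorm_block(x, norm, sublayer)  = x + sublayer(norm(x))   # norm before the sublayer
postnorm_block(x, norm, sublayer) = norm(x + sublayer(x))   # norm after the residual add

# Toy scalar stand-ins to show the two compositions differ:
norm(x) = x / 2
sublayer(x) = 3x
prenorm_block(4.0, norm, sublayer)   # 4 + 3 * (4 / 2) = 10.0
postnorm_block(4.0, norm, sublayer)  # (4 + 3 * 4) / 2 = 8.0
```

Pre-norm is the ordering used by the original ViT paper; post-norm matches the original Transformer, so exposing both keeps ported weights compatible with either convention.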
Given that all the high-level model APIs implement `(m::Model)(x)`, `backbone(m::Model)` and `classifier(m::Model)`, would it make sense to have an abstract type `MetalheadModel` that the models can subtype? I take it...
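A minimal sketch of what the proposal could look like; the type and struct names are the issue's suggestion plus toy stand-ins, not an existing Metalhead API, and a real model would wrap Flux chains rather than bare functions.

```julia
# Sketch of the proposed abstract supertype and the shared three-method
# interface (hypothetical names; `ToyModel` is a stand-in for a real model).
abstract type MetalheadModel end

struct ToyModel <: MetalheadModel
    backbone_fn
    classifier_fn
end

backbone(m::ToyModel)   = m.backbone_fn
classifier(m::ToyModel) = m.classifier_fn
(m::ToyModel)(x) = classifier(m)(backbone(m)(x))

m = ToyModel(x -> 2x, x -> x + 1)
m(3)                  # classifier(backbone(3)) = 2 * 3 + 1 = 7
m isa MetalheadModel  # true — downstream code can dispatch on the supertype
```

Downstream packages such as MLJFlux could then write methods against `MetalheadModel` once, instead of special-casing each concrete model type.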
Some experimentation surfaces issues when using ViT on the GPU. Documenting these so that they can be tracked down and solved: - [x] Class tokens don't work...
I'm looking at Metalhead integration in MLJFlux. To do this well, I'm looking for some uniformity in the Metalhead.jl API that seems to be lacking. In particular, it would help...