fastai icon indicating copy to clipboard operation
fastai copied to clipboard

[WIP] Simple timm splitter funcs

Open tcapelle opened this issue 3 years ago • 3 comments

A simple method to split timm models. My main concer is that I am not createing the model_meta dict and instead at the moment when the model is created, I am constructing the splitter. I will train some models on monday. cc. @jp00

  • requires timm from master

tcapelle avatar May 06 '22 17:05 tcapelle

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

Thanks for the great PR, @tcapelle ! I'll wait to hear back from @rwightman about a pypi release of the needed functionality before we merge this.

jph00 avatar May 06 '22 20:05 jph00

I am pasting this here. Have been running a bunch of sweeps comparing this timm split vs the default_split method here: https://wandb.ai/capecape/fine_tune_timm/sweeps .

(3 epoch finetune with 1/0 epoch fit)

A non-exhaustive exploration here shows that default_split works sometimes better for very short finetuning(3 epochs).

  • resnet34 image

-regnetx_040 image

  • convnext_tiny

This model is insane!! 🚀 image

tcapelle avatar May 09 '22 16:05 tcapelle