Lorenzo Baraldi issues

Results 4 issues of


                                            Lorenzo Baraldi

[FEATURE] BEIT pre-training model

**Is your feature request related to a problem? Please describe.** There is no problem or bug **Describe the solution you'd like** I would like the implementation of BEIT pre-training pipeline...

enhancement

Gradient accumulation included into training script

Added parameter 'iters_to_accumulate' to perform [gradient accumulation](https://pytorch.org/docs/stable/notes/amp_examples.html#working-with-scaled-gradients) during training.

Information about implementation of BeiTv2

**Describe** Hi, I would like to know if layer scale is used (at 0.1) in finetuning BeiTv2 on the classification task. From the code point of view it seems that...

Request for clarification on implementation

Hi, after reading your paper and studying the code, I don't understand why VisionTransformerForMaskedImageModeling have two implementations of the encoder (respectively encoder and teacher model). Why is it not possible...