JackieWu
Hi @gudrb , thanks for your interest in our work! In Mini-DeiT, the transformation for the MLP is the relative position encoding: https://github.com/microsoft/Cream/blob/4a13c4091e78f9abd2160e7e01c02e48c1cf8fb9/MiniViT/Mini-DeiT/mini_vision_transformer.py#L117 In Mini-Swin, the transformation for the MLP is the...
> On the MiniViT paper,
>
> We make several modifications on DeiT: First, we remove the [class] token. The model is attached with a global average pooling layer and...
Hi @gudrb , The following code creates a list of `repeated_times` LayerNorm layers. https://github.com/microsoft/Cream/blob/4a13c4091e78f9abd2160e7e01c02e48c1cf8fb9/MiniViT/Mini-DeiT/mini_vision_transformer.py#L145-L146 `RepeatedModuleList` selects the `self._repeated_id`-th LayerNorm for the forward pass. https://github.com/microsoft/Cream/blob/4a13c4091e78f9abd2160e7e01c02e48c1cf8fb9/MiniViT/Mini-DeiT/mini_vision_transformer.py#L28-L29 In `RepeatedMiniBlock`,...
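To make the idea concrete, here is a minimal pure-Python sketch (not the repo's actual PyTorch implementation — the toy "modules" and factory below are illustrative stand-ins for per-repeat LayerNorms):

```python
class RepeatedModuleList:
    """Sketch of the idea: hold one sub-module per repeat of a
    weight-shared block, and forward through the sub-module selected
    by the current repeat index (`_repeated_id`)."""

    def __init__(self, modules):
        self.modules = list(modules)
        self._repeated_id = 0  # set externally before each forward pass

    def __call__(self, x):
        # dispatch to the sub-module belonging to the current repeat
        return self.modules[self._repeated_id](x)


# toy sub-modules standing in for per-repeat LayerNorms:
# each one adds a distinct per-repeat offset k
norms = [lambda x, k=k: [v + k for v in x] for k in range(3)]
rml = RepeatedModuleList(norms)

rml._repeated_id = 0
print(rml([1, 2]))  # → [1, 2]
rml._repeated_id = 2
print(rml([1, 2]))  # → [3, 4]
```

The point is that the heavy weights of the block are shared across repeats, while each repeat keeps its own lightweight normalization parameters.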
Hi @gudrb , here is where the weight transformation is applied: https://github.com/microsoft/Cream/blob/4a13c4091e78f9abd2160e7e01c02e48c1cf8fb9/MiniViT/Mini-DeiT/mini_vision_transformer.py#L103-L109
In Equation 7, we ignore the relative position encoding. iRPE is only applied in Mini-DeiT.
Sorry for the late reply. I have fixed it in https://github.com/wkcn/LookaheadOptimizer-mx/commit/d36ac1d9b4c37e28e7c48120c0c67c8a2b220ddd Thank you for reporting it : )
Thank you for pointing it out! This implementation doesn't reset the momentum in the outer loop. I will try to fix it.
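For context, a minimal pure-Python sketch of Lookahead's outer-loop update (slow weights pulled toward fast weights every k inner steps); the function name and the momentum-reset note are illustrative, not the repo's API:

```python
def lookahead_outer_step(slow, fast, alpha=0.5):
    """One Lookahead outer-loop update (sketch).

    slow <- slow + alpha * (fast - slow), then the fast weights are
    reset to the updated slow weights.
    """
    new_slow = [s + alpha * (f - s) for s, f in zip(slow, fast)]
    # fast weights restart from the updated slow weights
    new_fast = list(new_slow)
    # NOTE: the inner optimizer's momentum buffers may also need to be
    # reset at this point -- the detail discussed in the issue above.
    return new_slow, new_fast


slow, fast = [0.0, 0.0], [1.0, 2.0]
slow, fast = lookahead_outer_step(slow, fast)
print(slow)  # → [0.5, 1.0]
```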
Hi @WeiSQ-zju , thanks for your interest in our work! 1. The master weight is an FP16 tensor with a scaling factor. It will be converted to the weight, which...
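A minimal stdlib-only sketch of the "FP16 data + scaling factor" idea, using `struct`'s half-precision format `'e'` to emulate FP16 storage (the `ScaledFP16` class and its method names are hypothetical, not MS-AMP's actual API):

```python
import struct


def to_fp16(x):
    # round-trip through IEEE 754 half precision via struct's 'e' format
    return struct.unpack('e', struct.pack('e', x))[0]


class ScaledFP16:
    """Sketch: store value / scale quantized to FP16, plus the scale.

    The scale keeps the stored FP16 payload inside half-precision range
    (plain FP16 overflows above ~65504).
    """

    def __init__(self, value, scale):
        self.scale = scale
        self.data = to_fp16(value / scale)

    def to_float(self):
        # convert the scaled FP16 payload back to a full-precision weight
        return self.data * self.scale


# 70000 would overflow plain FP16, but fits once divided by the scale
w = ScaledFP16(70000.0, scale=16.0)
print(w.to_float())
```

The conversion back is lossy only up to FP16's precision at the scaled magnitude, which is the trade-off the scaling factor manages.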
@WeiSQ-zju Sorry for the late reply. > That's to say the overflow ratio of g'i is less than 0.001%, but when N is large, will the overflow ratio of g is...
Thanks for your interest in our work! We will add support for MS-AMP in FSDP : )