mlp-mixer-pytorch

Why are dim, token_dim, and channel_dim initialized to 512, 256, and 2048?

Open: jiantenggei opened this issue 3 years ago • 1 comment

jiantenggei, Jun 10 '21 07:06

One reason could be that computer and GPU algorithms are generally optimized for sizes that are powers of 2, because such sizes let them use memory access most efficiently.
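To make the power-of-2 point concrete, here is a minimal sketch that checks the three values from the question. The helper `is_power_of_two` and the dict `mixer_dims` are names made up for illustration and are not part of mlp-mixer-pytorch; the check only confirms that the numbers are powers of 2, not that this is why the author chose them.

```python
def is_power_of_two(n: int) -> bool:
    """Return True if n is a positive integer power of two."""
    # A power of two has exactly one bit set, so n & (n - 1) clears it to zero.
    return n > 0 and (n & (n - 1)) == 0


# Values taken from the question; the dict itself is hypothetical.
mixer_dims = {"dim": 512, "token_dim": 256, "channel_dim": 2048}

for name, value in mixer_dims.items():
    # For an exact power of two, bit_length() - 1 is its exponent.
    exponent = value.bit_length() - 1
    print(f"{name} = {value} = 2**{exponent}: {is_power_of_two(value)}")
```

All three values pass the check, and channel_dim also happens to be 4 times dim while token_dim is half of it.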

deepsworld, Apr 01 '22 18:04