Shashank Gupta
Hi, I am trying out this model on my custom dataset, which has the following frequency distribution of class labels: 7: 23849, 0: 15159, 1: 6445, 4: 5759, 5: 3969,...
### System Info - `transformers` version: 4.44.2 - Platform: Linux-5.15.0-1068-aws-x86_64-with-glibc2.31 - Python version: 3.9.19 - Huggingface_hub version: 0.24.7 - Safetensors version: 0.4.5 - Accelerate version: 0.34.2 -...
Hi, I am running the model on ImageNet data from scratch, using the best config from the paper (small encoder and large decoder), training on a cluster of 56 A100...