EVA
EVA copied to clipboard
Does intermediate fine-tuning on IN21K improve downstream performance?
For example, Merged-38M MIM pretraining -> IN21K finetuning -> downstream task finetuning
On ADE20K, IN-21K intermediate finetuning slightly degenerates the performance.