Experimental setup for the Low-Shot Frozen Evaluation (Table 7)
Hey, I was wondering how many optimization steps (or epochs) and what batch size were used for the Low-Shot Frozen Evaluation experiment (Table 7 in the V-JEPA paper).
Do any other hyperparameters differ from the experiments that use the full training set?