dinov2
dinov2 copied to clipboard
Loss doesn't go down.
Take 5s video segments form hundreds of videos, each 5s video segment takes 10 frames of images to train DINOv2 from the beginning, the input tensor shape of the model is [B,3,10,H,W], the batchsize is set to 12, run on 4 A100 (80GB) GPUs, the training parameter defaults to /configs/train/ vitl14.yaml.
Same here, did you finally solve that issue?