Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection
Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection copied to clipboard
test AUC
Hello, I trained a total of 110000 real images and 100000 fake images using dfdc and ff++, but the final test only achieved an AUC of 0.885. Can you give me some suggestions. Thank you.
Hi, which version of the model are you using?
efficient-vit This is the model i'm using.
The EfficientViT obtain an AUC of 0.919 which is not too far from yours. If I understand well, your training set is not complete so it is normal to obtain different results. Anyway, if you want to improve more, I suggest to use Cross Efficient ViT which is our main method.