Calmepro777
> Did you find a solution? I am resizing all images to 224×224, but that seems a bit fishy, especially with the varying aspect ratios. That was how I tackle...
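One alternative to stretching every image to 224×224 is to resize the shorter side to 224 while keeping the aspect ratio, then take a square center crop. As far as I know this is also roughly what the `preprocess` transform returned by `clip.load` does, but treat that as an assumption and check it against your installed version. The helper name `resize_then_center_crop_box` below is hypothetical; it is only a sketch of the geometry, not code from the repository.

```python
def resize_then_center_crop_box(width, height, size=224):
    """Return ((new_w, new_h), (left, top, right, bottom)) for an
    aspect-preserving resize followed by a square center crop."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    # Scale so the shorter side equals `size`; the longer side keeps the ratio.
    if width <= height:
        new_w, new_h = size, int(size * height / width)
    else:
        new_w, new_h = int(size * width / height), size
    # Center the square crop window inside the resized image.
    left = (new_w - size) // 2
    top = (new_h - size) // 2
    return (new_w, new_h), (left, top, left + size, top + size)


# A 640x480 image keeps its 4:3 ratio after resizing; only the sides are cropped.
print(resize_then_center_crop_box(640, 480))
```

The crop discards content at the edges of the longer side, so very elongated images may lose the object of interest; padding to a square is another option with a different trade-off.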
> > > > I use the code in the `README.md (Zero-Shot Prediction)` to test the accuracy of ViT-B/32 on the CIFAR100 dataset and get a result of about 62%...
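For anyone comparing numbers, it may help to pin down what "accuracy" means here: the predicted class is the prompt whose text embedding has the highest cosine similarity with the image embedding, and top-1 accuracy is the fraction of images where that matches the label. The sketch below uses toy 2-D vectors in place of real encoder outputs; the function names and all values are made up for illustration.

```python
import math


def normalize(vec):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def zero_shot_predict(image_feature, text_features):
    """Index of the class prompt with the highest cosine similarity."""
    img = normalize(image_feature)
    scores = [sum(a * b for a, b in zip(img, normalize(t))) for t in text_features]
    return max(range(len(scores)), key=scores.__getitem__)


def top1_accuracy(image_features, labels, text_features):
    """Fraction of images whose predicted class equals the label."""
    correct = sum(
        zero_shot_predict(f, text_features) == y
        for f, y in zip(image_features, labels)
    )
    return correct / len(labels)


# Toy embeddings standing in for real encoder outputs (hypothetical values).
text_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
image_features = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.6], [1.0, 0.05]]
labels = [0, 1, 2, 2]
print(top1_accuracy(image_features, labels, text_features))
```

Scaling the similarities or applying a softmax does not change the argmax, so the top-1 number depends only on the embeddings and the prompts.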
> Hi,
>
> There can be numerical differences that we cannot fully control, e.g. different CUDA and driver versions, batch sizes, hardware, etc., that may cause the 0.5% difference...
Here are the best results I obtained:

image encoder: ViT-B/16
prompt: "itap of a {label}."

| Dataset | Reproduced Acc. | Reported Acc. | Gap |
| ------------- | -------------...
I noticed that my GPU utilization and VRAM usage are very low, 2% and ~2 GiB respectively. Any hints on resolving this problem? Is there a specific hyper-parameter I should set to...
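The comment does not say what the evaluation loop looks like, so this is only a guess: one common reason for utilization this low is running one image per forward pass. If that is the case, grouping inputs into batches is the usual first thing to try. The helper `iter_batches` and the file names are hypothetical, and the model call is left as a comment.

```python
def iter_batches(items, batch_size):
    """Yield consecutive slices of `items` with at most `batch_size` elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


image_paths = [f"img_{i}.png" for i in range(10)]  # hypothetical file names
for batch in iter_batches(image_paths, batch_size=4):
    # In a real loop: preprocess each image in the batch, stack the results
    # into one tensor, and run a single forward pass for the whole batch.
    print(len(batch))
```

If batching alone does not help, the bottleneck may be image loading and preprocessing on the CPU rather than the model itself.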