Phil Wang comments

Results 814 comments of


Phil Wang

BYOL collapses

@mark-selyaeff Hey Mark! So the LR in the paper is actually specific to LARS I believe For Adam, try using a smaller learning rate (3e-4)

Code error when using torch.nn.DataParallel for multi-gpu: AssertionError: hidden layer avgpool never emitted an output

@SilverUnicorn Hi! thanks for reporting this! would you like to give 0.5.3 a try? I implemented the solution that @Vurkty linked to!

Code error when using torch.nn.DataParallel for multi-gpu: AssertionError: hidden layer avgpool never emitted an output

https://github.com/lucidrains/byol-pytorch/commit/8b0be4859305c6635a18b36e17d7bec85c1c9c9e

Error when running the example code

hmm I'm not too sure actually, does it work if you restrict it to 1 gpu? `CUDA_VISIBLE_DEVICES=0 python train.py`?

Error when running the example code

@jlindsey15 ok, i think i know why. https://github.com/lucidrains/byol-pytorch/commit/e3c245311bcfee58982d1a63e66587f9b302ba31 can you try again?

Error when running the example code

@jlindsey15 ahh, I believe you must have some greyscale images (mixed in with colored images) in your folder

Error when running the example code

@jlindsey15 ok, try again, this should fix the greyscale image problem, but it's probably better that you make sure it isn't auto-including some image you don't want pretraining on https://github.com/lucidrains/byol-pytorch/commit/87232dc89bd6b51fdf24d5174c20f806bad5b98e