fast-autoaugment
fast-autoaugment copied to clipboard
Stuck in search.py
First of all, thanks you very much for your generous to sharing code public.
My problem happen after i changed @ray.remote(num_gpus=4, max_calls=1) and ray.init(redis_address=args.redis) to @ray.remote(num_gpus=1, max_calls=1) and ray.init() in search.py,the running state of the code remains as shown in the image below.The code ran for two hours without making any progress.And it does not return any error messages.
Hope that you can answer question soon.Thank you very much!
P/s I use "!python3 FastAutoAugment/search.py -c confs/wresnet40x2_cifar.yaml" to run this code.
I also have the same problem. Does anyone have a solution?
me too
same problem without using ray cluster framework
any solution for this issue?
I found out that it stucks in infinite loop in search.py 'while True' cycle
When i raised KeyboardInterrupt, it shows:
I removed ray cluster's lines so line numbering may differ, but stuck occurs here:
interestingly that this stuck occurs on different stages - first, when training 2 of 5 model, second, when training first 5 of 5 models has finished.