ValueError: batch_size should be a positive integer value, but got batch_size=0
Hi @amritamaz and team that worked on Queen. Such exciting technology. Thank you very much for your efforts.
I am trying to run train.py on custom data and I receive the following error:
Traceback (most recent call last):
File "/home/location/queen/train.py", line 1193, in
Not sure why I'm getting this error. I was able to build on CUDA 12.4 running on Pop!_OS 22.04 on an RTX 4090.
Has anyone else encountered this? Any fixes I should consider?
Thanks so much!
Could be related to #6?
I have seen this when the dataset_reader does not successfully load the training images (you can verify this by inspecting len(train_image_dataset) before line 99). Can you compare your dataset organization against the updated README (fixes #6) and confirm that train_image_dataset has been populated correctly?
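A minimal sketch of that length check, in case it helps others who land here. The helper name `require_non_empty` is hypothetical and not part of the repository; it only assumes that `train_image_dataset` supports `len()`. The idea is to fail early with a readable message instead of reaching the `batch_size=0` error further down.

```python
def require_non_empty(dataset, name="train_image_dataset"):
    """Return the dataset length, or raise a descriptive error if it is empty."""
    n = len(dataset)
    if n == 0:
        raise RuntimeError(
            f"{name} is empty (len == 0): no training images were loaded. "
            "Check the dataset path and directory layout against the README."
        )
    return n


# Stand-in list used here only for illustration; in train.py you would pass
# the real train_image_dataset just before the DataLoader is created.
print(require_non_empty(["frame_0000.png", "frame_0001.png"]))
```

Calling it right before the DataLoader is built makes an unpopulated dataset show up as a loading problem rather than a batch-size problem.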
Thanks so much. I did have an issue with my directory structure: I did not have an images directory within each of the camera directories. Now, after running train.py, once the frame iteration completes I get a CUDA out of memory error. I'm running this on a 4090 card. Is there a way to reduce memory usage so it runs on this card without the CUDA error?
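On the directory problem described above, a quick pre-flight check along these lines can catch it before training starts. This is only a sketch under assumptions: it supposes the dataset root holds one directory per camera and that each should contain a non-empty subdirectory named `images`. The function name and the `images_dirname` parameter are hypothetical, and the README remains the authoritative description of the expected layout.

```python
from pathlib import Path


def cameras_missing_images(root, images_dirname="images"):
    """List camera directories under root that lack a non-empty images subdirectory.

    Assumes (not confirmed by the repository) one directory per camera
    directly under root.
    """
    missing = []
    camera_dirs = sorted(p for p in Path(root).iterdir() if p.is_dir())
    for cam in camera_dirs:
        images = cam / images_dirname
        # Flag the camera if the subdirectory is absent or contains no entries.
        if not images.is_dir() or not any(images.iterdir()):
            missing.append(cam.name)
    return missing
```

An empty list back from `cameras_missing_images("/path/to/dataset")` means every camera directory has at least one entry under `images`; any names returned are the cameras to fix.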
Hi @michnaugh1, can you paste the full error trace? Where exactly does the error occur? I would expect the code to run on a 4090.