RAPiD icon indicating copy to clipboard operation
RAPiD copied to clipboard

Bug in rapid.py

Open Alex18947 opened this issue 3 weeks ago • 1 comments

Hi, while training with custom fisheye dataset on a pretrained COCO checkpoint, I encounter some "index out of bounds" problems in rapid.py: here

I am running with CUDA_LAUNCH_BLOCKING=1 to get the exact line. The error is:

./aten/src/ATen/native/cuda/IndexKernel.cu:93: operator(): block: [0,0,0], thread: [0,0,0] Assertion -sizes[i] <= index && index < sizes[i] && "index out of bounds" failed.

Adding some prints before the error I can see that penalty_mask is of size: torch.Size([4, 3, 136, 136])

Other values:

b: 2 best_n: tensor([0, 0, 0, 0, 2]) truth_j: tensor([141, 99, 47, 53, 97]) truth_i: tensor([ 81, 109, 86, 96, 108])

I don't seem able to find the root cause as for now, seems like it happens for some images only. Any ideas?

Alex18947 avatar Feb 02 '25 11:02 Alex18947