faster-rcnn-pytorch How to train a multiple channel image

How to train a multiple channel image

Open lihaolin88 opened this issue 3 years ago • 3 comments

Hello, thank you for this project! Right now I want to train this network with 2 channel image, but in the code, I saw it just accept 1 or 3 channels image, so I just wondering is that okay for me to add some layers in faster_rcnn.py or is here have some other way to process 2 channels image? Thank you so much!

Oct 05 '21 17:10 lihaolin88

Hm... In this lines of code of collate.py file:

    if len(im.shape) == 2:
        im = im[:, :, np.newaxis]
        im = np.concatenate((im, im, im), axis=2)

you can see check, is there third dimension in image DATA. So grayscale image has only one dimension, but RGB has three dimension (1 - for X coord, 2 - for Y coord, 3 - for channels -> R, G, B).

I suppose your 2 channel image: grayscale + alpha. So, your image DATA still has three dimensions (1 - for X coord, 2 - for Y coord, 3 - for channels -> Gray, Alpha). Or am i wrong?

Oct 06 '21 03:10 loolzaaa

Thank you for your reply! this two-channel image is generated by myself, so it exactly just have 2 channels; what I'm doing right now is add some layers in faster_rcnn.py transform 2 channels to 3 channels, but it seems not works right.

Oct 06 '21 03:10 lihaolin88

this two-channel image is generated by myself, so it exactly just have 2 channels

I'm understood it, but after PyTorch read your image as data array, it must have 3 dimension.

BEFORE you change some code, WHERE you get an error, concrete link, please.

Oct 20 '21 08:10 loolzaaa

faster-rcnn-pytorch faster-rcnn-pytorch copied to clipboard

How to train a multiple channel image

faster-rcnn-pytorch
faster-rcnn-pytorch copied to clipboard