segtran icon indicating copy to clipboard operation
segtran copied to clipboard

problem about dataset

Open rezashaemi opened this issue 2 years ago • 17 comments

Thanks for your great project! I downloaded REFUGE dataset but the name and number of image files are different from names in this repo! For example, training part has 360 files with names 0001.jpg, ...

rezashaemi avatar Apr 05 '22 04:04 rezashaemi

Hi @rezashaemi , you can download the preprocessed data from baidu netdisk: https://pan.baidu.com/s/1HAB5AMV8gqgoi5joUFdpVQ?pwd=5iud

askerlee avatar Apr 05 '22 04:04 askerlee

Thanks a lot! It seems it doesn't allow someone out of china download data!

rezashaemi avatar Apr 05 '22 07:04 rezashaemi

Sorry for that. Could you pls try the dropbox link: https://www.dropbox.com/s/fd8hty3tgwwsl1i/refuge2020.tar?dl=0

askerlee avatar Apr 05 '22 07:04 askerlee

Great! Thanks again,

rezashaemi avatar Apr 05 '22 07:04 rezashaemi

hi Thank you for great code I also have the same problem with the rim database. The name of the images I downloaded is different from the name of the data code.

mahdiyehbakhshi822 avatar Apr 05 '22 19:04 mahdiyehbakhshi822

hi @mahdiyehbakhshi822 , the preprocessed RIM data can be downloaded at: https://www.dropbox.com/s/8osan2fuon7nb0q/rim.zip?dl=0

askerlee avatar Apr 06 '22 04:04 askerlee

Thank you

mahdiyehbakhshi822 avatar Apr 06 '22 04:04 mahdiyehbakhshi822

Sir,can you tell me something about the dataset you give? Thank u. 1.refuge2020g0001_576_597,0``n0001_576_712,18 they looks like 5 parts, I guess "g" and "n" are glaucoma or not."576" is the size, the other parts I don't knew. drishtitrain-089_576_32,32``` what's the meaning of "32" "32" 2. the lists show images/g0002_560_697,47.png images/g0002_576_689,39.png`,why there are two of each one image? Sorry to bother you again.

Lemonweier avatar May 08 '22 09:05 Lemonweier

@Lemonweier the two numbers in the filename are the (x,y) coordinates that each image is cropped at: https://github.com/askerlee/segtran/blob/master/code/MNet_DeepCDR/Step_1_Disc_Crop.py#L124

askerlee avatar May 08 '22 09:05 askerlee

Thank you for your reply!

Lemonweier avatar May 08 '22 10:05 Lemonweier

Thanks for the project, but is there any way to train the REFUGE2020 model without the help of drishti and rim datasets? I downloaded your cropped refuge dataset and removed the 'drishti,rim' from attribute ds_names inside fundus in train2d.py and trained&tested the model accordingly, but the dice for both cup and disk is tested to be 0, i fear there is something i did wrong, can you help me? thanks in advance!

OvO1111 avatar Apr 02 '23 11:04 OvO1111

Hi @OvO1111 yes of course you can train without drishti and rim. I guess maybe when you modified ds_names, some kind of bug was introduced? Could you try an alternative approach: do not change the code, but only remove the image and mask files in the drishti and rim folders, and do the training?

askerlee avatar Apr 02 '23 14:04 askerlee

ok thanks for your help, i'll try

EDIT: after running ./train2d.sh --task fundus --split all --translayers 3 --layercompress 1,1,2,2 --net segtran --bb eff-b4 --maxiter 10000 --bs 6 --noqkbias, it prompts FileNotFoundError: [Errno 2] No such file or directory: '../data/fundus/drishti/images', this trace back to db_train = init_training_dataset(...) in train2d.py, i guess i have to modify theds_names somehow...

OvO1111 avatar Apr 02 '23 14:04 OvO1111

Oh i'm sorry, removing the dataset names from ds_names should work, i accidentally made a mistake in the original code, thanks for your support

OvO1111 avatar Apr 03 '23 02:04 OvO1111

Hi @rezashaemi , you can download the preprocessed data from baidu netdisk: https://pan.baidu.com/s/1HAB5AMV8gqgoi5joUFdpVQ?pwd=5iud

REFUGE20验证集有400张图片,但链接只有120张?

geek-APTX4869 avatar Apr 15 '24 12:04 geek-APTX4869

I've checked. There are four folders, each folder having 400 images.

askerlee avatar Apr 15 '24 12:04 askerlee

I've checked. There are four folders, each folder having 400 images.

all the test.list have 120 lines

geek-APTX4869 avatar Apr 19 '24 10:04 geek-APTX4869