kaggle_ndsb2017 icon indicating copy to clipboard operation
kaggle_ndsb2017 copied to clipboard

where are the data being stored??

Open AYadav01 opened this issue 7 years ago • 8 comments

Hi Julina,

Congratulation on doing such a great work. I just have few question about the directories where you stored the data. In 'setting.py', I see u are referring to following locations: BASE_DIR_SSD BASE_DIR EXTRA_DATA_DIR NDSB3_RAW_SRC_DIR LUNA16_RAW_SRC_DIR

I am kind of confused which folder contains what; where am i supposed to store the ndsb data and where to store the LUNA16 dataset.

Thank you so much.

AYadav01 avatar Jul 21 '17 16:07 AYadav01

It's best to take a look ate the sourcecode.

juliandewit avatar Aug 06 '17 10:08 juliandewit

@juliandewit still confused. Can you please explain more briefly?

nishat-sayyed avatar Jun 21 '18 19:06 nishat-sayyed

@AYadav01, I think this will help you.

BASE_DIR_SSD = "C:/werkdata/kaggle/ndsb3/" # create the folder name ndsb3 for saving the corresponding results BASE_DIR = "D:/werkdata/kaggle/ndsb3/" # create the folder name ndsb3 for placing the input data here EXTRA_DATA_DIR = "resources/" # place here extra data given by julian in his repository. NDSB3_RAW_SRC_DIR = BASE_DIR + "ndsb_raw/stage12/" # place here the kaggle data which will further LUNA16_RAW_SRC_DIR = BASE_DIR + "luna_raw/" # place here the LUNA16 database

all below directories are created for saving the corresponding results of the preprocessing and nodule detector script

NDSB3_EXTRACTED_IMAGE_DIR = BASE_DIR_SSD + "ndsb3_extracted_images/" LUNA16_EXTRACTED_IMAGE_DIR = BASE_DIR_SSD + "luna16_extracted_images/" NDSB3_NODULE_DETECTION_DIR = BASE_DIR_SSD + "ndsb3_nodule_predictions/"

bundelesneha05 avatar Jun 22 '18 04:06 bundelesneha05

Still confused. @bundelesneha05 Please can you create these directories with all the datasets and extras (including kaggle, luna, ndsb3 etc etc) in them and upload them somewhere? And please provide the link for the same.

nishat-sayyed avatar Jun 22 '18 09:06 nishat-sayyed

It will be better you should take look at the repository(All scripts). You will able to solve your confusion.

bundelesneha05 avatar Jun 22 '18 11:06 bundelesneha05

Thank you @bundelesneha05 I took a look through the code. As you said above, I understood most of the part. The only problem I am facing now is in the directory NDSB3_RAW_SRC_DIR. Actually, Kaggle is not providing the dataset now. If any of you has the dataset or any reference for the same then it would be more than helpful. Or if the dataset is not available anywhere, can anyone suggest how to train the model with a similar dataset? Thanks again @bundelesneha05 you helped a lot.

nishat-sayyed avatar Jun 22 '18 18:06 nishat-sayyed

I found the kaggle data here: https://github.com/smeerson/DataScienceBowl2017

laisecf avatar Nov 24 '18 20:11 laisecf

Kaggle dropped te data. But you can still train the model.. You just cannot predict.

juliandewit avatar Apr 29 '19 19:04 juliandewit