ml4floods
ml4floods copied to clipboard
[DataPrep] csv file with train/val/test files
We need a csv file with all of the filenames for the images and ground truths. A simple glob will be sufficient.
Format
Some key columns for easy querying.
- filename, e.g. filename
- filepath, e.g. path/to/file
- bucket, e.g. ml4floods
- split - e.g., train, test, val
TiffImages
These are the original tiff images that were used before the train-test-split.
Location: Should be in the ml4floods_data_lake
directory. The csv file should be at the top of the directory where the tiffimages are located.
- [x] CSV File
Train/Test/Val
This will be the train/test/val split data.
Location: Should be in the ml4floods_data_lake
directory. The csv file should be at the top of the directory where the train/val/split is located.
- [x] CSVFile