course-v3
course-v3 copied to clipboard
lesson3-planet download links using kaggle api
In the notebook lesson3-planet (nbs/dl1/lesson3-planet.ipynb), the kaggle API command is not able to download the train files.
! kaggle competitions download -c planet-understanding-the-amazon-from-space -f train-jpg.tar.7z -p {path}
! kaggle competitions download -c planet-understanding-the-amazon-from-space -f train_v2.csv -p {path}
Returns 404 File not Found
List of files shows train-jpg is not present:
kaggle competitions files -c planet-understanding-the-amazon-from-space
name size creationDate
------------------------------------------------- ----- -------------------
test_v2_file_mapping.csv/test_v2_file_mapping.csv 600KB 2019-12-15 22:14:13
sample_submission_v2.csv/sample_submission_v2.csv 3MB 2019-12-15 22:14:13
train_v2.csv/train_v2.csv 1MB 2019-12-15 22:14:13
Kaggle-planet-train-tif.torrent 1MB 2019-12-15 22:14:13
Kaggle-planet-test-tif.torrent 2MB 2019-12-15 22:14:13
Partial solution: fix download of train_v2.csv by using updated path: kaggle competitions download -c planet-understanding-the-amazon-from-space -f train_v2.csv/train_v2.csv
Workaround: Downloading train-jpg.tar manually from kaggle's page (https://www.kaggle.com/c/planet-understanding-the-amazon-from-space/data)
It seems like someone has uploaded the jpgs to a different project as well.
You can get what I believe are the same files from nikitarom/planets-dataset
using:
!kaggle datasets download nikitarom/planets-dataset
The training csv has a different name and the paths are slightly different but it seems to a way forward.
This issue confused a whole group of us who are doing the class together, it would be good to update the notebook with some notes about a workaround.
Get your cookie and use wget to download the big file, More info here: https://forums.fast.ai/t/lesson-3-cant-download-planet-data-images-tar-archive/60309/21?u=yiqun
You can get what I believe are the same files from
nikitarom/planets-dataset
using:
THANK YOU!