PlanetaryComputerExamples icon indicating copy to clipboard operation
PlanetaryComputerExamples copied to clipboard

Data not available in benchmark-tutorial.ipynb

Open robmarkcole opened this issue 3 years ago • 4 comments
trafficstars

Working through competitions/cloud-cover/benchmark-tutorial.ipynb on a hub instance, the notebook states the data should be available in a volume but this is not the case:

---------------------------------------------------------------------------
AssertionError                            Traceback (most recent call last)
Input In [4], in <cell line: 5>()
      2 TRAIN_FEATURES = DATA_DIR / "train_features"
      3 TRAIN_LABELS = DATA_DIR / "train_labels"
----> 5 assert TRAIN_FEATURES.exists()

AssertionError: 

robmarkcole avatar Aug 15 '22 12:08 robmarkcole

Thanks for the report. I was cleaning some things up in preparation for a Hub migration and completely forgot that this notebook existed :/

I'll need to think a bit about how to adjust for that. The notebook will fail to run in the meantime, unless you signed up for the competition: https://www.drivendata.org/competitions/83/cloud-cover/data/

TomAugspurger avatar Aug 15 '22 13:08 TomAugspurger

No problem! I did not sign up for the competition but appears as it has finished I cannot. I tried downloading but as this is 53GB I guess I wont have space. I only want to train on the NIR band, is there a way to request just that band?

robmarkcole avatar Aug 15 '22 13:08 robmarkcole

I tried downloading but as this is 53GB I guess I wont have space.

You might have space outside of your home directory (e.g. /tmp) but that's reset each time the notebook server restarts (https://planetarycomputer.microsoft.com/docs/overview/environment/#understanding-the-file-system).

is there a way to request just that band?

I'm not sure. With the assets hosted by the Planetary Computer you can access single bands. I don't recall how these assets were distributed, but it might have been a large gz or ZIP file.

TomAugspurger avatar Aug 15 '22 13:08 TomAugspurger

yes they are large gz. NP I will download elsewhere and create a NIR version for use here, thanks!

robmarkcole avatar Aug 15 '22 13:08 robmarkcole