Results 952 comments of Tom Augspurger

Thanks for the report. I was cleaning some things up in preparation for a Hub migration and completely forgot that this notebook existed :/ I'll need to think a bit...

> I tried downloading but as this is 53GB I guess I wont have space. You might have space outside of your home directory (e.g. `/tmp`) but that's reset each...

That turns up another error: ``` image_link = source_imagery_links[0] response = session.get(image_link.href) image_item = Item.from_dict(response.json()) print(f"Item ID: {image_item.id}") print("Assets:") for asset_key, asset in image_item.assets.items(): print(f"- Asset Key: {asset_key}") asset_eo_ext =...

I made some progress on this by changing the `collection_id` to `ref_landcovernet_af_v1_labels`. With that change it's failing at https://nbviewer.org/github/microsoft/PlanetaryComputerExamples/blob/main/tutorials/radiant-mlhub-landcovernet.ipynb#Downloading-Source-Imagery. We're getting a 404 when we try to access the source...

IIUC, for intake-stac the expensive part starts with `StacCatalog._stac_obj.items()`. Creating each `stacsat.Item` is what's taking time. We can quickly do `self._stac_obj.links("item")` to get the URLs for all the items, and...

> The thing I'm not sure about is getting the id from the filename. Is it always true that that satstac.Item.id is in the filename, or is that just a...

Hmm, OK thanks. I'm not sure where that leaves us then... We could likely hack something together where `StacCatalog.__getitem__(key)` looks into the `{key}.json` first, validate that the id matches the...

One question about the CSV catalog: this is a reference to the "source of truth" data living on GCS at `gcs://pangeo-cmip6/AR6_WG1/...`? Will the dataset be changing over time? If so,...

> To further clarify, listing a large bucket like this with gcsfs takes > 10 minutes. That's unacceptably long for our users. Oh, yeah that's way too long for what...

> I suspect that if this were using fsspec's version of HTTPFileSystem, rather than the old one in dask, it would work as required. Thanks. Do you have a branch...