Tom Augspurger
Currently we don't traverse builtin collections (aside from dictionaries with a known set of keys for kerchunk). With https://github.com/stac-utils/pystac-client/issues/237 we might be pushing users towards `list(search.get_item())` and we'd want to...
Improve coverage of `NDFrame.__finalize__`. Pandas uses `NDFrame.__finalize__` to propagate metadata from one NDFrame to another. This ensures that things like `self.attrs` and `self.flags` are not lost. In general we would...
It can be somewhat hard to determine when / why the scheduler decides to scale the cluster under adaptive mode. Ideally a dashboard page could shed some light here. We...
Discovered in #378, the Azure integration tests are failing:

```pytb
++++++++++++++++++++++++++++++ Timeout ++++++++++++++++++++++++++++++
~~~~~~~~ Stack of ThreadPoolExecutor-2_0 (140540411098880) ~~~~~~~~
  File "/opt/conda/lib/python3.8/threading.py", line 890, in _bootstrap
    self._bootstrap_inner()
  File "/opt/conda/lib/python3.8/threading.py", line 932,...
```
Imbalanced datasets, where the classes have very different occurrence rates, can show up in large data sets. There are many strategies for dealing with imbalanced data. http://contrib.scikit-learn.org/imbalanced-learn/stable/api.html implements a set,...
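As a rough illustration of one such strategy, random undersampling drops majority-class samples until every class is the same size. The sketch below uses only the standard library; the function name `random_undersample` and the toy data are hypothetical and are not imbalanced-learn's API.

```python
import random
from collections import defaultdict


def random_undersample(samples, labels, seed=0):
    """Hypothetical helper: keep an equal number of samples per class.

    Every class is randomly reduced to the size of the rarest class.
    """
    rng = random.Random(seed)

    # Group the samples by their class label.
    by_class = defaultdict(list)
    for sample, label in zip(samples, labels):
        by_class[label].append(sample)

    # The rarest class sets the number of samples kept per class.
    n_min = min(len(group) for group in by_class.values())

    out_samples, out_labels = [], []
    for label, group in by_class.items():
        # Sample without replacement from each class.
        for sample in rng.sample(group, n_min):
            out_samples.append(sample)
            out_labels.append(label)
    return out_samples, out_labels


# Toy data (an assumption for illustration): class 1 is 10% of the samples.
samples = list(range(100))
labels = [1 if i % 10 == 0 else 0 for i in samples]
balanced_samples, balanced_labels = random_undersample(samples, labels)
```

Undersampling discards data, so it mostly suits cases where the majority class is large enough to spare; oversampling or class weights are the usual alternatives when it is not.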
### Feature description

Some organizations have policies in place requiring certain features be enabled on AKS. For example, this subscription has a policy requiring that [AKS-managed Azure Active Directory integration](https://learn.microsoft.com/en-us/azure/aks/managed-aad#azure-ad-authentication-overview)...
I got a mildly confusing error message from this:

```python
import pangeo_forge_recipes
import datetime
import requests
from pangeo_forge_recipes import patterns
from pangeo_forge_recipes.storage import StorageConfig, FSSpecTarget, CacheFSSpecTarget
from pangeo_forge_recipes.recipes import XarrayZarrRecipe
...
```
Currently, `FSSpecTarget` and friends try to make a "directory" as part of their `__post_init__`. https://github.com/pangeo-forge/pangeo-forge-recipes/blob/16eb6bfd23a26997523b86c0fe5428103ec95b8a/pangeo_forge_recipes/storage.py#L129-L132 For Azure / adlfs, this means trying to make a new storage container (think S3...
Just capturing a thought on how to handle secrets. There are two main kinds:

1. Bakery secrets: Things like "credentials to write to the {azure / aws / google} bucket"...
#### Minimal, reproducible code sample, a copy-pastable example if possible

This example times an encode / decode cycle on some random bytes. It's slower by a factor of 2x -...