Results 181 issues of Tom Augspurger

This issue is dedicated to discussing the large topic of "missing" data. First, a bit on names. I think we can reasonably choose between `NA`, `null`, or `missing` as a...

On the call yesterday, the topic of mutability came up in the vaex demo. The short version is that it may be difficult or impossible for some systems to implement...

In https://github.com/pydata-apis/dataframe-api/issues/2 there seems to be some agreement that row-labels are an important component of a dataframe. Pandas takes this a step further by using them for *alignment* in many...
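For readers unfamiliar with label-based alignment, here is a minimal sketch of what pandas does in a binary operation. The series names and values are invented for illustration and do not come from the issue itself.

```python
import pandas as pd

# Two series whose row labels overlap only partially and appear in a different order.
a = pd.Series([1, 2, 3], index=["x", "y", "z"])
b = pd.Series([10, 20, 30], index=["z", "y", "w"])

# pandas matches values by label before adding, so "y" pairs with "y"
# and "z" with "z"; labels present on only one side produce NaN.
aligned_sum = a + b

# Dropping to NumPy arrays removes the labels, so the addition is purely positional.
positional_sum = a.to_numpy() + b.to_numpy()

print(aligned_sum)
print(positional_sum)
```

The two results differ because the first uses the row labels and the second ignores them, which is the behaviour a label-free dataframe API would have to either drop or make explicit.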

Split from the discussions in https://github.com/pydata-apis/dataframe-api/issues/2. To avoid the trap of "let's just match pandas", let's collect a list of specific problems with the pandas API, which we'll intentionally deviate...

I'm investigating the best way to mosaic multiple arrays into a single array. https://desktop.arcgis.com/en/arcmap/10.3/manage-data/raster-and-images/what-is-a-mosaic.htm has a nice description. I'll follow up with what I learn. If anyone has additional resources...

This rewrites the Azure deployment guide. I'll detail the major changes below, but I wanted to ask up front: does pangeo want to be in the business of maintaining these...

cc @chiaral. I think this is why you weren't getting all your workers. 15 GB / worker happens to not schedule well onto our physical machines (with ~26 GB of memory). Something...
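A rough sketch of the packing arithmetic behind this, using the two figures quoted above. The helper `workers_per_node` is hypothetical, and it assumes memory is the only constraint; it ignores CPU requests and any memory reserved for system pods, so real numbers will be somewhat worse.

```python
def workers_per_node(node_memory_gb, worker_memory_gb):
    """Return (workers that fit on one node, memory left unused in GB).

    Simplifying assumption: memory is the only scheduling constraint.
    """
    fit = int(node_memory_gb // worker_memory_gb)
    unused = node_memory_gb - fit * worker_memory_gb
    return fit, unused


# Figures from the comment: 15 GB workers on nodes with roughly 26 GB.
fit, unused = workers_per_node(26, 15)
print(f"{fit} worker(s) per node, {unused} GB unused")

# For comparison only, a hypothetical 12 GB request packs more tightly.
fit_12, unused_12 = workers_per_node(26, 12)
print(f"{fit_12} worker(s) per node, {unused_12} GB unused")
```

Under these assumptions only one 15 GB worker fits per node and about 11 GB goes unused, so each additional worker needs a whole new node.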

https://app.circleci.com/pipelines/github/pangeo-data/pangeo-cloud-federation/1638/workflows/9ead26f7-994d-43cd-9c4a-8bd5dcee52d4/jobs/1805 failed. We tried to start the test before the singleuser pod was ready: ``` [Staring singleuser server] % Total % Received % Xferd Average Speed Time Time Time Current...

We add the JupyterHub username as a kubernetes label to the scheduler and worker pods at https://github.com/pangeo-data/pangeo-cloud-federation/blob/41b981403993a305d438cfddce5aa43f9d0ffdd5/pangeo-deploy/values.yaml#L104-L106. This aids in monitoring since we can easily see which JupyterHub user is...

Hi all, I'm still offline for a bit, but wanted to dump some thoughts on our current setup, as of 2020-11-11. This is primarily focused on the GCP deployment (https://us-central1-b.gcp.pangeo.io/,...