keepsake issues

Get checkpoints by ID in Python

It should be possible to load a specific checkpoint by ID in Python. For example: ```python checkpoint = replicate.checkpoints.get("abc123") ``` This would be useful for using the checkpoint ID as...

bfirsh

type/enhancement

Remove dependency on gcloud command

Got this error on a blank machine: [`gcloud` also doesn't support `GOOGLE_APPLICATION_CREDENTIALS`, so you have to use `CLOUDSDK_AUTH_CREDENTIAL_FILE_OVERRIDE` to authenticate.](https://serverfault.com/questions/848580/how-to-use-google-application-credentials-with-gcloud-on-a-server)

bfirsh

type/bug

help wanted

priority/medium

Vendor pkg_resources?

We use the `pkg_resources` package to retrieve imported packages at runtime. It's provided by [setuptools](https://setuptools.readthedocs.io/en/latest/pkg_resources.html), which is not guaranteed to be available. Perhaps we should vendor it like we do...

andreasjansson

type/enhancement

Add support for NO_COLOR

We should support [NO_COLOR](https://no-color.org/) in the CLI, implemented in the `console` package.

andreasjansson

type/enhancement

Better handling of metadata load failures

When experiment or heartbeat metadata fails to load, we currently just output a warning to the console. This might be fine, but it might also cause unexpected results e.g. if...

andreasjansson

type/enhancement

Speed up experiment delete

When deleting experiments, we currently iterate through all checkpoints sequentially to delete saved tarballs. This is slow for experiments with lots of checkpoints. We should parallelize this. See also https://github.com/replicate/replicate/issues/332

andreasjansson

help wanted

type/enhancement

checkpoint.open() shouldn't read entire file into memory

# Problem Model files can be huge, but `checkpoint.open()` currently works by returning `io.BytesIO(f.read())`. This is a bodge since it allows us to immediately delete the temporarily downloaded experiment files....

andreasjansson

type/enhancement

Consolidate delete logic

We currently have separate logic for deleting experiments in both Go and Python. It ought to be consolidated, preferably by exposing a single delete method through the Go RPC API.

andreasjansson

help wanted

type/chore

Add --json flag to replicate diff

Would be great for programmability, for example in a CI setting.

andreasjansson

help wanted

type/enhancement

Generalize replicate diff to >2 experiments/checkpoints

Often you want to compare across a bunch of experiments/checkpoints, but replicate diff is currently limited to two entities. Diffing more than two source code files is hard (both computationally...

andreasjansson

type/enhancement

keepsake
keepsake copied to clipboard

Metadata

Get checkpoints by ID in Python

Remove dependency on gcloud command

Vendor pkg_resources?

Add support for NO_COLOR

Better handling of metadata load failures

Speed up experiment delete

checkpoint.open() shouldn't read entire file into memory

Consolidate delete logic

Add --json flag to replicate diff

Generalize replicate diff to >2 experiments/checkpoints

← Metadata

Owner

Metadata

keepsake keepsake copied to clipboard

Metadata

← Metadata

Owner

Metadata

keepsake
keepsake copied to clipboard