Dave Berenbaum

Results 144 issues of Dave Berenbaum

https://github.com/iterative/dvc/pull/10164 will introduce `datasets` as a new type of dependency that aren't based on the local filesystem. This same mechanism can be used to stream data from other DVC repos....

p1-important
A: api

https://github.com/iterative/dvc/pull/10164 will introduce `datasets` as a new type of dependency that aren't based on the local filesystem. This same mechanism can be used to support cloud-versioned data. Users can specify...

p1-important
A: api
A: cloud-versioning

# Bug Report ## Description When adding new data and forgetting to push to a cloud-versioned remote, `dvc data status --not-in-remote` still reports a clean status. ### Reproduce ```bash $...

p1-important
A: status
A: cloud-versioning

Add support for: - [ ] `dvc gc --not-in-remote` (see https://github.com/iterative/dvc/pull/9350) - [ ] `dvc gc --cloud`

p3-nice-to-have
A: gc
A: cloud-versioning

See ~#5029~ (edit: https://github.com/iterative/dvc.org/issues/5029) and the related issues linked there for background. Rather than document the environment variables to auto push experiments, we could make this part of the studio...

p1-important
A: experiments
A: cli

[Auto-pushing checkpoints](https://dvc.org/doc/user-guide/experiment-management/checkpoints#caching-checkpoints) was introduced to make it easier to recover long-running model training jobs in CI. For long-running processing jobs over multiple pipeline stages, the same behavior should be available...

feature request
p2-medium
A: experiments
A: pipelines

CLI command and API method to get all stage info (cmd, params, deps, outs, etc.). * CLI command is useful for seeing the resolved info for stages using interpolation *...

p1-important
A: api
A: cli
A: pipelines

Closes https://github.com/iterative/dvc-bench/issues/468 Edit: the need for this is to help track performance for issues like https://github.com/iterative/dvc/issues/9085

Copied from [slack](https://iterativeai.slack.com/archives/CB41NAL8H/p1698325652730299?thread_ts=1698299109.693519&cid=CB41NAL8H) I’m able to reproduce it using the aws sandbox: ``` $ git clone [email protected]:dberenbaum/download-dvc-dir.git $ cd download-dvc-dir $ dvc pull test2014 ``` This pulls data imported from...

performance
A: data-sync

btw., what happens if two people name the experiment the same way simultaneously (feels that it would have been better to have still some random id / hash in the...

discussion
A: experiments