Dave Berenbaum

Results 144 issues of Dave Berenbaum

The benchmarks are now all focused on data management. We need a use case focusing on experiments. This should have many revisions and exp refs and include: * `dvc exp...

p1-important

Need a benchmark to test `repro` performance, including:
- Multi-stage pipelines
- Large deps and outputs in terms of size and num files
- Multiple pipeline iterations
- With and...

p1-important

Add benchmarks for cloud versioning remotes

Between 2.6.3 and 2.9.1, there was a 6.68x regression in the `test_sharing-add-noop` benchmark:

We were ignoring the studio token and other studio fields from the user's config during `dvc artifacts get`.

Fixes https://github.com/iterative/dvc/issues/10255. This is a partial reversion of https://github.com/iterative/dvc/pull/9246. I'm not sure this is the cleanest or best way to do it, but it solves the problem. It builds the...

See https://github.com/iterative/dvcx/pull/1321/files#r1547778555. `dvc.api.get_dataset()` should return something like `{"name": "dogs-and-cats", "version": 1, "uri": "ds://dogs-and-cats@v1"}` (adding the `"uri"` field).

p2-medium
A: api
A: data-management
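
The return shape requested for `dvc.api.get_dataset()` above can be illustrated with a small sketch. This is not DVC's implementation: `with_uri` is a hypothetical helper, and the `ds://<name>@v<version>` pattern is inferred only from the single example in the issue text.

```python
def with_uri(dataset: dict) -> dict:
    """Return a copy of `dataset` with a `uri` field added.

    Hypothetical helper: the URI layout is assumed from the example
    "ds://dogs-and-cats@v1" and may not match what DVC actually builds.
    """
    name = dataset["name"]
    version = dataset["version"]
    # Copy the input so the caller's dict is not mutated.
    return {**dataset, "uri": f"ds://{name}@v{version}"}


info = with_uri({"name": "dogs-and-cats", "version": 1})
print(info["uri"])
```

Deriving the URI from the existing `name` and `version` fields keeps the three values consistent, so callers would not need to assemble the string themselves.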

Following up on https://github.com/iterative/dvc/issues/10313 and related new features specifying `datasets` as dependencies, we can add more types of supported datasets:
- [delta lake](https://iterativeai.slack.com/archives/CB41NAL8H/p1709267375350649)
- hugging face

This could allow for...

feature request
p2-medium
A: data-management