Ryan Hatter

64 comments by Ryan Hatter

I'm able to reproduce. I'd be happy to take this one.

There are a couple things going on that are causing some confusion: 1. Currently the [API for `GET`ting a `DatasetVersion`](https://github.com/MarquezProject/marquez/blob/main/api/src/main/java/marquez/api/DatasetResource.java#L91-L131) gets version data from [`DatasetVersionDao`](https://github.com/MarquezProject/marquez/blob/main/api/src/main/java/marquez/db/DatasetVersionDao.java). The queries in `DatasetVersionDao` use...

Yeah, specifically for contributing to Marquez. Maybe a section in CONTRIBUTING.md that touches on the `--build` flag to build a local image from the developer's current branch.

Ultimately I agree with @wslulciuc. The queries `DatasetVersionDao.findBy`, `DatasetVersionDao.findByUuid`, and `DatasetVersionDao.findAll` use the column `version`, which, as I understand, is intended to be the internal/private DatasetVersion identifier. These queries (and...

I might be misunderstanding, but `version` in this context is a UUID, so I don't think a collision with the version id of another system is a concern.

I'm interested in taking this on, but I'm not sure I have enough context to pick it up 😅. Is the idea to pull facet data for `dataset_version`, `job_version`...

> What's the reasoning behind steps 2 and 3?

In my opinion, the `version` field is confusing. We're dealing with a DatasetVersion already; does it make sense that a version...

@collado-mike @wslulciuc Is there anything additional I need to do to bring this to a vote or anything like that?

> -0.9 for this change, reaching the parallelism limits is a normal and expected thing and we should not warn the user when it happens.

This is only true for...

> > This is only true for users whose Airflow environment is somewhat optimized. There are a lot of configs to consider (parallelism, worker concurrency, pools) in order to limit...