Metadata emission

Open: skrawcz opened this issue 3 years ago • 1 comment

Is your feature request related to a problem? Please describe.

Hamilton encodes a lot of metadata that lives in code, and it creates more at execution time. Projects such as https://datahubproject.io/ and https://openlineage.io/ capture this kind of metadata across a wide array of tooling to create a central view in a heterogeneous environment. Hamilton should be able to emit metadata to them.

Describe the solution you'd like

A user should be able to specify whether their Hamilton DAG emits metadata. This should play nicely with graph adapters, e.g. Spark, Ray, Dask.

UX questions:

  1. Should this be something in the graph adapter universe? E.g. a mixin?
  2. Or should this be on the driver side, so that you change drivers for functionality but change graph adapters for scale?
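
To make the two options above concrete, here is a minimal, self-contained sketch. It does not use Hamilton's real classes: `ToyDriver`, `SimpleAdapter`, `MetadataEmittingMixin`, `ListEmitter`, and `MetadataEvent` are all hypothetical stand-ins invented for illustration, and a real emitter would presumably forward events to DataHub or OpenLineage rather than collect them in a list.

```python
import inspect
from dataclasses import dataclass
from typing import Any, Callable, Dict, List, Optional


@dataclass
class MetadataEvent:
    """Hypothetical record describing one executed node."""
    node_name: str
    dependencies: List[str]
    result_type: str


class ListEmitter:
    """Hypothetical sink that just collects events in memory."""

    def __init__(self) -> None:
        self.events: List[MetadataEvent] = []

    def emit(self, event: MetadataEvent) -> None:
        self.events.append(event)


class SimpleAdapter:
    """Stand-in for a graph adapter: decides how a node function is run."""

    def execute_node(self, name: str, fn: Callable, kwargs: Dict[str, Any]) -> Any:
        return fn(**kwargs)


class MetadataEmittingMixin:
    """Option 1: emission lives in the adapter layer, added via a mixin."""

    emitter: ListEmitter

    def execute_node(self, name: str, fn: Callable, kwargs: Dict[str, Any]) -> Any:
        result = super().execute_node(name, fn, kwargs)
        self.emitter.emit(MetadataEvent(name, sorted(kwargs), type(result).__name__))
        return result


class EmittingSimpleAdapter(MetadataEmittingMixin, SimpleAdapter):
    def __init__(self, emitter: ListEmitter) -> None:
        self.emitter = emitter


class ToyDriver:
    """Stand-in driver: wires functions together by parameter name."""

    def __init__(self, functions, adapter, emitter: Optional[ListEmitter] = None):
        self.functions = {f.__name__: f for f in functions}
        self.adapter = adapter
        self.emitter = emitter  # Option 2: emission lives on the driver side.

    def execute(self, target: str, inputs: Dict[str, Any]) -> Any:
        cache = dict(inputs)

        def resolve(name: str) -> Any:
            if name not in cache:
                fn = self.functions[name]
                kwargs = {p: resolve(p) for p in inspect.signature(fn).parameters}
                cache[name] = self.adapter.execute_node(name, fn, kwargs)
                if self.emitter is not None:
                    self.emitter.emit(
                        MetadataEvent(name, sorted(kwargs), type(cache[name]).__name__)
                    )
            return cache[name]

        return resolve(target)


def doubled(x: int) -> int:
    return x * 2


def total(doubled: int, y: int) -> int:
    return doubled + y


# Option 1: swap in an emitting adapter, leave the driver untouched.
adapter_sink = ListEmitter()
driver_a = ToyDriver([doubled, total], EmittingSimpleAdapter(adapter_sink))
result_a = driver_a.execute("total", {"x": 2, "y": 3})

# Option 2: keep the plain adapter, hand the emitter to the driver.
driver_sink = ListEmitter()
driver_b = ToyDriver([doubled, total], SimpleAdapter(), emitter=driver_sink)
result_b = driver_b.execute("total", {"x": 2, "y": 3})
```

In this toy version both options produce the same events. The difference is where the coupling sits: the mixin has to be combined with each adapter (Spark, Ray, Dask) separately, while the driver-side hook works with any adapter but only sees what the driver itself orchestrates.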

skrawcz, Jun 21 '22 21:06

Adding a custom source for Datahub:

  • https://datahubproject.io/docs/metadata-ingestion/developing/
  • https://datahubproject.io/docs/metadata-ingestion/adding-source/

skrawcz, Jun 22 '22 03:06

FYI - @gravesee - seems like I created this a while back for emission of metadata/usage.

Would love any help in providing a motivating use case!

skrawcz, Dec 27 '22 21:12

We are moving repositories! Please see the new version of this issue at https://github.com/DAGWorks-Inc/hamilton/issues/30. Also, please give us a star/update any of your internal links.

elijahbenizzy, Feb 26 '23 17:02