dolphinscheduler icon indicating copy to clipboard operation
dolphinscheduler copied to clipboard

[DSIP-79][Task] Add Datavines task to better support data quality

Open xxzuo opened this issue 1 year ago • 9 comments

Search before asking

  • [X] I had searched in the DSIP and found no similar DSIP.

Motivation

DataVines is an easy-to-use data quality service platform that supports multiple metric. https://github.com/datavane/datavines

  • Datavines supports executing multiple metrics in one job.
  • Datavines supports execution status dashboard and data quality report.
  • Datavines supports plug-in extensions for components such as metric, data sources, error data storage, and execution engines.
  • Jdbc engines can be used to execute data quality tasks instead of solely relying on Spark engines.

Design Detail

Sript mode

  1. config data quality job in datavines image

  2. get the job config scipt file

  3. Add datavines job node in workflow, and configure the script image

API Mode

  1. config data quality job in datavines image

  2. get the jobId

  3. Add datavines job node in workflow, and configure the datavines api address and jobId

Compatibility, Deprecation, and Migration Plan

No response

Test Plan

No response

Code of Conduct

xxzuo avatar Jun 03 '24 15:06 xxzuo

It would be nice if you could submit a task here and see the status of the task in ds and stop it via datavines

MYiYang avatar Jun 04 '24 00:06 MYiYang

very useful for data pipeLine

zhangp8721 avatar Jun 04 '24 01:06 zhangp8721

If the datavines are incorporated into the ds, it will be easier to integrate project management and data inspection

xiaoshiqiai avatar Jun 04 '24 10:06 xiaoshiqiai

+1

zixi0825 avatar Jun 07 '24 05:06 zixi0825

You should provide a detail design related of the how to use the new task and how does the task work in ds, rather than some pictures of ui.

ruanwenjun avatar Jun 17 '24 13:06 ruanwenjun

You should provide a detail design related of the how to use the new task and how does the task work in ds, rather than some pictures of ui.

ok, I will supplement the detail design.

xxzuo avatar Jul 11 '24 06:07 xxzuo

ok, I will supplement the detail design.

Hi, are you still working on this?

SbloodyS avatar Oct 24 '24 03:10 SbloodyS

ok, I will supplement the detail design.

Hi, are you still working on this?

I will come to do this.

zixi0825 avatar Oct 24 '24 12:10 zixi0825

Before the new task plugin is completed, shell tasks can be used to integrate datavines, refer to the following guidelines https://datavane.github.io/datavines-website/docs/integration/dolphin-scheduler

zixi0825 avatar Oct 27 '24 13:10 zixi0825

Since the Datavane can run tasks independently, why use DolphinScheduler at all? In the future, DolphinScheduler might serve only as a scheduler.

rockgarden avatar Jul 31 '25 16:07 rockgarden