spark-sql-flow-plugin
Support INSERT INTO ... SELECT SQL
Example: insert into table bigdata.test_child_lineage select * from bigdata.test_child_dt where ds='20211207'
Here there is a lineage relationship between test_child_lineage and test_child_dt: test_child_lineage is populated from test_child_dt.
Yes, but it is not easy to implement because Spark does not keep that relationship in its catalog. We would probably need a custom SQL listener to track INSERT query logs.
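To make the idea of tracking INSERT query logs concrete, here is a minimal sketch in Python. It is not how the plugin works and it does not use any Spark API. It assumes the SQL text of each query has already been captured somewhere (for example by a listener), and it uses simple regular expressions rather than a real SQL parser. The function name `extract_lineage` and both patterns are hypothetical, made up for illustration.

```python
import re
from typing import List, Tuple

# Hypothetical pattern: "INSERT INTO|OVERWRITE [TABLE] <target>" at the start of a statement.
_INSERT_RE = re.compile(
    r"^\s*insert\s+(?:into|overwrite)\s+(?:table\s+)?([\w.]+)",
    re.IGNORECASE,
)
# Hypothetical pattern: table names that directly follow FROM or JOIN.
_SOURCE_RE = re.compile(r"\b(?:from|join)\s+([\w.]+)", re.IGNORECASE)


def extract_lineage(sql: str) -> List[Tuple[str, str]]:
    """Return (source, target) edges for a single INSERT ... SELECT statement."""
    target_match = _INSERT_RE.search(sql)
    if target_match is None:
        return []  # not an INSERT statement, so no lineage edge is recorded
    target = target_match.group(1)
    # Scan only the text after the target so the target is not picked up as a source.
    rest = sql[target_match.end():]
    sources: List[str] = []
    for name in _SOURCE_RE.findall(rest):
        if name not in sources:  # keep first-seen order, drop duplicates
            sources.append(name)
    return [(source, target) for source in sources]


edges = extract_lineage(
    "insert into table bigdata.test_child_lineage "
    "select * from bigdata.test_child_dt where ds='20211207'"
)
```

For the example query in this issue, `edges` contains one pair linking bigdata.test_child_dt to bigdata.test_child_lineage. A regex approach like this will misread CTE names as source tables and ignores statements that start with WITH, so a real implementation would more likely inspect the analyzed query plan instead of the raw SQL text.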
Do you have a plan to implement this? Thanks!
I don't have one, because I don't have a good idea for how to implement it. Do you plan to use the feature if it's implemented?
Yes, we will use it. Most of our scheduled jobs that run every day are INSERT SQL jobs.
This article describes a way to solve this problem: https://zhuanlan.zhihu.com/p/502927331?utm_source=wechat_session&utm_medium=social&utm_oi=27427107504128&utm_campaign=shareopn&s_r=0