melin

Results 38 comments of melin

> Hi, @zyclove > > Thanks for your feed back. Supporting hudi is a very useful feature for arctic and we are planning put it into our roadmap. But It...

不建议采用flinkcdc 方案,被绑定到flinkcdc 版本依赖,flInkcdc 现在活跃度不是很高,高级功能只有云产品有,依赖debezium版本低,有bug 迟迟不能升级。可以考虑基于debezium-server 来设计实现,这样兼顾到多引擎。 参考这个项目:https://github.com/memiiso/debezium-server-spark https://github.com/memiiso/debezium-server-spark/blob/master/docs/images/debezium-batch.png

> @melin Which registry version are you using? Pl send mysql server details including storage engine. hortonworks-registry version: 0.5.4 mysql version: 5.6.35

> I am not very clear about the purpose of providing antlr files. Is it just for developers to understand ngql syntax or will you use antlr to parse ngql?...

Implement CDC data synchronization hudi based on Debezium Server https://github.com/apache/hudi/issues/6853 @CalvinKirs

支持限流,没有限流不是一个完整数据同步平台

Add catalogName parameter to MetadataStore interface method

> I feel this is more like a snapshot switch and publishing procedure. Do you mean Hudi cannot support this? After the data snapshot is generated, it is not visible...

> try set 'as.of.instant' to time travel? hudi commited > try set 'as.of.instant' to time travel? After hudi is committed, it can be directly queried as the latest snapshot. In...

For the data written by spark sql, avoid quality problems (for example, a field has an invalid value, or the data volume fluctuates greatly compared to the previous cycle), and...