seatunnel icon indicating copy to clipboard operation
seatunnel copied to clipboard

[Improve][Connector-V2] Enhance FakeSource connector

Open hailin0 opened this issue 2 years ago • 9 comments

Search before asking

  • [X] I had searched in the feature and found no similar feature requirement.

Description

Thoughts on enhance FakeSource:

  • [x] SeaTunnelSchema support parse row type from config file #2765
  • [ ] Support define row type
  • [x] Supports define the total number of rows of data
  • [ ] Supports direct definition of data values(row)
  • [ ] Support value's range to avoid exceed in some connectors.
  • [ ] Support random generate null value for validating connector's stability.
  • [ ] Support more than splits and parallelism #2961

Anyone willing to help?

Usage Scenario

No response

Related issues

No response

Are you willing to submit a PR?

  • [ ] Yes I am willing to submit a PR!

Code of Conduct

hailin0 avatar Sep 16 '22 04:09 hailin0

Now SeaTunnelSchema can not parse row type from config file, so we should upgrade it first. I can do this work.

TyrantLucifer avatar Sep 17 '22 08:09 TyrantLucifer

Please assign it to me, I am interested in FakeDatasource and console

laglangyue avatar Sep 17 '22 13:09 laglangyue

I think fake source need support define the key fields and generate unique value for key fields. It's very useful to test exactly-once sink which implement exactly-once by Idempotent write data.

EricJoy2048 avatar Sep 20 '22 02:09 EricJoy2048

  • Support value's range to avoid exceed in some connectors.
  • Support random generate null value for validating connector's stability.

hk-lrzy avatar Sep 21 '22 12:09 hk-lrzy

我认为假源需要支持定义关键字段并为关键字段生成唯一值。测试通过幂等写入数据exactly-once实现的接收器非常有用。exactly-once

good idea, I will do it

laglangyue avatar Sep 21 '22 14:09 laglangyue

  • Support value's range to avoid exceed in some connectors.
  • Support random generate null value for validating connector's stability.

I will do it but lately

laglangyue avatar Sep 21 '22 14:09 laglangyue

I suggest FakeSource support parallel, I want to use FakeSource and LocalFileSink test the data consistency while some engine node shutdown(For SeaTunnel Engine integration testing). The FakeSource task will run in more than one cluster node only if it support parallel.

EricJoy2048 avatar Sep 28 '22 06:09 EricJoy2048

  • [ ] Support more than splits and parallelism. please assign this task to me. I am working for customize parallelism.

liugddx avatar Sep 30 '22 14:09 liugddx

This issue has been automatically marked as stale because it has not had recent activity for 30 days. It will be closed in next 7 days if no further activity occurs.

github-actions[bot] avatar Nov 25 '22 00:11 github-actions[bot]