Changefeeds are not distributed evenly over TiCDC nodes

Open King-Dylan opened this issue 1 year ago • 4 comments

What did you do?

Creating multiple changefeeds, each containing only one table, results in all table tasks being scheduled to a single capture. This is unreasonable. I think that before binding a table to a capture, the scheduler should check more conditions, such as the tasks already bound to each capture, and then choose the best capture for the table. That would handle this scenario better. https://github.com/pingcap/tiflow/blob/master/cdc/scheduler/internal/v3/scheduler/scheduler_basic.go#L81-L82
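
A minimal sketch of the idea, assuming the scheduler can see how many tables are bound to each capture across all changefeeds. `CaptureID`, `pickLeastLoaded`, and the `load` map are hypothetical names for illustration only and do not mirror tiflow's internal scheduler types or its actual logic:

```go
package main

import (
	"fmt"
	"sort"
)

// CaptureID is a hypothetical identifier type used only in this sketch.
type CaptureID string

// pickLeastLoaded returns the capture that currently has the fewest tables
// bound to it. Ties are broken by capture ID so the choice is deterministic.
// The second return value is false when no captures are available.
func pickLeastLoaded(load map[CaptureID]int) (CaptureID, bool) {
	if len(load) == 0 {
		return "", false
	}
	ids := make([]CaptureID, 0, len(load))
	for id := range load {
		ids = append(ids, id)
	}
	sort.Slice(ids, func(i, j int) bool {
		if load[ids[i]] != load[ids[j]] {
			return load[ids[i]] < load[ids[j]]
		}
		return ids[i] < ids[j]
	})
	return ids[0], true
}

func main() {
	// Assumed global view: tables bound per capture, summed over all changefeeds.
	load := map[CaptureID]int{"capture-1": 0, "capture-2": 0, "capture-3": 0}

	// Six changefeeds with one table each.
	for i := 1; i <= 6; i++ {
		id, _ := pickLeastLoaded(load)
		load[id]++
		fmt.Printf("changefeed-%d -> %s\n", i, id)
	}
	fmt.Println(load)
}
```

With three captures and six single-table changefeeds, each capture ends up with two tables, because the load count is shared across changefeeds instead of being tracked per changefeed.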

What did you expect to see?

The tasks are evenly distributed across the TiCDC nodes.

What did you see instead?

All of them are bound to a single node.

Versions of the cluster

TiCDC version (execute cdc version):

v7.1.1

King-Dylan, Jun 20 '24 18:06

https://github.com/pingcap/tiflow/issues/11331 seems to have fixed only the uneven distribution for one changefeed with multiple tables. This scenario is multiple changefeeds with one table task each.

King-Dylan, Jun 20 '24 18:06

#11331 seems to have fixed only the uneven distribution for one changefeed with multiple tables. This scenario is multiple changefeeds with one table task each.

@King-Dylan Is the link in the comment wrong?

fubinzh, Jun 21 '24 06:06

https://github.com/pingcap/tiflow/pull/9847 This is the previous issue that @King-Dylan was referring to.

choudharypankaj, Jun 26 '24 18:06

This looks like an enhancement? @flowbehappy, please take a look.

fubinzh, Jun 28 '24 01:06

We are working on the new-architecture TiCDC (https://github.com/pingcap/ticdc) and will address this in the new project.

flowbehappy, Nov 14 '24 04:11