seatunnel
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
close #10005 ### Purpose of this pull request ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? ### Check list * [ ] If any...
When there are special fields on the sink side, the table structure on the sink side should be used to judge. For example, if the source is mysql and the...
### Search before asking - [x] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues. ### What happened Improving ORC writing performance by batch-writing files. I test...
… ### Purpose of this pull request ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? ### Check list * [ ] If any new...
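### Purpose of this pull request Fix an array index out-of-bounds error that occurs when sqlserver-cdc runs in the earliest startup mode. The cause is an incorrect initial offset value (INITIAL_OFFSET). The original code represented it with a single byte, so the job throws ArrayIndexOutOfBoundsException at startup, because the comparison code compares the two values as byte arrays. For example, the current sqlserver cdc lsn is 0x000233BA00000930016E, 10 bytes in total, while the initial lsn used for the comparison has only 1 byte. The first bytes are equal, so the comparison moves on to the second byte, and reading that index from the 1-byte array goes out of bounds. INITIAL_OFFSET is now also 10 bytes long, which resolves the problem. ### Does this PR...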
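The failure described above can be reproduced in isolation. The following is a minimal sketch, not the actual SeaTunnel or connector code: the class `LsnCompareSketch` and the methods `compareUnchecked` and `hex` are hypothetical names, and the comparison loop is an assumption about how a byte-wise comparison without a length check behaves. Only the LSN value and the 1-byte versus 10-byte lengths come from the PR description.

```java
public class LsnCompareSketch {

    // Hypothetical byte-wise comparison: walks the bytes of `a` and reads the
    // same index from `b` without checking that `b` is long enough.
    static int compareUnchecked(byte[] a, byte[] b) {
        for (int i = 0; i < a.length; i++) {
            int diff = (a[i] & 0xFF) - (b[i] & 0xFF); // compare as unsigned bytes
            if (diff != 0) {
                return diff;
            }
        }
        return 0;
    }

    // Helper for the sketch: turns a hex string into a byte array.
    static byte[] hex(String s) {
        byte[] out = new byte[s.length() / 2];
        for (int i = 0; i < out.length; i++) {
            out[i] = (byte) Integer.parseInt(s.substring(2 * i, 2 * i + 2), 16);
        }
        return out;
    }

    public static void main(String[] args) {
        byte[] current = hex("000233BA00000930016E"); // 10-byte LSN from the PR text
        byte[] oneByteInitial = new byte[] {0};       // assumed shape of the old initial offset
        byte[] tenByteInitial = new byte[10];         // assumed shape of the fixed initial offset

        try {
            // First bytes match (0x00), so the loop reads index 1 of a 1-byte array.
            compareUnchecked(current, oneByteInitial);
        } catch (ArrayIndexOutOfBoundsException e) {
            System.out.println("1-byte initial offset: index out of bounds");
        }

        // With equal lengths the comparison completes; the current LSN sorts later.
        System.out.println(compareUnchecked(current, tenByteInitial) > 0);
    }
}
```

Under these assumptions, padding the initial value to the same 10-byte width as a real LSN is what keeps every index read in range, which matches the fix the PR describes.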
### Search before asking - [x] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues. ### What happened **Problem** After the merge of #9867, the `SupportSchemaEvolutionSinkWriter` interface contains...
### Search before asking - [x] I had searched in the [feature](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22Feature%22) and found no similar feature requirement. ### Description https://github.com/alibaba/fluss - add connector module - add docs(en/zh) - add...
### Search before asking - [x] I had searched in the [feature](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22Feature%22) and found no similar feature requirement. ### Description Hello, as the version iteration of SeaTunnel is getting faster...
### Purpose of this pull request support #9716 The main work of pr: 1. Parse pdf files based on `PdfBox`. 2. At present, four element types, namely `Heading`, `Paragraph`, `Link`...
### Search before asking - [x] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues. ### What happened Sometimes when I use seatunnel.sh -can jobid to cancel streaming...