KnightChess

Results 69 comments of KnightChess

@vinothchandar @xushiyan @yihua Could you help answer this question?

@nsivabalan thanks for the reply. I am using version 0.11.0, but ours is a batch job, not a streaming job. Following the config you advised, I found the exception-handling logic in...

For an RO table, a delta commit can only be seen after the compaction action.

I think setting `spark.sql.parquet.mergeSchema` to true can solve this error. But could there be data files with different schemas in this partition? I think we need to check that.
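A minimal sketch of the schema check suggested above, in plain Python. It assumes the per-file schemas have already been collected (for example from the Parquet footers) into a dict. The file names, column lists, and the helpers `group_files_by_schema` and `has_schema_drift` are hypothetical illustrations, not part of Hudi or Spark.

```python
from collections import defaultdict


def group_files_by_schema(file_schemas):
    """Group data files by their schema.

    file_schemas maps a file name to a sequence of (column, type) pairs.
    Returns a dict mapping each distinct schema (as a tuple) to the sorted
    list of files that use it. Column order matters in this comparison.
    """
    groups = defaultdict(list)
    for name, schema in file_schemas.items():
        groups[tuple(schema)].append(name)
    return {schema: sorted(names) for schema, names in groups.items()}


def has_schema_drift(file_schemas):
    """Return True when the files do not all share one schema."""
    return len(group_files_by_schema(file_schemas)) > 1


# Hypothetical contents of one partition (assumed data, for illustration only).
partition = {
    "part-0001.parquet": [("id", "long"), ("name", "string")],
    "part-0002.parquet": [("id", "long"), ("name", "string")],
    "part-0003.parquet": [("id", "long"), ("name", "string"), ("ts", "timestamp")],
}

if has_schema_drift(partition):
    for schema, files in group_files_by_schema(partition).items():
        print(len(schema), "columns:", files)
```

If more than one group comes back, the partition does hold files with different schemas, which is the situation where schema merging on read would matter.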

@ehurheap ```shell 22/07/22 19:18:58 ERROR SparkMain: Fail to execute commandString org.apache.spark.sql.AnalysisException: cannot resolve '_hoodie_record_key' given input columns: []; line 5 pos 15; 'UnresolvedHaving ('dupe_cnt > 1) +- 'Aggregate ['_hoodie_record_key], ['_hoodie_record_key...

@ehurheap it looks like the target of `repair deduplicate` is the base file. Do you see any duplicate data when you query or run other operations? For log files, I think it will combine the...

Hello @codope, I have a question about problem `b` that you mentioned. Why not delete the extra orphaned files using the marker files and commit metadata, as for a COW table, but...

@danny0405 sorry, I don't quite understand. For example, a write task creates a new a_0-0-1.logfile, and its speculative task also creates a new a_0-0-2.logfile; these two files are the same...
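To make the example concrete, here is a rough sketch in plain Python that spots files differing only in the last number of the name, as `a_0-0-1.logfile` and `a_0-0-2.logfile` do. The name pattern is a simplification taken from the example names only. Reading the three numbers as task, stage, and attempt is an assumption, and real Hudi log file names have a different layout.

```python
import re
from collections import defaultdict

# Hypothetical simplified pattern: <file id>_<task>-<stage>-<attempt>.logfile
# The meaning given to the three numbers is an assumption for this sketch.
NAME = re.compile(
    r"^(?P<file_id>.+)_(?P<task>\d+)-(?P<stage>\d+)-(?P<attempt>\d+)\.logfile$"
)


def find_duplicate_attempts(names):
    """Return groups of files that differ only in the attempt number.

    The key is (file_id, task, stage); only groups with more than one file
    are returned. Names that do not match the pattern are ignored.
    """
    groups = defaultdict(list)
    for name in names:
        match = NAME.match(name)
        if match is None:
            continue
        key = (match["file_id"], int(match["task"]), int(match["stage"]))
        groups[key].append(name)
    return {key: sorted(files) for key, files in groups.items() if len(files) > 1}


files = ["a_0-0-1.logfile", "a_0-0-2.logfile", "b_1-0-1.logfile"]
print(find_duplicate_attempts(files))
```

This only groups candidate duplicates by name; deciding which file to keep would still depend on the markers and commit metadata.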