hudi
hudi copied to clipboard
Upserts, Deletes And Incremental Processing on Big Data.
**Describe the problem you faced** When using Hive to query xxx_rt table,if there is no parquet file but only log files, we get a wrong table path. But when the...
**Describe the problem you faced** Hudi Clustering not working. I'm using Hudi Delta streamer in continuous mode with Kafka source. we have 120 partitions in the Kafka topic and the...
22/04/18 19:49:02 INFO timeline.HoodieActiveTimeline: Checking for file exists ?/hudi/***/***_/.hoodie/20220418194506064.deltacommit.requested Exception in thread "pool-24-thread-1" org.apache.hudi.exception.HoodieUpsertException: Failed to upsert for commit time 20220418194506064 at org.apache.hudi.table.action.commit.AbstractWriteHelper.write(AbstractWriteHelper.java:62) at org.apache.hudi.table.action.deltacommit.SparkUpsertDeltaCommitActionExecutor.execute(SparkUpsertDeltaCommitActionExecutor.java:46) at org.apache.hudi.table.HoodieSparkMergeOnReadTable.upsert(HoodieSparkMergeOnReadTable.java:90) at org.apache.hudi.table.HoodieSparkMergeOnReadTable.upsert(HoodieSparkMergeOnReadTable.java:77) at...
The environment is CDH6.3.2 and Hudi is 0.11.1 I want to test delete of spark sql, there are 4 records in the table ,**vehicle_model_id is [100 101 102 105]** when...
Hi We are facing an issue when we first doing bulk_insert for batch load which we had 9 million records , we are getting two tables one is RT and...
**Describe the problem you faced** I am trying to delete from hudi table using spark apis, but I am neither observing any exceptions nor the records are getting deleted. deltacommit...
[SUPPORT] [HUDI-3601] The current Docker demo only for x86 system ,arm64 system can't run success
**_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at [email protected]. - If you...
**_Tips before filing an issue_** - Have you gone through our [FAQs](https://hudi.apache.org/learn/faq/)? - Join the mailing list to engage in conversations and get faster support at [email protected]. - If you...
**Describe the problem you faced** **Scenario #1:** 1)created a dataframe(**targetDf**) and using the below statement to write it in GCS Bucket location (for ex - **locA**) targetDF.write.format(org.apache.hudi).options(hudiWriteConf).mode(SaveMode.Overwrite).save(locA) 2)then we are...
I am doing a PoC of HUDI and I noticed that while using HoodieJavaWriteClient.java, the writes in case of MOR are taking more time when compared to COW. But when...