Scott Sandre

Results 160 comments of Scott Sandre

@tdas > we don't run python tests for spark master at the moment

delta-io/delta#3508

@sclmn are you saying that when `delta.checkpoint.writeStatsAsStruct` is true, delta-spark is not writing out the `stats_parsed` field in the delta checkpoint? That seems like a bug. Thanks for pointing this...

@prakharjain09 can you take a look?

Hi @peeyushgupta1 -- sorry for the delayed response! Yup we have plans for this to be picked up starting roughly next week!

@dhruvarya-db and @felipepessoto -- is this considered a bug or correctness fix? Thats' the only reason we would want to backport it to the 4.0 branch

@michelleon -- That's a great suggestion. Will implement that. Thanks!

Hi @Kimahriman, thanks for bringing this to our attention. We will look into this.

Another observation of mine: not sure if our hashing is even/fair enough. The max difference between any two threads in a given shard is 68% (i.e. one thread has 68%...

![image](https://github.com/user-attachments/assets/427001a0-d81e-41c9-8f36-6c4e0ef98551) 24 minute disparity between first and last