Ryan Blue comments

Results 205 comments of


                                            Ryan Blue

GH-3223: Implement Variant parquet writer

Merged! Thanks @cashmand for getting this in. Nice work!

Core: Fix the behavior of IncrementalFileCleanup when expire a snapshot

@amogh-jahagirdar, @hantangwangd, I'm not sure that incremental cleanup is doing anything wrong here. Incremental cleanup deletes data files when the snapshot that removed them from the table is expired. If...

Allow sparksql to override target split size with session property

I thought this was already supported, but I don't see it. The way we did this at Netflix was to add a table-specific property to SQL, like `spark.sql.iceberg.db.table.split-size=...`. This is...

Spark3 structured streaming enable updates

I think this is a good idea. I'll put it in my queue to review.

Core: Add metrics reporter for serializable table

@nastra, where are we with the fix for serialization here?

Push down group by for partition columns

Oops, I didn't mean to close this! I want to work on getting it in next

Parquet: Implement column index filter and update row read path to support page skipping

@zhongyujiang can you please add more to the description about what is included here and how you solved the problems with record materialization?

Field comments are not written for timestamp field

@itaise, it looks like you're saving the table as Parquet, not Iceberg. That could be a problem.

Spec: Fix table of content generation

I think this is fine. @danielcweeks is the organization what you want?

Core: Interface based DataFile reader and writer API

While I think the goal here is a good one, the implementation looks too complex to be workable in its current form. The primary issue that we currently have is...