Joel

Results 347 comments of Joel

If that's the case, we should be able to write a `kedro upgrade` function?

This is slightly unscientific, but I trust the vibes in the industry enough to say Iceberg will clearly be the winner in long term. ![image](https://github.com/user-attachments/assets/766d8b8b-5001-47ef-868e-cb222612df6a) Plus people saying things like...

Delta is 100% more mature, Iceberg is the horse to back. This is the thread I was trying to find earlier: https://x.com/sean_lynch/status/1845500735842390276 I also don't think we should be wedded...

So I'm actually being bullish and saying we _should_ pick one of these when it comes to **our idea of versioned data**. We simply don't have capacity to integrate everywhere...

Super cool application of these concepts https://datamonkeysite.com/2024/11/10/smart-data-pipeline-design-check-for-delta-table-changes-with-minimal-overhead/

>I'm with @deepyaman on this one. There should be a layer in Kedro that is format-agnostic. We can be more opinionated in a higher layer. I just want to warn...

My view: >1. Does it make sense to use it for anything else but versioning tabular data? I'm willing to bet >95% of use cases fall into this. > 2....

Thanks Nok this is super helpful, in general `upserts` don't canonically make sense in Kedro

Also search by kind - if I wanted to find all Parquet files today I'd have to get very creative. Retrieving paths associated with those would be super complicated.

Why not expose `catalog.keys()` I think they need a way of accessing things declared in the catalog in interactive environments