Matthew Powers

Results 310 comments of Matthew Powers

@souvik-databricks - can you add some tests for these WIP functions? Think that'll make it easier for me to see what functionality you're going for here. Thanks!

@CommanderWahid - you can fork the repo and send a pull request: https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/proposing-changes-to-your-work-with-pull-requests/creating-a-pull-request-from-a-fork I look forward to receiving your contribution. This will be a valuable addition to the community.

@robertkossendey - yep, sounds awesome, thanks!

@jaceklaskowski - thanks for commenting and I agree that this would be better in `DeltaTable` instead. The [Python API for adding constraints](https://github.com/MrPowers/mack/issues/106) would probably be better as an official API...

@danielbeach - yep, this one is still open. Try to break stuff ;) For example, `mack.drop_duplicates_pkey(delta_table=deltaTable, primary_key="col1", duplication_columns=["col2", "col3"])` assumes that `col1` is in fact a unique primary key. What...

@robertkossendey - this would be great, but it really feels like this one belongs in Delta itself. Can you open an issue in [delta-io/delta](https://github.com/delta-io/delta) and see if we can get...

@souvik-databricks - yep, looks like you're thinking about this well 😉 I just found these definitions: * Natural key: an attribute that can uniquely identify a row, and exists in...

@souvik-databricks - yep, I'm aware that `pandas_api` is part of OSS PySpark, but I don't think we should be using it 😉

@robertkossendey - yea, think we can go ahead and add this feature. Feel free to propose your suggested API and we can brainstorm before it gets built. Thank you!