Niels Bantilan

Results 468 comments of Niels Bantilan
It might make sense to bring back the warning that `hypothesis` raises with `example`. It's really meant more for interactively debugging and examining strategies, and not for any serious production...

closing this issue, @tmcclintock FYI I created https://github.com/unionai-oss/pandera/issues/1625 to articulate what would be needed to improve the performance of pandera strategies overall.

hi @koalp thanks for the PR! This is blocked by a big pandera internals re-write: https://github.com/unionai-oss/pandera/pull/913, I'll ping you here once that's merged, as the changes you've added will need...

Hi @andrew-infogrid this is indeed a bug! Very strange... it may have to do with `check_types` not understanding `pat.DataFrame` objects, will need to investigate

Oh, I see what's happening. So this is a gotcha due to inplace-mutation of data. When you define a pandera-typed dataframe with `pat.DataFrame[Schema](...)`, pandera injects a `df.pandera.schema` into the dataframe....

> I don't, however, understand why `with_pydantic=True` changes this behaviour though

Yeah, not sure either, will need to investigate. Btw can you share a snippet of code you'd typically...

Thanks for the edited snippet, I'll play around with this over the next few days.

> My big concern is that any method inherited from pd.DataFrame that returns self (or...

This seems like a useful feature! I'd support this effort. @NeerajMalhotra-QB @jaskaransinghsidana @filipeo2-mck any thoughts on this? It would incur significant compute cost, since right now the pyspark checks queries...

Okay, so high-level steps for this issue:

1. Introduce `PANDERA_FULL_TABLE_VALIDATION` configuration. By default it should be `None` and should be set depending on the validation backend. It should be...
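As a rough illustration of the first step, here is a minimal sketch of a tri-state setting where `None` means "defer to the backend default". It assumes the value is read from an environment variable. The helper names (`parse_flag`, `resolve_full_table_validation`) and the placeholder backend names are hypothetical and are not pandera's API, and the actual per-backend defaults are not stated in the comment above, so the caller supplies them.

```python
import os
from typing import Mapping, Optional


def parse_flag(raw: Optional[str]) -> Optional[bool]:
    """Parse a tri-state flag: unset or empty means None (no explicit choice)."""
    if raw is None or raw.strip() == "":
        return None
    value = raw.strip().lower()
    if value in {"1", "true"}:
        return True
    if value in {"0", "false"}:
        return False
    raise ValueError(f"cannot interpret {raw!r} as a boolean flag")


def resolve_full_table_validation(
    backend: str,
    backend_defaults: Mapping[str, bool],
    environ: Optional[Mapping[str, str]] = None,
) -> bool:
    """Return the explicit setting if present, else the backend's default.

    `backend_defaults` is a hypothetical mapping supplied by the caller;
    it stands in for whatever each validation backend would choose.
    """
    env = os.environ if environ is None else environ
    explicit = parse_flag(env.get("PANDERA_FULL_TABLE_VALIDATION"))
    if explicit is not None:
        return explicit
    return backend_defaults[backend]


if __name__ == "__main__":
    # Placeholder backends and defaults, chosen only for the demo.
    defaults = {"backend_a": True, "backend_b": False}
    print(resolve_full_table_validation("backend_a", defaults, {}))
    print(
        resolve_full_table_validation(
            "backend_b", defaults, {"PANDERA_FULL_TABLE_VALIDATION": "true"}
        )
    )
```

Keeping `None` distinct from `False` is what lets each backend pick its own default while still letting a user override it explicitly.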

Happy to help review and take a PR over the finish line if someone wants to take this task on @zaheerabbas-prodigal