Jeroen van Zundert

Results 108 comments of Jeroen van Zundert

Added performance label rather than feature, as the output is correct/unchanged. I just checked the Python example on latest master (July 17, 2022), and the latest assertions still fails despite...

Hello, I have a couple of questions that would help us all: 1. Could you provide code examples of what you are trying to achieve? Please make these self-contained, so...

@AlexanderVanEck : if you have found a solution in the mean time, I am sure there are people around here that would appreciate your solution to learn from. My hunch...

Btw, I am removing the feature label, because I think this is more a question asking for an example of how to do something, rather than a request for specific...

Updated the labels to reflect that the output is as expected, just not at the desired performance level. @vikigenius : could you provide feedback on whether the `row_group_size` parameter added...

What about using [pl.cut](https://pola-rs.github.io/polars/py-polars/html/reference/api/polars.cut.html) on the Series (no expressions interface yet it seems unfortunately), and doing a value counts? Maybe I am missing the point. (sorry, can't test this myself...

Hello, thank you for this request. Please note that `describe` is just a short-hand iterating over the aggregation functions: https://github.com/pola-rs/polars/blob/093d55c6b4bed6b76f3d814e7e66030dac3c4f87/py-polars/polars/internals/frame.py#L2484 You could thus simply create your own (note: not tested!):...

See the `qs` input argument, that is the percentiles as fractions: ```python df = pl.DataFrame(...) describe(df, qs=[0.1, 0.25, 0.5, 0.75, 0.9]) ```

@philss : that seems sensible to me, but I'm not a Rust programmer. So that will have to be someone else. Feel free to raise a PR.

@ML-BCB : I'm trying to understand your example. Could you: 1. Make the dummy data much smaller, and not using random functions but just hard coded values? That makes it...