Jeroen van Zundert comments

Results 108 comments of


                                            Jeroen van Zundert

Apply not being optimized when using LazyFrame [Python]

Added performance label rather than feature, as the output is correct/unchanged. I just checked the Python example on latest master (July 17, 2022), and the latest assertions still fails despite...

GroupBy.pivot | aggregations options per values_column | unique count per grouping aggregation option

Hello, I have a couple of questions that would help us all: 1. Could you provide code examples of what you are trying to achieve? Please make these self-contained, so...

Request for Docs: torch

@AlexanderVanEck : if you have found a solution in the mean time, I am sure there are people around here that would appreciate your solution to learn from. My hunch...

Request for Docs: torch

Btw, I am removing the feature label, because I think this is more a question asking for an example of how to do something, rather than a request for specific...

write_parquet performs very badly on large files compared to write_csv

Updated the labels to reflect that the output is as expected, just not at the desired performance level. @vikigenius : could you provide feedback on whether the `row_group_size` parameter added...

What about using [pl.cut](https://pola-rs.github.io/polars/py-polars/html/reference/api/polars.cut.html) on the Series (no expressions interface yet it seems unfortunately), and doing a value counts? Maybe I am missing the point. (sorry, can't test this myself...

Able to add list of percentiles to df.describe()

Hello, thank you for this request. Please note that `describe` is just a short-hand iterating over the aggregation functions: https://github.com/pola-rs/polars/blob/093d55c6b4bed6b76f3d814e7e66030dac3c4f87/py-polars/polars/internals/frame.py#L2484 You could thus simply create your own (note: not tested!):...

Jeroen van Zundert

Apply not being optimized when using LazyFrame [Python]

GroupBy.pivot | aggregations options per values_column | unique count per grouping aggregation option

Request for Docs: torch

Request for Docs: torch

write_parquet performs very badly on large files compared to write_csv

Histogram of a series

Able to add list of percentiles to df.describe()

Able to add list of percentiles to df.describe()

Improve `DataFrame.describe` by adding information about missing values

Closed argument not working as expected on rolling aggregations