Karl Genockey comments

Results: 271 comments of Karl Genockey

Phantom column in lazy frame

Can reproduce. It seems the `_right` suffix is completely ignored for whatever reason.

```python
(df1.join(df2, on="a")
    .select(pl.col("b").filter(pl.col("b") == pl.col("foo_right")))
    .collect()
)
# ComputeError: column 'foo' not available in 'DataFrame' with Schema:...
```

`filter`ing a `concat`enated LazyFrame raises incorrect `ColumnNotFoundError` exception

Can reproduce. Seems to be one of those optimizer issues:

```python
>>> df.filter(pl.col("x").eq("a")).collect(comm_subplan_elim=False)
shape: (2, 2)
┌─────┬─────┐
│ x   ┆ y   │
│ --- ┆ --- │
│ str ┆...
```

`filter`ing a `concat`enated LazyFrame raises incorrect `ColumnNotFoundError` exception

@knl You can try disabling specific optimizations to check for potential causes, e.g. `.collect(comm_subplan_elim=False)`. The example query in this issue ran for me under two situations: https://github.com/pola-rs/polars/issues/15980#issuecomment-2088234408
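The "disable one optimization at a time" approach can be sketched as a small loop. This is only an illustrative sketch: `find_suspect_flags` and `fake_collect` are hypothetical names, and the only flag taken from the comment above is `comm_subplan_elim`; any other flag name depends on the Polars version in use and is an assumption here.

```python
from typing import Callable, Iterable


def find_suspect_flags(run: Callable[..., object], flags: Iterable[str]) -> list[str]:
    """Return the flags for which `run(flag=False)` succeeds."""
    suspects = []
    for flag in flags:
        try:
            run(**{flag: False})  # re-run with this one optimization disabled
        except Exception:
            continue  # still failing, so this flag is probably not the cause
        suspects.append(flag)
    return suspects


# Stand-in for a real query (hypothetical): fails unless one flag is turned off.
def fake_collect(**kwargs: bool) -> str:
    if kwargs.get("comm_subplan_elim", True):
        raise RuntimeError("column not found")
    return "ok"


suspects = find_suspect_flags(fake_collect, ["predicate_pushdown", "comm_subplan_elim"])
print(suspects)
```

With a real LazyFrame, `run` could wrap the failing query's `.collect(...)` call, assuming the installed Polars version accepts these keyword arguments.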

Reading CSV, Polars seems to ignore the provided Schema

If I understand correctly, this appears to be a minimal repro?

Data:

```shell
wget https://nemweb.com.au/Reports/Current/Daily_Reports/PUBLIC_DAILY_202401270000_20240128040505.zip
unzip PUBLIC_DAILY_202401270000_20240128040505.zip
```

`.read_csv` works as expected.

```python
import polars as pl

pl.read_csv(
    "PUBLIC_DAILY_202401270000_20240128040505.CSV",
    skip_rows=1,...
```

Reading CSV, Polars seems to ignore the provided Schema

I suppose the data is not actually needed. Simpler repro:

```python
import polars as pl
import tempfile

with tempfile.NamedTemporaryFile() as f:
    f.write(b"""
A,B,C
1,2,3
4,5,6,7,8
9,10,11
""".strip())
    f.seek(0)
    df =...
```
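The repro data above contains one row with more fields than the header. To check whether another file has the same shape, a minimal standard-library sketch might look like the following. It does not use Polars, and `find_ragged_rows` is a hypothetical helper name, not an existing API.

```python
import csv
import io


def find_ragged_rows(text: str) -> list[tuple[int, int]]:
    """Return (line_number, field_count) for rows whose width differs from the header."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    return [
        (reader.line_num, len(row))
        for row in reader
        if len(row) != len(header)
    ]


# Same rows as the repro above.
data = "A,B,C\n1,2,3\n4,5,6,7,8\n9,10,11"
ragged = find_ragged_rows(data)
print(ragged)
```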

Reading CSV, Polars seems to ignore the provided Schema

It looks like @filabrazilska did file a PR to address this, https://github.com/pola-rs/polars/pull/15305, but it hasn't been reviewed yet. (PRs can be linked to issues with keywords: https://docs.github.com/en/issues/tracking-your-work-with-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword)

Cryptography cant find file.

Cryptography cant find file.

concat_list raises an error or returns an empty list if one of the filtered cols inside is empty

Partition_by should returns tuples as keys when partitioning on a list of a single column, like group_by does