Volker L. comments

Results: 46 comments of Volker L.

Dimensionality Reduction (approximation) along columns/time axis?

Hi Johann, my mistake was to think of every timestamp as a new sample and every feature as a different measurement (e.g. temperature and pressure). But this is only true,...

[Python] group_by functionality directly on large dataset, instead of on a table?

Hope this is not off-topic, but you can leverage `duckdb` or `polars` for this.

```python
import duckdb
import pyarrow.dataset as ds
import polars as pl

dset = ds.dataset('path/to/data')
# duckdb...
```
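The snippet above is cut off, so what follows is not a reconstruction of it. It is a minimal, standard-library-only sketch of the general idea behind running a group-by over a dataset that is read in batches instead of being loaded as one table: keep small running aggregates per key and update them batch by batch. The `(key, value)` row layout and the names `grouped_mean` and `example_batches` are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple

Row = Tuple[str, float]  # hypothetical schema: (group key, value)


def grouped_mean(batches: Iterable[List[Row]]) -> Dict[str, float]:
    """Compute a per-key mean while holding only one batch in memory."""
    sums: Dict[str, float] = defaultdict(float)
    counts: Dict[str, int] = defaultdict(int)
    for batch in batches:
        for key, value in batch:
            # Update the running aggregates; the batch can be dropped afterwards.
            sums[key] += value
            counts[key] += 1
    return {key: sums[key] / counts[key] for key in sums}


def example_batches() -> Iterator[List[Row]]:
    """Stand-in for batches streamed from a large on-disk dataset."""
    yield [("a", 1.0), ("b", 10.0)]
    yield [("a", 3.0)]
    yield [("b", 20.0), ("b", 30.0)]


result = grouped_mean(example_batches())
print(result)
```

Only the per-key sums and counts stay in memory, so the approach scales with the number of distinct keys instead of the number of rows.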

[Python] group_by functionality directly on large dataset, instead of on a table?

What is the size of the dataset and where is it stored? In an S3 bucket? If so, this could be interesting for you: https://github.com/apache/arrow/issues/14336

[Python] group_by functionality directly on large dataset, instead of on a table?

> > Thank you @legout. Duckdb works really well, but polars is struggling. Maybe I am doing something wrong.
> >
> > But anyway here is how it worked for me...

How to implement work through a proxy server?

Any news here?

Partition aware parquet scanning

I am looking forward to a polars-native solution. My current workaround is the following:

```python
def read_parquet_dataset(path, partitioning=None, filter_=None, with_columns=None, storage_options=None):
    if storage_options is not None:
        fs = s3fs.S3FileSystem(**storage_options)
        files = ["s3://+f...
```
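Since the workaround above is truncated, the following does not try to complete it. It is a small sketch of what "partition aware" scanning can mean in practice, assuming a hive-style directory layout (`key=value` path segments): read partition values from the file paths and skip files that cannot match the filter before any Parquet data is opened. The helper names `parse_hive_partitions` and `prune_files` and the example paths are hypothetical.

```python
from typing import Dict, Iterable, List


def parse_hive_partitions(path: str) -> Dict[str, str]:
    """Extract key=value directory segments from a hive-style path."""
    parts: Dict[str, str] = {}
    for segment in path.split("/")[:-1]:  # the last segment is the file name
        if "=" in segment:
            key, _, value = segment.partition("=")
            parts[key] = value
    return parts


def prune_files(paths: Iterable[str], wanted: Dict[str, str]) -> List[str]:
    """Keep only files whose partition values match every entry in `wanted`."""
    kept: List[str] = []
    for path in paths:
        parts = parse_hive_partitions(path)
        if all(parts.get(key) == value for key, value in wanted.items()):
            kept.append(path)
    return kept


# Hypothetical file listing, e.g. as returned by a filesystem glob.
files = [
    "data/year=2021/month=12/part-0.parquet",
    "data/year=2022/month=01/part-0.parquet",
    "data/year=2022/month=02/part-0.parquet",
]
selected = prune_files(files, {"year": "2022"})
print(selected)
```

The pruned list can then be handed to whichever reader is in use, so only the matching partitions are ever downloaded or scanned.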

Support adding prefixes to `DataFrame.unnest`

Cloud Tier - backend storage type rclone not found with v3.61

Cloud Tier - backend storage type rclone not found with v3.61

Aiohttp slows down when using proxies