vaex icon indicating copy to clipboard operation
vaex copied to clipboard

Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀

Results 194 vaex issues
Sort by recently updated
recently updated
newest added

Following up on https://github.com/vaexio/vaex/issues/2087#issuecomment-1163799755, the`vaex.dataframe.DataFrameLocal.describe()` method does not work for list types. Seems those types are not included in `map_arrow_to_numpy`. ```python data = {"A": [1], "B": pa.array([["a", "b", "c"]])} df...

**Description** got the error, trying to read parquet file: ` Cannot open part-00077-c3446b0f-b1e8-469e-9f3d-4441e1651aa6.c000.snappy.parquet nobody knows how to read it.` Any thoughts how to fix it? **Software information** - Vaex version...

needed: more information

Thank you for reaching out and helping us improve Vaex! Before you submit a new Issue, please read through the [documentation](https://docs.vaex.io/en/latest/). Also, make sure you search through the Open and...

needed: more information

Hi Vaex, I'm concatenating two parquet files using vaex with the script below - initially it expected ~13 hours to do this via the progress bar, but overnight it's crept...

Implementing lazy read of images and html viewing in Jupiter. [Original POC](https://github.com/vaexio/vaex/pull/1288) The new column holds _PIL.Image_. * Open takes a directory (can be nested) of images, or a list...

I am using the Vaex library to construct a dataframe from an HDF5 file dataset and storing and retrieving it with pickle. The following error message appears when I attempt...

**Description** I tried to play with the code found on tutorial but `df.viz.heatmap(df.x, df.y, what=np.log(vaex.stat.count()+1), selection=[None, True], limits='99.7%')` yields an error. I downloaded .ipynb from [here](https://vaex.io/docs/tutorial.html) ```python df.select(df.x > 0)...

Hi all, Does someone know how to calculate the rank of a column by a group? This would be the pandas equivalent: `df.groupby('var1')['var2'].rank(ascending = False, method = 'first')` Thanks in...

Closes: https://github.com/vaexio/vaex/issues/2071 Checklist: - [x] Make unit-tests - [ ] Make tests pass

priority: medium

**Description** is it still supported I do not see changes for last 6 months by the way can it be used on Windows oS?