vaex
vaex copied to clipboard
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
Hi, I have a large time series dataset in which I am using vaex for the backend of a graph to dynamically downsample/upsample when zooming in and out. I am...
This gives an option to try to not use memory mapping, which might be useful for debugging, or to inspect the performance under OSX using mmap or not.
Proposal: since we are soon moving to a new major version, I propose we remove some of the deprecated methods, like `label_encode`, especially since there are methods superseding them (`ordinal_encode`...
This PR enables the subtraction a datetime64 expression from a constant np.datetime64 values. Note: the reverse works, i.e. subtracting a constant np.datetime64 value from a datetime expression works well. The...
This PR fixes several (hopefully) small issues that pop up when using string columns with vaex, now that it is primarily arrow based. List of new / updated unit tests:...
Fixes #856 @JovanVeljanoski : could you finish this further? Basically we have to repeat this for every item in 4be22b7f102a6873f5d4d24c135c73f14a167cc2
implement a cardinality measure. Simply put: ```len(df.x.unique())/len(df)``` Notes: Cardinality By definition is just the count of different elements, so maybe the naming is not optimal.
These masks are pointless
Hello, author. Recently, I encountered an issue with vaex when using vaex dataframe.rolling. When I attempt try_rolling_obj = try_group_df.rolling(window=window, trim=False, edge='right', column=['c']), even though my data is complete without any...