datatable
datatable copied to clipboard
A Python package for manipulating 2-dimensional tabular data structures
Not sure if there is an existing ticket, so creating this. Adding a `na.rm`(similar to R) or `skipna` (pandas flavour) parameter in aggregations might be useful I observed this in...
This is a feature request to implement AVRO and ORC reads into datatable. The ORC read function would be akin to the Pandas implementation: ```python df = pd.read_orc("my_orc_file") ``` To...
Hi, apologies if I duplicated anything but I didn't find anything like this: As far as I have seen it is only possible to use replace('a', 'b') for the whole...
Following the documentation, you can get the count per group using: ``` x[:, { 'count': dt.count() }, by('id') ] ``` When trying to also include the percentile of a column...
Currently it is impossible to create a datatable frame from numpy arrays with unsigned integer types: ```python >>> import datatable as dt >>> import numpy as np >>> NP =...
Right now datatable reducers are only parallelized in terms of the groups. When there is one group, even if it is huge, datatable does all the work in single thread....
```python from datatable import dt, f, sort DT = dt.Frame([[1], [1]]) DT[:, :, sort(f[:])] ``` produces ```python AssertionError: Assertion 'flags.size() == ncols' failed in src/core/expr/eval_context.cc, line 264 ```
I am reading a large .csv file but when I try to print the first row or any other operation I get the following error `RuntimeError: Unknown stype time64 `...
RIght now datatable frames cannot be created from `pd.array()` ```python >>> import datatable as dt >>> import pandas as pd >>> PD = pd.array([1, 2, None]) >>> DT = dt.Frame(PD)...
`.replace()` [should operate](https://datatable.readthedocs.io/en/latest/api/frame/replace.html) on columns of types appropriate for the `replace_what` value. At the same time, we [declare](https://datatable.readthedocs.io/en/latest/api/type/void.html) that `void` columns can be used in place where any other type...