datatable
datatable copied to clipboard
A Python package for manipulating 2-dimensional tabular data structures
When doing ``` ans = x(select=mean(f.v1), groupby="id1") ``` we are getting aggregated results that are also sorted by `id1`. It is sometimes useful to retain original order of values from...
I am not 100% sure whether this is a bug or I'm just doing something wrong, but based on the documentation, this behavior should not happen the way it is....
A linear model we are going to add to `datatable.models` should support binomial and multinomial classifications, as well as numeric regression. It should be a regularized model with stochastic gradient...
Hi, would it be possible to support Pandas ExtensionTypes like [IntervalArray](https://github.com/pandas-dev/pandas/blob/526468c8fe6fc5157aaf2fce327c5ab2a3350f49/pandas/core/arrays/interval.py#L190)s? Right now, it is not possible to read any PyArrow table containing StructTypes using datatable. Also, I would like...
I'd like to filter rows according to functions like ``` lambda x: x[0] in my_list ``` which use pythonic syntax (syntactic sugar). Currently I need to rewrite this to primitive...
Hi, guys! First of all want to say that you do a great thing. And I want to help to make it more fully, useful and popular. I have a...
Does it suppot the cutomize function for the aggregation in j? For example, if I need to obatin the length of each group, I would write an `anyfunction` to calculate...
See this comment: https://github.com/h2oai/datatable/pull/2914#discussion_r604341858
- Did you find a bug in datatable, or maybe the bug found you? `f` symbol accepts mixed-selector types in the `j` section, which is not allowed when not selecting...
Improving random attacker, it was found that we need to adjust datatable behavior in terms of the operations on keyed frames. Since keying a frame is an expensive operation, as...