reuster986

Results 10 issues of reuster986

I think these are great candidates for the next server-side objects because: - CI is already written - They are fairly simple wrappers around single integer arrays without too much...

enhancement
In Progress

I have created a branch with a benchmark workflow script (https://github.com/Bears-R-Us/arkouda/blob/workflow-benchmark/benchmarks/workflow.py) that is intended to simulate a full workflow involving a lot of the akutil functionality we recently ported in....

enhancement
question

PR #1060 introduced a way to do computations on strings that follow the locality of the bytes, rather than the segment pointers. For hashing strings, this approach has a little...

performance

In the discussion of PR #618 @bradcray mentioned that adding `-schpl_serializeSlices=true` to the makefile would help the performance of slicing distributed arrays. He mentioned having brought it up in the...

question
performance

When `ak.abs()` is called on an `int64` array that includes the value -(2**63), it fails to flip the negative sign. Reproducer: ```python >>> x = [-1, -(2**63), 1] >>> ak.abs(x)...

bug

Multiple users have expressed a moderate desire to be able to save arrays to a single HDF5 file, as opposed to one file per locale. I think this is possible...

enhancement
Chapel
In Progress

I can't seem to find the discussion right now, but we've been mulling over whether to explore a different sorting algorithm, such as the `twoArrayRadixSort` already in Chapel. The current...

Desired functionality: * Ability to groupby multiple keys (requires multi-column argsort from #94) * get_group(key) function that returns the segment corresponding to a key * expand(vals) function that takes an...

enhancement

The current `GroupBy` logic for strings is optimized for grouping a single array of long (or variable-length) strings. However, there are at least two other cases that could benefit from...

performance

The [database-like ops benchmark](https://h2oai.github.io/db-benchmark/) compares the performance of a dozen or so frameworks on common EDA tasks, all involving groupby or join. Because of its emphasis on groupby/join, this is...

enhancement