clumper
clumper copied to clipboard
A small python library that can clump lists of data together.
**Is your feature request related to a problem? Please describe.** As mentioned in the codebase itself, `inner_join` and `left_join` methods implementation is "naive" and speedup is possible. I figured it...
Currently supporting `"second", "minute", "hour", "days", "month"`. It's in the plan to add `year` but it becomes complicated as `timedelta` doesn't allow `months` as parameter. I suppose they didn't wanted...
**Is your feature request related to a problem? Please describe.** Our main class is becoming a monolith. Currently `Clumper` class is over 1500 lines. The major contributor is the documentation...
It would be nice if we could group-by day/week/hour given a timestamp. We should first discuss a proper API before making an implementation but this would really be a nice...
**Is your feature request related to a problem? Please describe.** Looking at the API, it does look a bit odd to me that one needs to call a class method...
This dataset might be fun to check; https://hugovk.github.io/top-pypi-packages/top-pypi-packages-365-days.json Ideas; - how many of these tools come from cloud-providers? - how many of these tools are deprecated? - do these download...
``` clump.impute(a=1) ```
I noticed a job fail with this traceback: ```python Traceback (most recent call last): File "/home/vincent/Development/gh-dashb/scripts/grab_workflows.py", line 71, in typer.run(scrape_workflows) File "/home/vincent/Development/gh-dashb/venv/lib/python3.7/site-packages/typer/main.py", line 859, in run app() File "/home/vincent/Development/gh-dashb/venv/lib/python3.7/site-packages/typer/main.py", line...
Let's say that I have the monopoly dataset. I have rows such as; ```python {'name': 'Boardwalk', 'rent': '50', 'house_1': '200', 'house_2': '600', 'house_3': '1400', 'house_4': '1700', 'hotel': '2000', 'deed_cost': '400',...
The user can read in **.json** file by just renaming it to **.jsonl**. With the current code, Clumper would parse it one line as a big dictionary. Unexpected behaviour will...