Seth Michael Larson issues

Results 159 issues of


                                            Seth Michael Larson

Add support for simple slicing of DataFrames

Add support for: - `df[:]` - `df[::]` - `df[:100]` - `df[100:]` - `df[100:200]` - `df[100:][1:]` Slicing functionality that should raise a ValueError: - `df[::2]` (step) - Probably others?

enhancement

help wanted

topic:series

topic:dataframe

Ensure support for pandas 1.4.x

May be a no-op, but we need to run our test suite with pandas 1.4.x to make sure.

Add a way to access _score from DataFrame when using scoring filters

Relates to #282 it'd be nice to be able to access the `_score` value (and sort by it too). Need to find out how we should expose the `_score` information...

enhancement

topic:series

topic:dataframe

Create a test case for using Eland with a datastream

Currently we don't test (or document) that Eland should work with data streams, we should probably test that everything works properly.

documentation

enhancement

help wanted

good first issue

topic:series

topic:dataframe

Allow for sampling to be used for large/dense datasets

Pulled from: - https://github.com/elastic/eland/issues/118#issue-556293360 - https://github.com/elastic/eland/issues/118#issuecomment-590240334

topic:dataframe

Add support for pd.Grouper() in DataFrame.groupby()

- Alias `eland.Grouper()` to be the same as `pd.Grouper()` for convenience - Eventually will be used for `Grouper(..., freq="...")` with timestamp bucketing.

enhancement

help wanted

topic:dataframe

Implement DatetimeIndex

See [`pandas.DatetimeIndex`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DatetimeIndex.html) and [`date_range`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.date_range.html#pandas.date_range). End goal is to be able to visualize time-series data on a histogram.

enhancement

topic:dataframe

Allow filtering on a GroupByDataFrame

Pretty sure pandas allows you to filter once you've called groupby(), we should as well.

enhancement

help wanted

good first issue

topic:dataframe

Add TQDM progress bar for Jupyter Notebook pandas_to_eland() calls

- https://github.com/tqdm/tqdm#ipythonjupyter-integration - Units are rows/s - Maybe hide progress bar on completion? Need to test it out first. - Add an option `show_progressbar` which defaults to `None` - If...

enhancement

help wanted

good first issue

Implement [DataFrame Series].fillna() and [DataFrame, Series].dropna()

`fillna()` can be implemented as a post-processing task. `dropna()` can be implemented as a query filter For now we can support `axis` only as `axis=1` for "columns", some options may...

enhancement

topic:series

topic:dataframe