
A lightweight gradient boosted decision tree package.

13 forust issues

Hi there! I'm using the forust library and have noticed a curious behavior regarding memory consumption. Unlike popular libraries like XGBoost and LightGBM, where memory usage increases significantly with a...

I believe the following implementation has O(n) complexity; correct me if I am wrong. It passes all the tests for `pivot_on_split`. Handling the edge cases made it somewhat hacky. If you...
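As a rough illustration of the idea (not the library's actual Rust implementation), a single-pass partition around a split value runs in O(n) time and O(1) extra space. The function name and signature here are hypothetical:

```python
def pivot_on_split(values, split_value):
    """Partition `values` in place so that all items <= split_value
    come before all items > split_value.

    Returns the index of the first item greater than split_value.
    One pass over the data, so O(n) time and O(1) extra space.
    """
    i, j = 0, len(values) - 1
    while i <= j:
        if values[i] <= split_value:
            i += 1
        else:
            # Move the offending item to the tail and shrink the window.
            values[i], values[j] = values[j], values[i]
            j -= 1
    return i

data = [5, 1, 9, 3, 7, 2]
idx = pivot_on_split(data, 4)
assert all(v <= 4 for v in data[:idx])
assert all(v > 4 for v in data[idx:])
```

Note this partition is not stable: elements swapped to the tail end up in reverse order of encounter, which is usually acceptable for histogram-based tree building but worth calling out.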

For some reason, Shapley value computation runs much slower on Windows than on Linux.

```python
import forust
import xgboost as xgb
import seaborn as sns
import numpy as np

df = sns.load_dataset("titanic")
X...
```

Would you be open to adding an option to serialize a GradientBooster struct using MessagePack instead of JSON? If so, I'd be happy to submit a...
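As a sketch of what this might look like from the Python side: any JSON-compatible structure can also be packed with the `msgpack` package. The `state` dict below is a hypothetical stand-in, not the library's real serialized layout:

```python
import json
import msgpack  # pip install msgpack

# Hypothetical model state; stands in for whatever structure
# GradientBooster currently serializes to JSON.
state = {
    "trees": [{"split_feature": 0, "split_value": 0.5, "weights": [0.1, -0.2]}],
    "learning_rate": 0.3,
}

as_json = json.dumps(state).encode("utf-8")
as_msgpack = msgpack.packb(state)

# MessagePack is a binary format, typically more compact and faster
# to parse than the equivalent JSON text.
print(len(as_json), len(as_msgpack))

# Both formats round-trip back to the same structure.
assert msgpack.unpackb(as_msgpack) == state
assert json.loads(as_json) == state
```

The main trade-off is that MessagePack output is not human-readable, so keeping JSON as the default and MessagePack as an opt-in seems reasonable.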

enhancement

Consider accepting `eval_set` and `sample_weight_eval_set` in the same way the XGBoost scikit-learn API does. This would make the library a drop-in replacement in existing XGBoost scikit-learn pipelines.

This paper could be useful for implementing an unbiased feature importance measure: https://arxiv.org/pdf/2305.10696.pdf

Following the suggestion in this blog post: https://www.statworx.com/en/content-hub/blog/how-to-speed-up-gradient-boosting-by-a-factor-of-two/

This parameter would allow users to specify the range over which a variable can be split. This way splits would never be made at large values that may not be...

To support different evaluation strategies, could a trait be introduced that abstracts the evaluation flow logic, so that other paradigms can be plugged in?
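In Rust this would be a trait; as a language-agnostic sketch of the shape such an abstraction might take (all names hypothetical, written here in Python to match the document's other sample), an evaluation strategy could expose a metric plus an early-stopping decision:

```python
from abc import ABC, abstractmethod
from typing import Sequence


class EvalStrategy(ABC):
    """Hypothetical interface capturing the evaluation flow."""

    @abstractmethod
    def score(self, y_true: Sequence[float], y_pred: Sequence[float]) -> float:
        """Return the metric value for one evaluation round."""

    @abstractmethod
    def should_stop(self, history: Sequence[float]) -> bool:
        """Decide whether training should stop, given past scores."""


class RmseEarlyStopping(EvalStrategy):
    """RMSE metric with patience-based early stopping."""

    def __init__(self, rounds: int = 5):
        self.rounds = rounds

    def score(self, y_true, y_pred):
        return (sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)) ** 0.5

    def should_stop(self, history):
        # Stop when the best score is not within the last `rounds` evaluations.
        if len(history) <= self.rounds:
            return False
        return min(history) not in history[-self.rounds :]


strategy = RmseEarlyStopping(rounds=2)
print(strategy.score([1.0, 2.0], [1.0, 4.0]))
```

The fitting loop would then depend only on the interface, so cross-validated evaluation, multiple metrics, or custom stopping rules become alternative implementations rather than changes to the booster itself.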

Currently in the GradientBooster fit method, all of the data is used to determine the cuts for binning the data. It would likely speed things up if we allowed for a...
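As an illustration of the idea (not the library's actual binning code; function and parameter names are hypothetical, and NumPy is assumed), quantile cut points computed from a random subsample are usually very close to those computed from the full column:

```python
import numpy as np


def candidate_cuts(feature, n_bins=16, sample_size=10_000, seed=0):
    """Compute quantile bin edges from a random subsample of one feature.

    Subsampling before computing quantiles is a common way to speed up
    histogram construction with little loss in split quality.
    """
    rng = np.random.default_rng(seed)
    if len(feature) > sample_size:
        feature = rng.choice(feature, size=sample_size, replace=False)
    # Interior quantiles only; the outer edges are not useful as splits.
    qs = np.linspace(0.0, 1.0, n_bins + 1)[1:-1]
    return np.unique(np.quantile(feature, qs))


x = np.random.default_rng(1).normal(size=100_000)
cuts = candidate_cuts(x, n_bins=8)
print(len(cuts))  # at most n_bins - 1 distinct cut points
```

Exposing something like `sample_size` as a fit parameter would let users trade a little binning precision for a faster fit on large datasets.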