Michael Mayer
I am using dalex to plot multiple partial dependence plots in Python. It is a very useful package! For some numeric features (especially those with outliers), I would like to...
#### Description
The issue might be related to https://github.com/openml/openml.org/issues, but does not seem to be solved. I need to update the description of a dataset I have uploaded. After logging...
#### What happens? Using "COPY TO" query produces much larger parquet files compared to `pyarrow.parquet.write_table()`. #### To Reproduce ``` import pandas as pd import duckdb from pyarrow import parquet as...
#### Describe the workflow you want to enable
I like the inspect module very much. Sometimes, interpretation of a model is more natural on a different scale (e.g. log) than...
#### Reference Issues/PRs
Implements #22383
#### What does this implement/fix? Explain your changes.
@lorentzenchr This PR implements a clean version of Friedman's H^2 statistic of pairwise interaction strength. It uses...
Roxygen is now closer to Markdown. E.g., instead of the verbose
```
\itemize{
  \item \code{Features} ...
}
```
one can write
```
- `Features`
- ...
```
This makes the...
This PR fixes https://github.com/dmlc/xgboost/issues/7118

In order to keep the `xgb.cv()` API as it is:
- `data` must be an `xgb.DMatrix` object
- which contains the two infos 'label_lower_bound' and 'label_upper_bound'....
Fantastic project. I would love to see the possibility to add interactions on the fly, just like H2O. There, you can provide a list of interaction pairs or, alternatively, a...
In statistics, constructions like this are quite common:
```
if (length(unique(x)) > 27) {
  # Some binning
}
```
If `x` is long and continuous, calling `unique()` seems inefficient (even if...
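The idea behind that request can be sketched in Python (the original snippet is R): stop scanning as soon as more than `k` distinct values have been seen, instead of materializing every unique value first. This is only an illustration under stated assumptions. The helper name `has_more_than_k_unique` is hypothetical and not part of any package mentioned here, and the values are assumed to be hashable.

```python
from typing import Hashable, Iterable


def has_more_than_k_unique(values: Iterable[Hashable], k: int) -> bool:
    """Return True as soon as more than k distinct values have been seen.

    Hypothetical helper for illustration: it short-circuits, so a long
    vector with many distinct values is only partially scanned.
    """
    seen = set()
    for value in values:
        seen.add(value)
        if len(seen) > k:
            # Early exit: the remaining values are never inspected.
            return True
    return False


if __name__ == "__main__":
    x = range(1_000_000)  # stands in for a long, continuous feature
    if has_more_than_k_unique(x, 27):
        print("more than 27 distinct values: apply binning")
```

The set never holds more than `k + 1` elements, so memory stays bounded by `k` regardless of the input length. When the input has few distinct values, the whole input is still scanned.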
Thanks a lot for your awesome package. I keep having trouble with LightGBM's `categorical_feature` support (the data is integer coded, but the algo internally treats them as unordered) and SHAP...