evalml issues

TimeSeriesImputer should not allow interpolate as strategy for boolean or categorical targets

Currently the `target_impute_strategy` is applied to any kind of target data, independent of whether or not the strategy makes sense for that kind of data. This is only problematic for...

tamargrey

SimpleImputer can raise `TypeConversionError` if `mean` or `median` strategy used with boolean data

1

The following code will attempt to use the `mean` and `median` strategies with boolean data, which converts the values to floats and then imputes whatever the mean and median of...

tamargrey

Remove prophet-specific testing

1

With the resolution of #3908, prophet is no longer a "special" dependency that can only be installed on some machines and needs explicit testing. We should remove our prophet-specific make...

eccabay

Make `_drop_time_index()` public and call when using `transform_all_but_final()`

- As a user of EvalML, I expect an estimator-ready dataframe when I call `transform_all_but_final()`. However, for time series problems, this dataframe includes the datetime column even if it is...

christopherbunn

enhancement

refactor

TimeSeriesImputer: Remove nullable type handling when pandas adds nullable type support for interpolate

Once https://github.com/pandas-dev/pandas/issues/41565 has been implemented and released, we should upgrade pandas to that version, which will allow us to remove the nullable type handling put in place by https://github.com/alteryx/evalml/issues/4001.

tamargrey

ARIMARegressor should fill boolean nans with the mode then convert to double to fully support boolean values

3

- As a user, I wish I could pass any boolean column into the ARIMARegressor component. Currently, if you pass in a column with the `Boolean` logical type, we convert...

tamargrey

new feature

Use enable_categorical flag in XGBClassifier to avoid encoding categorical features

- With XGBoost 1.5.0, you can now use `enable_categorical` argument to pass categorical data (which avoids us needing to one-hot encode categorical columns) - https://xgboost.readthedocs.io/en/stable/python/examples/categorical.html#sphx-glr-python-examples-categorical-py ```python import xgboost as xgb...

gsheni

LogTransformer can raise `TypeValidationError` if integer nullable y is passed in

1

```python import woodwork as ww X = pd.DataFrame({ "nullable bool col": [True, False, False, True, True] * 4, "nullable int col": [0, 1, 2, 0, 3] * 4, }) X.ww.init()...

tamargrey

Use `.apply` to change categories' dtype in `handle_float_categories_for_catboost`

From https://github.com/pandas-dev/pandas/issues/51074 using `apply(str)` can be used to set the float categories to be strings, and we can try to see if that lets us use the actual float categories....

tamargrey

Error while running make_pipeline_from_data_check_output while applying data checks suggestion

1

actions_pipeline = make_pipeline_from_data_check_output(problem_type, messages) data_df, y = actions_pipeline.fit(data_df, target) ################################################# Error Message: File /anaconda/envs/azureml_py310_sdkv2/lib/python3.10/site-packages/woodwork/logical_types.py:475, in IntegerNullable.transform(self, series, null_invalid_values) 473 if null_invalid_values: 474 series = _coerce_integer(series) --> 475 return super().transform(series) File...

sainiudit

bug

evalml
evalml copied to clipboard

Metadata

TimeSeriesImputer should not allow interpolate as strategy for boolean or categorical targets

SimpleImputer can raise `TypeConversionError` if `mean` or `median` strategy used with boolean data

Remove prophet-specific testing

Make `_drop_time_index()` public and call when using `transform_all_but_final()`

TimeSeriesImputer: Remove nullable type handling when pandas adds nullable type support for interpolate

ARIMARegressor should fill boolean nans with the mode then convert to double to fully support boolean values

Use enable_categorical flag in XGBClassifier to avoid encoding categorical features

LogTransformer can raise `TypeValidationError` if integer nullable y is passed in

Use `.apply` to change categories' dtype in `handle_float_categories_for_catboost`

Error while running make_pipeline_from_data_check_output while applying data checks suggestion

← Metadata

Owner

Metadata

evalml evalml copied to clipboard

Metadata

← Metadata

Owner

Metadata

evalml
evalml copied to clipboard