evalml issues

Update pipeline `save` / `load` to use pickle instead of cloudpickle

1

enhancement

Allow duplicate pipeline names in AutoMLSearch

Follow-up from #1858. We should allow pipelines with repeating names to be passed in to `AutoMLSearch`. Some things to consider: 1. The tuners in `AutoMLAlgorithm` are keyed by pipeline name....

freddyaboulton

enhancement

needs design
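
To illustrate the concern about tuners being keyed by pipeline name, here is a minimal sketch in plain Python. It does not use EvalML; `make_unique_names` and the placeholder tuner objects are hypothetical, and suffixing repeated names is only one possible design, not the approach chosen in the issue.

```python
from collections import Counter


def make_unique_names(names):
    """Append a numeric suffix to repeated names so each can key its own entry.

    Hypothetical helper for illustration only; not part of EvalML.
    """
    totals = Counter(names)
    seen = Counter()
    unique = []
    for name in names:
        if totals[name] > 1:
            seen[name] += 1
            unique.append(f"{name} ({seen[name]})")
        else:
            unique.append(name)
    return unique


names = ["Random Forest", "Random Forest", "Elastic Net"]

# Keying by raw name: the second "Random Forest" overwrites the first.
tuners_by_name = {name: object() for name in names}

# Keying by de-duplicated name: every pipeline keeps its own placeholder tuner.
tuners_by_unique_name = {name: object() for name in make_unique_names(names)}
```

With raw names the dictionary holds two entries for three pipelines; with suffixed names it holds three. A suffix scheme could still collide with a user-supplied name such as "Random Forest (1)", so a real design would need to handle that case.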

Add list of contributors to each release

1

As we expand into the open-source community and outside users contribute to EvalML, it would be nice to have a list of contributors for each release as Featuretools, Woodwork, and...

angela97lin

documentation

Add model understanding integration tests

3

In #2815, we added integration tests for DataCheckActions + DataChecks. I think another potential use case is doing AutoMLSearch + model_understanding. We do something similar with LG so I'm open...

freddyaboulton

enhancement

testing

Add vowpal wabbit to AutoMLSearch

1

https://github.com/alteryx/evalml/pull/2846 added vowpal wabbit estimators to EvalML, but they are currently not used in AutoMLSearch. This would require performance testing and determining a good set of hyperparameter ranges for the...

angela97lin

enhancement

new feature

performance

Profile ARIMA to identify performance bottlenecks

In #2365, we noticed that some ARIMA tests took about a minute to run despite only running on 500 rows. @ParthivNaresh thinks it may be because the starting set of...

freddyaboulton

performance

spike
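
As a starting point for the profiling spike, the sketch below uses the standard-library `cProfile` and `pstats` modules. `slow_fit` is a hypothetical stand-in for the expensive call (it is not the ARIMA estimator), and `profile_call` is an illustrative helper name.

```python
import cProfile
import io
import pstats


def slow_fit(n_rows=500):
    """Hypothetical stand-in for an expensive fit on n_rows rows."""
    total = 0
    for i in range(n_rows):
        for j in range(n_rows):
            total += i * j
    return total


def profile_call(func, *args, **kwargs):
    """Run func under cProfile and return a text report sorted by cumulative time."""
    profiler = cProfile.Profile()
    profiler.runcall(func, *args, **kwargs)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats(10)  # top 10 entries
    return buffer.getvalue()


report = profile_call(slow_fit, 500)
print(report)
```

Swapping `slow_fit` for the real fit call on a 500-row dataset should show which functions dominate cumulative time.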

Imputer modifies user data when user passes in a DataTable

3

```python
import woodwork as ww
import pandas as pd
import numpy as np
from evalml.pipelines.components import Imputer

df = ww.DataTable(pd.DataFrame({
    "all nan": [np.nan, np.nan, np.nan, np.nan, np.nan],
    "all nan cat":...
```

freddyaboulton

bug

Smarter values of top_n for One Hot Encoder in AutoML

1

Currently, `AutoMLSearch` will only fit `OneHotEncoder`s with `top_n` [set to 10](https://github.com/alteryx/evalml/blob/main/evalml/pipelines/components/transformers/encoders/onehot_encoder.py#L25). This can be problematic because a user can have data with more than 10 categories, e.g. 50 US states,...

freddyaboulton

enhancement

needs design
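
To make the `top_n` concern concrete, here is a small sketch in plain Python. It does not call EvalML's `OneHotEncoder`; it assumes that `top_n` keeps only the most frequent categories and that rows with any other category get no indicator column set. The helper names and the synthetic 50-category column are hypothetical.

```python
from collections import Counter


def top_n_categories(values, top_n=10):
    """Return the top_n most frequent categories (assumed top_n semantics)."""
    counts = Counter(values)
    return [category for category, _ in counts.most_common(top_n)]


def one_hot(values, categories):
    """Encode each value as indicators over the kept categories only."""
    return [[1 if value == category else 0 for category in categories] for value in values]


# Synthetic column with 50 categories: state_00 appears 50 times, state_49 once.
states = [f"state_{i:02d}" for i in range(50)]
column = [state for i, state in enumerate(states) for _ in range(50 - i)]

kept = top_n_categories(column, top_n=10)
encoded = one_hot(column, kept)

# Rows whose category was not kept end up with an all-zero encoding.
unencoded_rows = sum(1 for row in encoded if sum(row) == 0)
print(len(column), len(kept), unencoded_rows)
```

In this synthetic column, 40 of the 50 categories get no column of their own, so a fixed `top_n` can discard much of the signal in a high-cardinality feature.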

Update `make_pipeline_from_data_check_output` and `make_pipeline_from_actions` to introduce new method that handles automl config

After we introduce action codes that indicate changes to AutoML config, we should update our action-handling methods (`make_pipeline_from_data_check_output` and `make_pipeline_from_actions`) to handle AutoML configuration recommendations.

angela97lin

enhancement

Add label leakage check after training models in AutoML

1

Per @kmax12's comment in #917, we could do a label leakage check after training a model, checking if it scored very highly but only has a single feature with all...

angela97lin

new feature
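
A rough sketch of what such a post-training check could look like, in plain Python. The issue text above is truncated, so this assumes it means "a single feature holding nearly all of the feature importance". The function name, both thresholds, and the importance dictionary are hypothetical, not EvalML API.

```python
def flag_possible_leakage(score, importances, score_threshold=0.95, importance_threshold=0.95):
    """Return the name of a suspicious feature, or None.

    A feature is flagged when the model scores at or above score_threshold and
    that one feature accounts for at least importance_threshold of the total
    absolute importance. Both thresholds are illustrative assumptions.
    """
    total = sum(abs(value) for value in importances.values())
    if score < score_threshold or total == 0:
        return None
    feature, value = max(importances.items(), key=lambda item: abs(item[1]))
    if abs(value) / total >= importance_threshold:
        return feature
    return None


suspect = flag_possible_leakage(0.99, {"account_closed": 0.98, "age": 0.01, "income": 0.01})
print(suspect)
```

The check assumes a higher-is-better score on a 0 to 1 scale; objectives where lower is better would need the comparison inverted.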
