evalml icon indicating copy to clipboard operation
evalml copied to clipboard

EvalML is an AutoML library written in python.

Results 249 evalml issues
Sort by recently updated
recently updated
newest added

Right now, `component_graph.get_component` expects a string which is the unique name used to find a component in the graph (ex: "My Label Encoder", and not "Label Encoder"). This makes it...

new feature
good first issue

We currently use logistic reg for classification and linear reg for regression. I bet lasso would perform better!

new feature
performance

This will involve a larger discussion on how we want to integrate this and what form the warm-start feature should take. Currently I see three implementations that we can consider...

new feature
needs design
performance
spike

This issue was brought up [here](https://alteryx.atlassian.net/wiki/spaces/PS/pages/643268717/Handle+Unknown+types+from+Woodwork+in+EvalML?focusedCommentId=653230126#comment-653230126) as we integrate the new WW update into EvalML. Primarily, we want to raise a datacheck warning/error when the dataset a user passed in...

enhancement
spike

Our Makefile lists the following for the `git-test-prophet` command: ``` pytest evalml/tests/component_tests/test_prophet_regressor.py evalml/tests/component_tests/test_components.py evalml/tests/component_tests/test_utils.py evalml/tests/pipeline_tests/ evalml/tests/utils_tests/ ``` So we run the prophet unit tests in `component_tests/test_prophet_regressor` but we also run...

refactor
testing
tech debt

Just a placeholder issue for generating issues for the TODO's in our code. Can turn this into an epic as necessary. Successful completion of this issue/epic is building or associating...

task
spike
tech debt

Good at handling imbalanced data: https://www3.nd.edu/~nchawla/papers/DMKD11.pdf I couldn't find a widely-used python impl. I did find [this](https://github.com/EvgeniDubov/hellinger-distance-criterion#example), looks like there's some cython going on. The same author wrote [this nice...

new feature
needs design
performance
new component

Rather than relying on the CV scores to rank the pipelines on the leaderboard, perhaps we should have a model selection split where we hold out some data and rank...

new feature
needs design
spike

Follow up from PR #2905 where we temporarily disabled running `explain_predictions` and `explain_predictions_best_worst` with CatBoost models running the LIME algorithm (details in comment [here](https://github.com/alteryx/evalml/pull/2905#discussion_r733115415)). We should figure out how to...

enhancement