automl-toolkit

Toolkit for Apache Spark ML for feature clean-up, feature importance calculation suite, information gain selection, distributed SMOTE, model selection and training, hyperparameter optimization and se...

Results 10 automl-toolkit issues

Hey guys, I'm reading the source code and would like to sincerely thank you for all the work you've done and for making all of the code public. But I noticed that...

When following [this tutorial](https://pages.databricks.com/rs/094-YMS-629/images/automl-simplify-loan-risk-analysis-xgb-optimize.html?_ga=2.223345816.2032433623.1572270822-1606208156.1568260249), I encounter the following error during feature selection thrown by DataSplitUtility: `java.lang.IllegalArgumentException: requirement failed: Number of partitions (0) must be positive.` The thing I do differently...

Since yesterday, I have been trying FamilyRunner, and it works past the `DropColumnsTransformer` stage as long as I don't use the "chronological" split method, but it fails in `DataSplitUtility.split` as reported [here](https://github.com/databrickslabs/automl-toolkit/issues/21). The...

Bumps [junit](https://github.com/junit-team/junit4) from 4.8.1 to 4.13.1. Release notes (sourced from junit's releases): JUnit 4.13.1: Please refer to the release notes for details. JUnit 4.13: Please refer to the release notes...

dependencies

Hi, I guess this has to do with how I installed the package. Since I couldn't install it via pip, I just git cloned it into site-packages in my...

I am trying to run the Python example on Databricks. When I get to this line, I get the error in the subject: `fi_importances = FI.run_feature_importances("XGBoost", "classifier", dataframe, 20.0, "count", generic_overrides)` I have...

When the dbutils namespace was hijacked and compiled, `apiToken.get` doesn't work with the Maven-compiled version of the code built following the build instructions. It seems like it should work...

bug

Hi, what is the plan for this project's license? Will it become Apache 2.0 like Delta Lake? Also, will there be a Spark 3.0/Scala 2.12 release? Thanks.

The current pom.xml uses MLflow 0.9.1 (pre-GA). It should use the latest MLflow release, 1.2.0.