automl-toolkit
automl-toolkit copied to clipboard
Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Model selection and training, Hyper parameter optimization and se...
hey guys, i'm reading the source code and i would like to sincerely thank all those works you've done there, and public all of code too. But i noticed that...
When following [this tutorial](https://pages.databricks.com/rs/094-YMS-629/images/automl-simplify-loan-risk-analysis-xgb-optimize.html?_ga=2.223345816.2032433623.1572270822-1606208156.1568260249), I encounter the following error during feature selection thrown by DataSplitUtility: `java.lang.IllegalArgumentException: requirement failed: Number of partitions (0) must be positive.` The thing I do differently...
Since yesterday, I tried using FamilyRunner and it works past `DropColumnsTransformer` stage as long as I don't use "chronological" split method -- but fails in `DataSplitUtility.split` as reported [here](https://github.com/databrickslabs/automl-toolkit/issues/21) The...
Bumps [junit](https://github.com/junit-team/junit4) from 4.8.1 to 4.13.1. Release notes Sourced from junit's releases. JUnit 4.13.1 Please refer to the release notes for details. JUnit 4.13 Please refer to the release notes...
Hi, I guess this has to do with how I installed the package. Since I couln't install it via pip, I just git clone the thing to site-packages in my...
I am trying to run the python example on databricks. When I get to this line I get the error in the subject. `fi_importances = FI.run_feature_importances("XGBoost", "classifier", dataframe,20.0,"count",generic_overrides)` I have...
When the dbutils namespace was highjacked and compiled the `apiToken.get` doesn't work, with the maven compiled version of the code following the build instructions. It seems like it should work...
License
Hi, What is the plan with the license for this project? Will it become Apache 2.0 like DeltaLake? Also will there be a Spark 3.0/Scala 2.12 release? Thanks
Current pom.xml use MLflow 0.9.1 (pre-GA). Should be using latest MLflow 1.2.0.