Chi Wang

Results 343 comments of Chi Wang

@jmrichardson Please let us know whether you would like to make the suggested changes in this PR. If so, we'll wait for that. If not, could you please create issues...

Sorry, one more thing to fix before we can merge this PR: https://github.com/microsoft/FLAML/runs/7680527800?check_suite_focus=true#step:8:12 An easy way to avoid the lint issues is to use [pre-commit](https://microsoft.github.io/FLAML/docs/Contribute#pre-commit).

> Dude, contributing to Microsoft projects is really painful! :)) The CLA signing is a one-time procedure. You won't need it for every PR. Some checks failed: https://github.com/microsoft/FLAML/runs/7702402957?check_suite_focus=true#step:5:51

> @sonichi Looks like we need to upgrade to a higher version of Sklearn (1.1.2 based on [PyPI](https://pypi.org/project/scikit-learn/)) Do you know any other solution, except removing the items that give...

Great suggestion. These are the relevant places to be changed: https://github.com/microsoft/FLAML/blob/a64956a7c8f98db6a7de7e8cc5d0ce53e9c86a03/flaml/automl.py#L1052 https://github.com/microsoft/FLAML/blob/a64956a7c8f98db6a7de7e8cc5d0ce53e9c86a03/flaml/automl.py#L1164 allow one-column dataframe and convert to series. https://github.com/microsoft/FLAML/blob/a64956a7c8f98db6a7de7e8cc5d0ce53e9c86a03/flaml/automl.py#L1560 https://github.com/microsoft/FLAML/blob/a64956a7c8f98db6a7de7e8cc5d0ce53e9c86a03/flaml/automl.py#L2086 https://github.com/microsoft/FLAML/blob/a64956a7c8f98db6a7de7e8cc5d0ce53e9c86a03/flaml/automl.py#L2178 update docstr. @qingyun-wu @ekzhu Please comment if I...

> Thanks! Would I not want to convert the one-column dataframe to a column before using it, or will that happen automatically if it passes the assertions? Please convert it....

@tugra-alp Do you mean seeing the X_train and X_val for each fold during the hyperparameter search? Or the full training data used for the final model training?

Right now there is no such info exposed. You need to insert some logging statement after: https://github.com/microsoft/FLAML/blob/2f5d6169d3b5cc025eb2516cbd003fced924a88e/flaml/ml.py#L452-L457

@busekoseoglu I can't reproduce this problem with my synthetic data for testing. Could you please share an example dataset to reproduce this problem? BTW, you don't have to use one...

This works for me: ``` from flaml import AutoML import pandas as pd df = pd.read_csv("https://github.com/microsoft/FLAML/files/8496779/sampledf.csv") X = df.drop(columns="has_paid") y = df["has_paid"] from sklearn.pipeline import Pipeline from sklearn.compose import ColumnTransformer...