dask-ml icon indicating copy to clipboard operation
dask-ml copied to clipboard

support for non dask arrays for HyperbandSearchCV

Open gioxc88 opened this issue 5 years ago • 6 comments
trafficstars

The main changes in this PR are:

  1. change to the async def _fit function in dask_ml.model_selection._incremental.py to allow Hyperband to work with non dask arrays

  2. fixed default test_size which didn't work with pandas dataframe

gioxc88 avatar Oct 28 '20 18:10 gioxc88

@stsievert I implemented the changes we discussed. sorry it took so long but I have been busy with other projects.

Cheers Gio

gioxc88 avatar Nov 20 '20 14:11 gioxc88

@stsievert do you have another chance to look at this?

TomAugspurger avatar Nov 22 '20 16:11 TomAugspurger

Hi @stsievert I agree with you in principle the check should be done on X_train. And sorry for the comments, it was just for me to remember where I made changes. Of course I can remove them.

EDIT: Done

gioxc88 avatar Nov 22 '20 19:11 gioxc88

does anyone have any idea of what happened with the tests? it's like it's stuck

many thanks

gioxc88 avatar Dec 02 '20 23:12 gioxc88

I'm not sure, but it does look there's a linting issue (both black and isort fail):

Checking black...
black, version 19.10b0
would reformat /home/vsts/work/1/s/dask_ml/model_selection/_incremental.py
would reformat /home/vsts/work/1/s/tests/model_selection/test_hyperband.py
Oh no! 💥 💔 💥
2 files would be reformatted, 98 files would be left unchanged.
Checking black... DONE
Checking isort...
4.3.21
ERROR: /home/vsts/work/1/s/tests/model_selection/test_hyperband.py Imports are incorrectly sorted.
Checking isort... DONE

Try making those lint changes and pushing; that will (likely) resolve the issue.

stsievert avatar Dec 02 '20 23:12 stsievert

I've fixed the linting changes locally (and substantially reduced the computational size of the test), but the test_pandas still hangs. I've played around a bit, but haven't been able to identify the root cause yet.

hristog avatar Mar 21 '21 11:03 hristog