auto-sklearn Multi-output regression example is broken (potential macOS problem)

Multi-output regression example is broken (potential macOS problem)

Open timothygebhard opened this issue 1 year ago • 7 comments

Describe the bug

When I try to run the example on multi-output regression from the docs, the .fit() routine crashes with a ValueError.

To Reproduce

Steps to reproduce the behavior:

Download or copy code from the docs.
Run the code.
Watch the .fit() routine crash.

Expected behavior

The code should run through and produce results like in the docs.

Actual behavior, stacktrace or logfile

The .fit() routine crashes with the following error message:

[ERROR] [2022-09-29 16:41:12,619:Client-AutoML(1):synthetic] (' Dummy prediction failed with run state StatusType.CRASHED and additional output: {\'error\': \'Result queue is empty\', \'exit_status\': "<class \'pynisher.limit_function_call.AnythingException\'>", \'subprocess_stdout\': \'\', \'subprocess_stderr\': \'Process pynisher function call:\\nTraceback (most recent call last):\\n  File "/usr/local/Cellar/[email protected]/3.9.14/Frameworks/Python.framework/Versions/3.9/lib/python3.9/multiprocessing/process.py", line 315, in _bootstrap\\n    self.run()\\n  File "/usr/local/Cellar/[email protected]/3.9.14/Frameworks/Python.framework/Versions/3.9/lib/python3.9/multiprocessing/process.py", line 108, in run\\n    self._target(*self._args, **self._kwargs)\\n  File "/Users/timothy/.virtualenvs/moml/lib/python3.9/site-packages/pynisher/limit_function_call.py", line 108, in subprocess_func\\n    resource.setrlimit(resource.RLIMIT_AS, (mem_in_b, mem_in_b))\\nValueError: current limit exceeds maximum limit\\n\', \'exitcode\': 1, \'configuration_origin\': \'DUMMY\'}.',)
[ERROR] [2022-09-29 16:41:12,619:Client-AutoML(1):synthetic] (' Dummy prediction failed with run state StatusType.CRASHED and additional output: {\'error\': \'Result queue is empty\', \'exit_status\': "<class \'pynisher.limit_function_call.AnythingException\'>", \'subprocess_stdout\': \'\', \'subprocess_stderr\': \'Process pynisher function call:\\nTraceback (most recent call last):\\n  File "/usr/local/Cellar/[email protected]/3.9.14/Frameworks/Python.framework/Versions/3.9/lib/python3.9/multiprocessing/process.py", line 315, in _bootstrap\\n    self.run()\\n  File "/usr/local/Cellar/[email protected]/3.9.14/Frameworks/Python.framework/Versions/3.9/lib/python3.9/multiprocessing/process.py", line 108, in run\\n    self._target(*self._args, **self._kwargs)\\n  File "/Users/timothy/.virtualenvs/moml/lib/python3.9/site-packages/pynisher/limit_function_call.py", line 108, in subprocess_func\\n    resource.setrlimit(resource.RLIMIT_AS, (mem_in_b, mem_in_b))\\nValueError: current limit exceeds maximum limit\\n\', \'exitcode\': 1, \'configuration_origin\': \'DUMMY\'}.',)
Traceback (most recent call last):
  File "/Users/timothy/.virtualenvs/moml/lib/python3.9/site-packages/autosklearn/automl.py", line 765, in fit
    self._do_dummy_prediction()
  File "/Users/timothy/.virtualenvs/moml/lib/python3.9/site-packages/autosklearn/automl.py", line 489, in _do_dummy_prediction
    raise ValueError(msg)
ValueError: (' Dummy prediction failed with run state StatusType.CRASHED and additional output: {\'error\': \'Result queue is empty\', \'exit_status\': "<class \'pynisher.limit_function_call.AnythingException\'>", \'subprocess_stdout\': \'\', \'subprocess_stderr\': \'Process pynisher function call:\\nTraceback (most recent call last):\\n  File "/usr/local/Cellar/[email protected]/3.9.14/Frameworks/Python.framework/Versions/3.9/lib/python3.9/multiprocessing/process.py", line 315, in _bootstrap\\n    self.run()\\n  File "/usr/local/Cellar/[email protected]/3.9.14/Frameworks/Python.framework/Versions/3.9/lib/python3.9/multiprocessing/process.py", line 108, in run\\n    self._target(*self._args, **self._kwargs)\\n  File "/Users/timothy/.virtualenvs/moml/lib/python3.9/site-packages/pynisher/limit_function_call.py", line 108, in subprocess_func\\n    resource.setrlimit(resource.RLIMIT_AS, (mem_in_b, mem_in_b))\\nValueError: current limit exceeds maximum limit\\n\', \'exitcode\': 1, \'configuration_origin\': \'DUMMY\'}.',)

Environment and installation:

Please give details about your installation:

OS: macOS Monterey (12.6)
Environment: virtualenv
Python version: 3.9.14 (main, Sep 6 2022, 23:29:09) [Clang 13.1.6 (clang-1316.0.21.2.5)]
Auto-sklearn version: 0.15.0 installed via pip

Additional comments:

When I run the Binder version of the notebook, the problem does not occur. Maybe this means it is a macOS-specific issue? After all, multiprocessing is one of those modules that don't always behave the same on Linux and macOS.

Sep 29 '22 14:09 timothygebhard

Hi @timothygebhard,

Yup this is a known issue and pynisher had been updated to fix it, but we need to update quite a few things at once to make this work. This will be done at the start of the october once I can return to working on auto-sklearn.

Btw, turns out we just can't limit memory with MacOS: https://github.com/automl/pynisher#details. The future iterations of auto-sklearn will simply just raise a warning when attempting to limit memory on Mac and that's all we can do.

Best, Eddie

Sep 29 '22 15:09 eddiebergman

Hi @eddiebergman is there any update on this issue? Is there any workaround? My setup:

Apple M1 Pro
16 GB
OS version 13.1

Jan 04 '23 01:01 oantoshchenko

Hi @eddiebergman! I got the same issues here, is anyway we can solve this like change memory limit or number of cores?

here is my environment:

Apple M1 Pro
16 GB
MacOS Monterey 12.1

Mar 13 '23 22:03 daisyyywang

Sorry I missed this! No there's been no update, it's a Mac operating system problem that we can't limit memory, your best bet with auto-sklearn the only solution I know of is to set no memory limit, but this can potentially leads to more problems.

Mar 20 '23 15:03 eddiebergman

Sorry I missed this! No there's been no update, it's a Mac operating system problem that we can't limit memory, your best bet with auto-sklearn the only solution I know of is to set no memory limit, but this can potentially leads to more problems.

@eddiebergman thank you for the reply. The issue is not with setting a memory limit, the issue is that the error makes usage of the library on Mac impossible. Even though you would not use Mac in a production setting, but it reduces the number of people who want to experiment with the library, like me. Ultimately I went with another library, as running auto-sklearn in a docker container while debugging via ssh, is not optimal.

Mar 20 '23 17:03 oantoshchenko

How does one set no memory limit?

Mar 25 '23 13:03 jtlz2

Jul 22 '23 23:07 Sunishchal

auto-sklearn auto-sklearn copied to clipboard

Multi-output regression example is broken (potential macOS problem)

Describe the bug

To Reproduce

Expected behavior

Actual behavior, stacktrace or logfile

Environment and installation:

Additional comments:

auto-sklearn
auto-sklearn copied to clipboard