Julius Parulek

Results 82 comments of Julius Parulek

Solved by https://github.com/equinor/ert/pull/7562

We should write a test which asserts that the job was resubmitted when it got a signal. The desired behaviour is to always resubmit...
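A rough sketch of what such a test could look like. This is not ERT's scheduler API: `run_with_resubmit` and its `max_submit` argument are hypothetical stand-ins, and a signal-killed job is modelled by a negative return code, which is how Python's `subprocess` reports it on POSIX.

```python
import signal
from typing import Callable, List


def run_with_resubmit(run_once: Callable[[], int], max_submit: int) -> List[int]:
    """Hypothetical helper: run the job until it returns 0 or max_submit attempts are used.

    Returns the return codes of all attempts, in order.
    """
    returncodes: List[int] = []
    for _ in range(max_submit):
        code = run_once()
        returncodes.append(code)
        if code == 0:
            break
    return returncodes


def test_job_is_resubmitted_after_signal():
    # First attempt is "killed" by SIGSEGV (negative return code),
    # second attempt succeeds.
    outcomes = iter([-signal.SIGSEGV, 0])
    codes = run_with_resubmit(lambda: next(outcomes), max_submit=3)
    # Two attempts means the job was resubmitted exactly once.
    assert codes == [-signal.SIGSEGV, 0]
```

The real test would replace the fake `run_once` with an actual job submitted through the scheduler and assert on whatever resubmission event it emits.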

@oyvindeide it might have been fixed by https://github.com/equinor/ert/pull/8015. Let's see if the error happens again.

Haven't seen this since the potential fix, so closing this issue.

Would it be realistic to make the monitor.py fully async? Not least in order to get rid of the duplexer. ~~Update: I have a Monitor_async.py version in my PR: https://github.com/equinor/ert/pull/6994~~...

Don't understand this tbh. What's the issue - is it to avoid `force-push`?

> Unsure about the other one, but given the very limited compute nodes available on rhel8 the tests might time out before the job is actually submitted to a compute...

We need to get the logs (both ert-logs and job-runner-logs) for this one.

How to reproduce: increase MAX_SUBMIT and trigger a random SEGFAULT, e.g. have the process send itself a signal that kills it.
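One way to simulate the crash is sketched below, assuming a POSIX system. The child script and the `run_crashing_job` helper are illustrative only and not part of ERT; how MAX_SUBMIT is raised in the actual configuration is not shown here.

```python
import signal
import subprocess
import sys

# Child script that sends SIGSEGV to itself, simulating a segfault in the job.
CHILD = "import os, signal; os.kill(os.getpid(), signal.SIGSEGV)"


def run_crashing_job() -> int:
    """Run the child once and return its return code.

    On POSIX, subprocess reports death-by-signal as the negative signal number.
    """
    return subprocess.run([sys.executable, "-c", CHILD], check=False).returncode


if __name__ == "__main__":
    code = run_crashing_job()
    print(f"job exited with return code {code}")
```

To make the failure random, the child could decide with `random.random()` whether to signal itself or exit normally.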

Reproduced locally with `pytest -vs tests/unit_tests/simulator/test_batch_sim.py::test_stop_sim --count=1000 -k using_scheduler`

```sh
tests/unit_tests/simulator/test_batch_sim.py::test_stop_sim[using_scheduler-153-1000]
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ Timeout ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Stack of Thread-153 () (140569485747968) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  File "/usr/lib64/python3.11/threading.py", line 995, in _bootstrap
    self._bootstrap_inner()
  File "/usr/lib64/python3.11/threading.py",...
```