Ada Böhm comments

Results 74 comments of


                                            Ada Böhm

Support multi-node tasks

We want to support "everything in hq", but so far, you are right: upto 1-node tasks to HQ and multi-node tasks directly into SLURM/PBS

Auto allocation table size

Second thoughs: 7. What about same naming scheme as "hq worker"? hq auto-alloc info => hq auto-alloc list hq auto-alloc allocations X => hq auto-alloc info X 8. What about...

Pass the expected remaining time to tasks

Just noting obvious: it should be technically min(remaining_worker_time, task_time_limit).

Pass the expected remaining time to tasks

Btw: Maybe we can named it "HQ_REMAINING_SECONDS" to make it clear what are the units.

Better task sandboxing

Yes (but as a requirement comes from HQ, I put it here)

Better task sandboxing

Launcher of tasks was moved into HQ, so it is now issue for HQ

Kill all executing tasks on worker if worker crashes

It is also connected to #66, that a each task should be spawned into a process group and when it is canceled we should clean all processes

Running on pre-emptible batch queue crashes allocation queue in the HyperQueue

I am not able to find any information if we can distinguish a lost of worker because of preemption from a crash. But I would guess that it has to...

Running on pre-emptible batch queue crashes allocation queue in the HyperQueue

For completeness: you can start worker as follows: ``hq worker start --on-server-lost=finish-running`` and it will finish currently running jobs when server is lost. But reporting of these jobs is lost.

Leading newlines are stripped from code segments

It is done slightly on purpose. Can you share your use case? It cannot be switched off in the current version. But is like 3 lines of code to add...