matluster

Results 24 comments of matluster

There is coming new implementation of PolicyBasedRL in NNI v3.0. The large part of coding work is done and I just incorporate the new API of Gymnasium: https://github.com/microsoft/nni/commit/2efe0c6502d354d99181e05256823063458854ed It will...

We have people working on an example. There is currently no blocking issue.

The launching should be similar to verl. Please refer to the verl example. We will need to find an environment with multi nodes and multi gpus to test this.

Multiple actor instances can be set via `n_workers = X` in trainer.