Artur Niederfahrenhorst
Signed-off-by: Artur Niederfahrenhorst

## Why are these changes needed?

We are still using our own deprecated API for concatenating samples.

## Checks

- [x] I've signed off every commit (by using...
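For context, concatenating sample batches amounts to stacking their columns. A generic sketch of the operation, assuming plain NumPy-array batches; this is not RLlib's implementation, which also handles nested columns, `SEQ_LENS`, and more:

```python
import numpy as np

def concat_samples(batches):
    """Concatenate column-oriented sample batches along the batch axis.

    Simplified sketch only; RLlib exposes a real, non-deprecated
    concat_samples in ray.rllib.policy.sample_batch.
    """
    keys = batches[0].keys()
    return {k: np.concatenate([b[k] for b in batches]) for k in keys}

b1 = {"obs": np.zeros((2, 4)), "rewards": np.array([0.0, 1.0])}
b2 = {"obs": np.ones((3, 4)), "rewards": np.array([1.0, 0.5, 0.0])}
merged = concat_samples([b1, b2])
assert merged["obs"].shape == (5, 4)
```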
## Why are these changes needed?

As discussed via Slack, we need to align the future ModelCatalog, the Models, and RLModule with respect to where to define specs and...
Signed-off-by: Artur Niederfahrenhorst

## Why are these changes needed?

The replay buffer demo typically takes approximately 10 iterations to reach the target reward of 50, but it often gets cancelled...
Signed-off-by: Artur Niederfahrenhorst

## Why are these changes needed?

We currently modify the original sample batch in place when counting instead of the copy. (We make everything except SampleBatch.SEQ_LENS an...
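A minimal sketch of the intended copy-then-mutate behavior, assuming a plain dict batch; the `compress_for_counting` helper and its length-based compression are hypothetical, and `SEQ_LENS` stands in for `SampleBatch.SEQ_LENS`:

```python
import copy

SEQ_LENS = "seq_lens"  # stands in for SampleBatch.SEQ_LENS

def compress_for_counting(batch):
    """Compress columns for counting without touching the caller's batch.

    Hypothetical illustration: the fix is to mutate a shallow copy,
    not the original batch that was passed in.
    """
    batch_copy = copy.copy(batch)  # the bug was mutating `batch` itself
    for key in batch_copy:
        if key != SEQ_LENS:
            batch_copy[key] = len(batch_copy[key])
    return batch_copy

original = {"obs": [1, 2, 3], SEQ_LENS: [3]}
compressed = compress_for_counting(original)
assert original["obs"] == [1, 2, 3]  # the original batch is left intact
assert compressed["obs"] == 3
```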
Signed-off-by: Artur Niederfahrenhorst

## Why are these changes needed?

Replaces imports of pygame with try_imports, akin to the ones we use for frameworks. This helps us output a more informative...
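A minimal sketch of the try_import pattern, assuming a helper named `try_import_pygame`; RLlib's real framework helpers (e.g. `try_import_torch` in `ray.rllib.utils.framework`) differ in detail:

```python
def try_import_pygame():
    """Try to import pygame; return None and print a hint if it is missing.

    Sketch of the try_import pattern: fail softly with an informative
    message instead of crashing with a bare ImportError.
    """
    try:
        import pygame
        return pygame
    except ImportError:
        print(
            "Could not import pygame! Some environments require it for "
            "rendering; install it via `pip install pygame`."
        )
        return None

pygame = try_import_pygame()
if pygame is not None:
    pygame.init()
```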
Signed-off-by: Artur Niederfahrenhorst

## Why are these changes needed?

Our quickstart example in the docs has been updated to fit gymnasium, but a docstring is missing the truncated flag and...
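For reference, gymnasium's `step` API returns a `truncated` flag alongside `terminated`. A docstring covering it might look like the following; the environment itself is a hypothetical placeholder, not the actual quickstart example:

```python
import gymnasium as gym

class MyEnv(gym.Env):
    """Hypothetical environment used only to illustrate the docstring."""

    def step(self, action):
        """Run one timestep of the environment's dynamics.

        Returns:
            observation: The next observation.
            reward: The reward for taking `action`.
            terminated: Whether the episode reached a terminal state.
            truncated: Whether the episode was cut off, e.g. by a time
                limit, before reaching a terminal state.
            info: Auxiliary diagnostic information.
        """
        observation, reward, terminated, truncated, info = None, 0.0, False, True, {}
        return observation, reward, terminated, truncated, info
```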
Signed-off-by: Artur Niederfahrenhorst

## Why are these changes needed?

Our input readers today simply throw an error if we want to ingest single-agent data in a multi-policy training workflow...
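One plausible shape of the fix, shown as a hypothetical helper: wrap single-agent data under a policy ID so multi-policy code can consume it. The helper name and the `"default_policy"` key are illustrative, not RLlib's exact reader API:

```python
def to_multi_agent(batch, policy_id="default_policy"):
    """Wrap a single-agent sample batch under a policy ID.

    Hypothetical illustration: multi-policy training code expects a
    mapping from policy IDs to batches, so single-agent input can be
    adapted instead of rejected with an error.
    """
    return {policy_id: batch}

single_agent_batch = {"obs": [0, 1], "actions": [1, 0]}
multi_agent_view = to_multi_agent(single_agent_batch)
assert "default_policy" in multi_agent_view
```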
### Description

Some of our Tune examples look a little weird ([example](https://docs.ray.io/en/master/tune/examples/lightgbm_example.html)). There is an index at the top with only one entry, “Example”. Also, we dump some output that...
## Why are these changes needed?

In the first attempt to leverage torch.compile, we didn't introduce a compiled update method on the side of the learner (1) and also...
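A minimal sketch of compiling a learner's update method with `torch.compile`; the `Learner` class here is a hypothetical stand-in, not RLlib's Learner API:

```python
import torch
import torch.nn.functional as F

class Learner:
    """Hypothetical learner used only to illustrate the pattern."""

    def __init__(self, model):
        self.model = model
        self.optimizer = torch.optim.SGD(model.parameters(), lr=0.01)
        # Compile the update step once; later calls reuse the compiled graph.
        self._compiled_update = torch.compile(self._update)

    def _update(self, batch, targets):
        loss = F.mse_loss(self.model(batch), targets)
        self.optimizer.zero_grad()
        loss.backward()
        self.optimizer.step()
        return loss

    def update(self, batch, targets):
        # Route all updates through the compiled method.
        return self._compiled_update(batch, targets)

learner = Learner(torch.nn.Linear(4, 1))
loss = learner.update(torch.randn(8, 4), torch.randn(8, 1))
```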
## Why are these changes needed?

This is a fork of https://github.com/ray-project/ray/pull/30997 to experiment with how we incorporate RNNs into RLModules.