simonsays1980
## Why are these changes needed? This PR adds prioritized sampling for multi-agent setups. It implements `"independent"` sampling by holding sum- and min-segments for each module and updating them accordingly...
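A rough, hypothetical sketch of the `"independent"` idea (not RLlib's actual buffer classes): one priority structure per module ID, sampled and updated on its own. Only the sum/proportional part is shown; the min-segment used for importance-weight normalization is omitted, and a real implementation would use segment trees rather than plain lists.

```python
import random
from collections import defaultdict


class ModuleBuffer:
    """Per-module storage with priorities (simplified; segment trees make this O(log N))."""

    def __init__(self, alpha: float = 0.6):
        self.alpha = alpha
        self.items, self.priorities = [], []

    def add(self, item, priority: float = 1.0):
        self.items.append(item)
        self.priorities.append(priority ** self.alpha)

    def sample(self, k: int):
        # Proportional (prioritized) sampling over this module's items only.
        return random.choices(self.items, weights=self.priorities, k=k)

    def update_priorities(self, indices, new_priorities):
        for i, p in zip(indices, new_priorities):
            self.priorities[i] = p ** self.alpha


# One buffer per module; each is sampled and updated independently.
buffers = defaultdict(ModuleBuffer)
buffers["policy_0"].add({"obs": 1, "td_error": 0.5}, priority=0.5)
buffers["policy_1"].add({"obs": 2, "td_error": 2.0}, priority=2.0)
samples = {mid: buf.sample(k=2) for mid, buf in buffers.items()}
```

Keeping the structures per module means one module's TD-errors never skew the sampling distribution of another module.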
## Why are these changes needed? The user defines the window size for metrics in `metrics_num_episodes_for_smoothing`. This needs to be applied to all episode metrics to keep them consistent. This PR...
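For reference, the window is set on the config's reporting options; the environment and value below are arbitrary examples:

```python
from ray.rllib.algorithms.ppo import PPOConfig

config = (
    PPOConfig()
    .environment("CartPole-v1")
    # Smooth all episode metrics (return, length, ...) over the same
    # window of the last 100 episodes.
    .reporting(metrics_num_episodes_for_smoothing=100)
)
```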
## Why are these changes needed? When sampling complete episodes, each `EnvRunner` sampled `train_batch_size` before returning. This made sampling inefficient and led to long waiting times in case of slow environments...
### What happened + What you expected to happen # What happened I ran the TensorFlow PPO algorithm on a problem that gave me some unstable gradients. I wanted to...
### What happened + What you expected to happen # What happened I ran the script below to read in data in zipped JSONL format and ran into this error:...
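The failing script itself is not reproduced here, but for context, the documented way to read gzip-compressed JSONL with Ray Data passes the compression through `arrow_open_stream_args` (the path below is hypothetical):

```python
import ray

ds = ray.data.read_json(
    "s3://my-bucket/logs/data.jsonl.gz",  # hypothetical path
    arrow_open_stream_args={"compression": "gzip"},
)
print(ds.take(1))
```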
## Why are these changes needed? So far, the output/write arguments already allowed users to define cloud filesystems (like GCS, S3, ABS) to write to. This PR proposes the same...
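As a hedged illustration of the underlying pattern (not this PR's exact API), a `pyarrow` filesystem object can be handed to the read path the same way the write path already accepts one; bucket and path are hypothetical:

```python
import pyarrow.fs as pafs
import ray

# GCS here; S3FileSystem works analogously, ABS via an fsspec-wrapped filesystem.
fs = pafs.GcsFileSystem()
ds = ray.data.read_parquet("my-bucket/offline-data/", filesystem=fs)
```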
## Why are these changes needed? Right now, the new Offline RL stack does not allow using old-stack recorded data. Many users have costly recorded data from the old...
## Why are these changes needed? Storing episodes as instances with `ray.data` results in pickled instances that may not be compatible with later Python versions. This PR tries to develop...
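A minimal sketch of the alternative direction, writing episode data as plain columns so the files stay readable across Python versions; the column names and output path are hypothetical:

```python
import ray

rows = [
    {"eps_id": 0, "obs": [0.1, 0.2], "action": 1, "reward": 1.0},
    {"eps_id": 0, "obs": [0.3, 0.4], "action": 0, "reward": 0.0},
]
# Columnar Parquet instead of pickled episode objects.
ray.data.from_items(rows).write_parquet("/tmp/offline_episodes")
```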
## Why are these changes needed? At the moment, the `OfflinePreLearner` samples recorded episodes or `SampleBatch`es from a `ray.data` dataset and then adds them to a buffer which coordinates the...
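Roughly, the described flow can be pictured as a stateful callable mapped over the dataset that buffers incoming rows and cuts fixed-size train batches; this is a simplified, hypothetical sketch, not RLlib's actual `OfflinePreLearner`:

```python
import ray


class PreLearner:
    """Buffers incoming rows and emits fixed-size train batches (simplified)."""

    def __init__(self, batch_size: int = 4):
        self.batch_size = batch_size
        self.buffer = []

    def __call__(self, batch: dict) -> dict:
        # Buffer incoming rows, then cut one train batch; leftover rows stay
        # buffered for the next call (and are dropped at the end, for simplicity).
        self.buffer.extend(zip(batch["obs"], batch["action"]))
        out, self.buffer = self.buffer[: self.batch_size], self.buffer[self.batch_size:]
        return {
            "obs": [o for o, _ in out],
            "action": [a for _, a in out],
        }


ds = ray.data.from_items([{"obs": float(i), "action": i % 2} for i in range(16)])
train_batches = ds.map_batches(PreLearner, batch_size=8, concurrency=1)
print(train_batches.take(2))
```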
## Why are these changes needed? The autoregressive-actions example was flaky (see #47876) and could be simplified (as PPO only backpropagates through the log-probabilities). This PR suggests a simplified solution...
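A minimal torch sketch of that point, with hypothetical head names: the second action head is conditioned on the sampled first action, and the only quantity PPO's surrogate loss needs gradients through is the summed log-probability:

```python
import torch
from torch.distributions import Categorical


class AutoregressiveHeads(torch.nn.Module):
    def __init__(self, hidden: int = 32, n_a1: int = 3, n_a2: int = 4):
        super().__init__()
        self.n_a1 = n_a1
        self.head_a1 = torch.nn.Linear(hidden, n_a1)
        # The second head sees the state features plus a one-hot of a1.
        self.head_a2 = torch.nn.Linear(hidden + n_a1, n_a2)

    def forward(self, feats: torch.Tensor):
        dist_a1 = Categorical(logits=self.head_a1(feats))
        a1 = dist_a1.sample()
        a1_one_hot = torch.nn.functional.one_hot(a1, self.n_a1).float()
        dist_a2 = Categorical(
            logits=self.head_a2(torch.cat([feats, a1_one_hot], dim=-1))
        )
        a2 = dist_a2.sample()
        # PPO's surrogate loss only needs this joint log-prob to backprop through.
        logp = dist_a1.log_prob(a1) + dist_a2.log_prob(a2)
        return (a1, a2), logp


heads = AutoregressiveHeads()
actions, logp = heads(torch.randn(5, 32))
```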