Edmund

Results 3 issues of Edmund

### ❓ Question Hi, I noticed that the algorithm discount factor and reward discount factor are set to be the same in lines 365-367 in rl_zoo3/exp_manager.py `# Use the same...

question

### 🐛 Bug Hi, When I try to run TQC hyperparameter optimization with multiple jobs (n-jobs>1) with a GPU (this also happens with multiple CPU cores and n-jobs=1), it gives...

bug

### What happened + What you expected to happen After implementing the `reset_config()` method for PPO and running PB2 with `reuse_actors=True` with Pendulum-v1, it gives this error: ``` 2024-04-05 18:04:17,154...

bug
P3
rllib
rllib-newstack