Youngwoon Lee
Youngwoon Lee
In `rollout.py`, `run` and `run_episode` share most of the code. We can merge them and replace `run_episode` with `run(every_episodes=1, is_train=False)`.
Match the output dimension of critic networks. For example, [128, 1] in PPO and [128] in Dreamer). We can reduce the last dimension.
# Contributing to CSrankings Thanks for contributing to CSrankings! Please read and indicate you agree with **all** these guidelines to getting your pull request accepted. Note that pull requests may...