tf2rl
tf2rl copied to clipboard
Example of GAIFO for atari?
I tried to make my own using atari categorical actor critic for PPO, but I always get this error: ` /home/rj/.local/lib/python3.8/site-packages/tf2rl/algos/vpg.py:187 _get_action_body * action, logp = self.actor_critic(state, test)
ValueError: too many values to unpack (expected 2)
`