rl_algorithms
Implement A2C for discrete environments
A2C is currently implemented only for continuous-action environments such as LunarLander-continuous. We should also implement A2C for discrete-action environments, because it can perform better in discrete environments.
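To make the request concrete, here is a minimal sketch of what the discrete variant might compute for a single transition. It is not taken from this repo: the function names (`softmax`, `sample_action`, `a2c_losses`), the one-step TD advantage, and the default `gamma` are all assumptions for illustration, and it uses plain Python rather than the repo's networks. The main difference from the continuous case is that the actor outputs one logit per action and the policy is a categorical distribution instead of a Gaussian.

```python
import math
import random


def softmax(logits):
    """Convert raw action scores into a categorical probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(probs, rng=random):
    """Sample a discrete action index from the categorical distribution."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


def a2c_losses(logits, action, reward, value, next_value, done, gamma=0.99):
    """One-step A2C losses for a single discrete-action transition (hypothetical helper)."""
    probs = softmax(logits)
    log_prob = math.log(probs[action])
    # One-step TD target; bootstrap only when the episode has not ended.
    target = reward + gamma * next_value * (0.0 if done else 1.0)
    advantage = target - value
    # Policy-gradient term; the advantage is treated as a constant here.
    actor_loss = -log_prob * advantage
    # Squared TD error for the value head.
    critic_loss = advantage ** 2
    # Entropy of the policy, often added as an exploration bonus.
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return actor_loss, critic_loss, entropy


if __name__ == "__main__":
    logits = [0.1, 0.5, -0.2, 0.0]  # e.g. four discrete actions
    action = sample_action(softmax(logits))
    print(a2c_losses(logits, action, reward=1.0, value=0.5,
                     next_value=0.4, done=False))
```

In a real implementation the logits and value would come from the actor and critic networks, and the losses would be averaged over a batch before backpropagation.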