ReinforcementLearning.jl
Support multiple discrete action spaces
A2C and PPO can be improved further to support multiple discrete action spaces.
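
To make the request concrete, here is a minimal sketch of one common way to handle several discrete action dimensions. It assumes a factorized policy, i.e. one independent categorical distribution per dimension, so the joint log-probability and entropy used in the A2C/PPO losses are sums over dimensions. The issue does not say which approach is intended, and the helper names `log_softmax`, `joint_logprob`, and `joint_entropy` are hypothetical, not part of ReinforcementLearning.jl.

```julia
# Hypothetical helpers for illustration only; not the ReinforcementLearning.jl API.

# Numerically stable log-softmax over one vector of logits.
function log_softmax(logits::AbstractVector{<:Real})
    shifted = logits .- maximum(logits)
    return shifted .- log(sum(exp.(shifted)))
end

# Joint log-probability of a multi-discrete action under a factorized policy:
# the sum of the per-dimension categorical log-probabilities.
function joint_logprob(logits_per_dim, action)
    return sum(log_softmax(l)[a] for (l, a) in zip(logits_per_dim, action))
end

# Entropy of the factorized policy: the sum of the per-dimension entropies.
function joint_entropy(logits_per_dim)
    return sum(logits_per_dim) do l
        lp = log_softmax(l)
        -sum(exp.(lp) .* lp)
    end
end

logits = [[0.0, 0.0], [0.0, 0.0, 0.0]]  # two action dimensions with 2 and 3 choices
action = [1, 3]                          # 1-based index chosen in each dimension
lp = joint_logprob(logits, action)       # log(1/2) + log(1/3)
H = joint_entropy(logits)                # log(2) + log(3)
```

With a per-dimension policy head producing `logits_per_dim`, `lp` would stand in for the single-action log-probability in the policy-gradient or PPO ratio term, and `H` for the entropy bonus.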