async-rl
Example of actor-critic for a large number of actions
Is the a3c implementation done?
Do you have an example implementation of actor-critic for large action spaces? I see that there is an example for a3c, but the action space for that problem is small.