openbrain
openbrain copied to clipboard
Multiple Environments
trafficstars
- Confirm that Q values do not diverge
- Make sure environments are non-linear. (Linear updates still work)
Add program parameters to select environments, gamma