wbwatkinson
wbwatkinson
How do I make it work on Windows 7 64-bit? I have Processing 3.0b5, and I receive the following error: A library used by this sketch is not installed properly....
The Inverted Gradients technique is applied on the back-propagation gradients output of the critic before they are applied to the actor. The relevant code is in dqn.cpp, lines 922-965 (https://github.com/mhauskn/dqn-hfo/blob/c7b0a73de07078e248015d44573d8dcadd6fb8d1/src/dqn.cpp#L922)....
This code is very raw, and it there are some problems with the learning, which is slow and unstable. Specifically, the 4 discrete action values (used to probabilistically select which...
I had an error in the code... corrected now (https://github.com/wbwatkinson/ddpg-hfo-python). Unless there are other questions about the inverting gradients algorithm, I recommend closing this.
I think it would be best to discuss the specifics of the Python code in the other repository. That said, I made two changes that stabilized learning. 1) Correction to...