5G-Federation
5G-Federation copied to clipboard
Expected SARSA vs QL
Does the "Expected SARSA" do better than QL?
Except for the small additional computational cost, Expected Sarsa may completely dominate both of the other more-well-known TD control algorithms.