5G-Federation
5G-Federation copied to clipboard
Dyna
Dyna needs more less episodes to converge
It seems that, in large problems, it is really beneficial to use it instead of direct Q-Learning