Ethan Brown
Results
1
issues of
Ethan Brown
RL is a method of training. The actual NN used are often pretty standard feed-forward NN. One counter-example could be value iteration networks: https://arxiv.org/abs/1602.02867 But I would classify these separately.