Ronald Seoh
Results: 2 comments
There is no previous value function when we start policy evaluation.
I was referring to V when I said "value function", because this is a special case of the state-value function, where the function can be defined using a table consisting...
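To make both points concrete, here is a minimal sketch of iterative policy evaluation with a tabular V. It is only an illustration under assumptions of mine: the two-state MDP, the names `transitions`, `policy`, and `evaluate_policy`, the deterministic policy, the zero initialisation, and the in-place sweep are all hypothetical choices, not something taken from the thread. The point is that V is just a table with one entry per state, and that it has to be initialised arbitrarily because no earlier estimate exists when evaluation starts.

```python
def evaluate_policy(transitions, policy, gamma=0.9, tol=1e-8):
    """Iterative policy evaluation with a tabular V (one entry per state)."""
    # No previous value function exists at the start, so V is initialised
    # arbitrarily. Zeros are an assumption here; any finite values would do.
    V = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for s in transitions:
            a = policy[s]  # deterministic policy (assumed for simplicity)
            # Expected one-step return: sum over (prob, next_state, reward).
            new_v = sum(p * (r + gamma * V[s2]) for p, s2, r in transitions[s][a])
            delta = max(delta, abs(new_v - V[s]))
            V[s] = new_v  # in-place update of the table
        if delta < tol:
            return V


# Hypothetical two-state MDP: transitions[state][action] = [(prob, next_state, reward)]
transitions = {
    "A": {"go": [(1.0, "B", 1.0)]},
    "B": {"go": [(1.0, "B", 0.0)]},
}
policy = {"A": "go", "B": "go"}

V = evaluate_policy(transitions, policy)
print(V)
```

In this toy example state B loops on itself with zero reward, so its value stays at 0, and state A collects a reward of 1 once before moving to B.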