TabulaRL
Math behind your code: finite_tabular_agents.py (lines 80-90)
I can't figure out the math in your update observation function. Can you give me some tips?
I think I figured out the math. Thanks.