Rohin Shah comments

Repositories
Issues
Comments

Results 2 comments of


                                            Rohin Shah

MaxEnt IRL Run-time optimization

I have an implementation of value iteration in Numpy that's about 50-100x faster than my Python implementation in `FastOptimalAgent` [here](https://github.com/HumanCompatibleAI/planner-inference/blob/master/fast_agents.py). (But my Python implementation is likely a lot slower than...

MaxEnt IRL Run-time optimization

Oh, yes, it's assuming a deterministic MDP. I forgot that your gridworlds are slippery. That said, you should only need to change the lines that add discounted_values to the qvalues...