Justin Yuan
I think it's to take the expectation of the Bellman error target, since you need to marginalize over next actions when evaluating the next Q-value.
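For a discrete-action case, a minimal sketch of what I mean (the function name, the array shapes, and `next_action_probs` coming from the current policy are just illustrative assumptions, not our actual code):

```python
import numpy as np

def expected_bellman_target(reward, done, gamma, q_next, next_action_probs):
    """Bellman target that marginalizes over next actions.

    q_next:            (batch, n_actions) Q-values at the next state
    next_action_probs: (batch, n_actions) policy probabilities at the next state
    reward, done:      (batch,) transition reward and terminal flag
    """
    # E_{a' ~ pi(.|s')}[Q(s', a')] instead of Q(s', a') for one sampled next action
    expected_q_next = np.sum(next_action_probs * q_next, axis=-1)
    return reward + gamma * (1.0 - done) * expected_q_next
```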
- For the RL cost, I think it should use the true parameters, since that's the **only** source of information for learning, and if those are not the true ones, there's...
But for the control methods, this can be tricky, since the cost function is part of both the control algorithm and the environment. The ideal case is that we have a...
@adamhall Is there anything in the code that currently needs to be fixed regarding this issue?
I am leaning towards using `symbolic.U_EQ` for linearization and `env.U_EQ` for the cost function or reward; the current/updated symbolic model should already be able to expose `U_EQ`, but I'm not sure...
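Roughly what I have in mind, as a sketch (the finite-difference `linearize` helper and the quadratic `stage_cost` are placeholders rather than the actual API; the point is only where `symbolic.U_EQ` vs. `env.U_EQ` would plug in):

```python
import numpy as np

def linearize(f, x_eq, u_eq, eps=1e-6):
    """Finite-difference linearization of x_dot = f(x, u) about an equilibrium.

    Intended usage: pass the symbolic model's equilibrium input (symbolic.U_EQ)
    as u_eq, so the linear model is built around the model's own operating point.
    """
    f0 = f(x_eq, u_eq)
    A = np.column_stack([(f(x_eq + eps * e, u_eq) - f0) / eps for e in np.eye(x_eq.size)])
    B = np.column_stack([(f(x_eq, u_eq + eps * e) - f0) / eps for e in np.eye(u_eq.size)])
    return A, B

def stage_cost(x, u, x_goal, u_eq_env, Q, R):
    """Quadratic stage cost with the input penalized relative to the env's
    equilibrium input (pass env.U_EQ as u_eq_env), so the controller's cost
    matches the cost/reward the environment reports.
    """
    dx = x - x_goal
    du = u - u_eq_env
    return dx @ Q @ dx + du @ R @ du
```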