baedan comments

Results 25 comments of


                                            baedan

estimate v.s. basis in policies

i feel like we're talking past each other, and there's gotta be some concepts that we are defining very differently, haha

estimate v.s. basis in policies

thanks for the responses. i'm tapped out for the day, will think about it more tomorrow

estimate v.s. basis in policies

i think i understand the disconnect now. a policy in this package is not just the classically defined, stationary map from a state to a probability distribution over the action...

estimate v.s. basis in policies

well, i can't wait for the next release. :D one thing though: what i mean by _policy evaluation_ or _prediction_ is not `(p::AbstractPolicy)(env)`, but evaluating the state/action values of a...

estimate v.s. basis in policies

thanks, i'll look into it. all in all, this thread did make me consider more deeply the various aspects of design, which are indeed challenging.

various eligibility trace-equipped TD methods

> In fact, we also need contributors to work on porting [tablar methods](https://github.com/JuliaReinforcementLearning/ReinforcementLearning.jl/tree/v0.10.1/src/ReinforcementLearningZoo/src/algorithms/tabular) in the latest workflow in the master branch. would be great if there’s a document i can...

baedan

estimate v.s. basis in policies

estimate v.s. basis in policies

estimate v.s. basis in policies

estimate v.s. basis in policies

estimate v.s. basis in policies

various eligibility trace-equipped TD methods

Categorical cannot take abstract array p

load flat parameters without mutation or `restructure`

load flat parameters without mutation or `restructure`

`AbstractGroup`s