Kyle Daruwalla comments

Results 404 comments of


                                            Kyle Daruwalla

Refresh Model Zoo

If most of the examples use `train!`, and we converge on removing it or downplaying it, then we taught a bunch of users to use a non-preferred API. The for-loop...

[Discussion]: Update and periodically test posts and model-zoo tutorials

I started revamping the site, but I didn't finish before getting pulled away. I'll push what I have as a draft today. I don't think the model zoo or any...

Maintainers

I would suggest getting in touch with @findmyway. I saw you posted in the FastAI port channel on Slack, and this issue was part of ML community effort along the...

Would it make sense to use https://github.com/JuliaPOMDP/RLInterface.jl before registering this? That would make the environments interface with the rest of JuliaPOMDP which has a lot of policies and methods already...

Register?

Yup I was [thinking the same thing](https://github.com/JuliaReinforcementLearning/CommonRLInterface.jl/issues/18#issuecomment-647562997). Would be happy to give it a shot. Maybe this weekend?

Register?

Again was thinking along the same lines when I was commenting on that issue. RLEnv implements an interface that supersedes CommonRLInterface. I was thinking of doing the later here as...

Standard action space for DiscreteSpace

I think logically, a discrete to continuous mapping would be `{1, ..., n} --> [1.0, n]`. Beyond that, I think it is unique to each environment. For example, in CartPole,...

Standard action space for DiscreteSpace

Been thinking about this recently. Should we establish an experimental `zygote` branch that uses custom adjoints to implement differentiable DiscreteSpaces?

Handle gradients and object trees together

I think the old interface matches https://github.com/FluxML/Functors.jl/pull/1 better, but I'm fine putting everything into a tuple. I will still suggest that the interface should be: ```julia update(o, st, x, (dx1,...

Handle gradients and object trees together

What do we lose by sticking with the interface already on master? Why are we wanting to add additional steps to the design?