Most deep RL algorithms in RLZoo assume the state to be an array. However, states represented as a graph or any other data structure should also be supported out of the box.
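For illustration, a minimal sketch of how multiple dispatch could make this work; `GraphState` and `encode` are hypothetical names here, not part of the RLZoo API:

```julia
# Multiple dispatch lets an agent accept any state representation,
# as long as the encoder defines a method for it.
struct GraphState
    adjacency::Matrix{Float64}   # adjacency matrix of the graph
    features::Matrix{Float64}    # one feature column per node
end

# Array states: the usual dense path.
encode(state::AbstractArray) = vec(state)

# Graph states: e.g. one round of neighborhood aggregation.
encode(state::GraphState) = vec(state.features * state.adjacency)

# Downstream code only ever calls `encode`, so supporting a new
# state type amounts to adding one method.
```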
Another interesting direction.

- [Reinforcement Learning for Combinatorial Optimization: A Survey](https://arxiv.org/abs/2003.03600)
This seems like an interesting direction, and it may require a specialized workflow. Ref:

- [Derivative-Free Reinforcement Learning: A Review](https://arxiv.org/abs/2102.05710)
- [BlackBoxOptim.jl](https://github.com/robertfeldt/BlackBoxOptim.jl)
- [CMAEvolutionStrategy.jl](https://github.com/jbrea/CMAEvolutionStrategy.jl)
- [BayesianOptimization.jl](https://github.com/jbrea/BayesianOptimization.jl)
- [Evolutionary.jl](https://github.com/wildart/Evolutionary.jl)
- [Evolving...
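For a rough idea of the workflow, here is a hedged sketch using the real `bboptimize` entry point of BlackBoxOptim.jl on a toy objective; `evaluate_policy` is a hypothetical stand-in for a full episode rollout:

```julia
using BlackBoxOptim

# Pretend `θ` parameterizes a policy; return a loss so that
# minimizing fitness maximizes episode return.
function evaluate_policy(θ)
    target = fill(0.5, length(θ))    # toy "optimal" parameters
    return sum(abs2, θ .- target)    # stand-in for -episode_return(θ)
end

res = bboptimize(evaluate_policy;
                 SearchRange = (-1.0, 1.0),
                 NumDimensions = 8,
                 MaxFuncEvals = 5_000)

best_candidate(res)  # the policy parameters found
```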
https://github.com/JuliaReinforcementLearning/ReinforcementLearningCore.jl/blob/63f306d99a6db736a1755a5d1e26f2aa8e8822dc/src/extensions/Zygote.jl#L10 Or maybe open a PR against Flux instead?
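For context, the kind of small Zygote utility that could be upstreamed looks roughly like the following; this is an illustrative sketch of global-norm gradient clipping, not necessarily the exact code at the linked line:

```julia
using Flux, Zygote, LinearAlgebra

# Scale all gradients in `gs` so their combined (global) norm
# does not exceed `clip_norm`. `ps = Flux.params(model)`,
# `gs = gradient(() -> loss(), ps)`.
function clip_by_global_norm!(gs::Zygote.Grads, ps::Zygote.Params, clip_norm)
    gnorm = sqrt(sum(norm(gs[p])^2 for p in ps if gs[p] !== nothing))
    if gnorm > clip_norm
        for p in ps
            gs[p] !== nothing && (gs[p] .*= clip_norm / gnorm)
        end
    end
    return gs
end
```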
In the current design of distributed RL, each worker creates an independent model and makes predictions separately. A better solution might be for workers on the same node to share some common...
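A hedged sketch of what node-level sharing could look like, with all names hypothetical: workers push states to a single inference task over a `Channel`, so only one copy of the model lives on each node:

```julia
using Flux

model = Chain(Dense(4, 32, relu), Dense(32, 2))  # shared policy

requests = Channel{Tuple{Vector{Float32},Channel{Vector{Float32}}}}(64)

# One inference task per node, owning the shared model.
inference = Threads.@spawn begin
    for (state, reply) in requests
        put!(reply, model(state))
    end
end

# Each worker sends its state and waits for the prediction.
function predict(state)
    reply = Channel{Vector{Float32}}(1)
    put!(requests, (state, reply))
    take!(reply)
end

predict(rand(Float32, 4))
```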
We used to have support for Knet.jl in addition to Flux.jl, but it was dropped since [email protected]. The main reason was that Knet.jl is not very easy to extend. However,...
Ref: https://arxiv.org/abs/1911.02140. Based on the existing implementation of IQN, this should be relatively easy to support.
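The key addition over IQN is the fraction proposal network: instead of sampling τ ~ U(0, 1), FQF learns the quantile fractions. A minimal sketch of that step, with hypothetical names:

```julia
using Flux

N = 8                                   # number of quantile fractions
propose = Dense(64, N)                  # logits from the state embedding

function quantile_fractions(embedding)
    p = softmax(propose(embedding))     # probabilities summing to 1
    τ = vcat(0f0, cumsum(p))            # monotone fractions, τ₀ = 0, τ_N = 1
    τ̂ = (τ[1:end-1] .+ τ[2:end]) ./ 2  # midpoints fed to the IQN embedding
    return τ, τ̂
end
```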
I've spent some time reimplementing https://github.com/liuanji/WU-UCT. It seems to work well. I'll add some experiments after https://github.com/JuliaReinforcementLearning/ReinforcementLearningZoo.jl/pull/14 gets merged.
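For reference, the core of WU-UCT is an adjusted UCT selection rule: each node additionally tracks the number of simulations that have been dispatched but not yet completed, so parallel workers do not pile onto the same branch. A rough sketch with a hypothetical `Node` struct:

```julia
struct Node
    Q::Float64   # mean return estimate
    N::Int       # completed visit count
    O::Int       # ongoing (unobserved) simulations
end

# WU-UCT score of a child; β is the exploration coefficient.
# Unvisited children (N + O == 0) should be selected first; that
# case is omitted here for brevity.
wu_uct_score(parent::Node, child::Node; β = 1.0) =
    child.Q + β * sqrt(2 * log(parent.N + parent.O) / (child.N + child.O))

# Selection picks the child maximizing this score.
select(parent::Node, children) = argmax(c -> wu_uct_score(parent, c), children)
```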
https://arxiv.org/abs/1810.09026