老董 issues

Results 37 issues of


                                            老董

Is this algorithm suitable for off-policy policy?

I just finished reading your paper, and I notice that it is an on policy method. And I wondering if anyone has tested it with an rl method that has...

Why "alpha_prime" and "min_q_weight" are not the same thing when using Lagrange?

Under my understanding, if automatically tuning cql weight with Lagrange, then in the code, "alpha_prime" and "min_q_weight" should be the same thing, right?

请问下演示gif最后时刻右侧的控件是什么？

就是有点像ios的touch assistant的widget.

For SAC-discrete version, is it possible to update model with input of state and action just like Sac-continuous version?

Currently, I am trying to merge models for SAC discrete and continous version into just 1 model. According to SAC discrete critic_model, it only need input state and output action...

get Error "FileNotFoundError: [Errno 2] No such file or directory"

I got error like this, does it mean I have to compile it manually? I just ran the example: ``` import random from MAMEToolkit.sf_environment import Environment # roms_path = "roms/sf2ceua"...

Is this optimization also benefit small network?

After reading this paper, I found that the biggest optimization is to predict core neuros that effects output most. So... if the network is small originally, (say.. only 1000 parameters),...

Would that be possible to offer the project in QT solution but not CMakeList

I currently work with win10. And I know CMake would be used to wrap out C++ in to specific solution related to specific OS(like win10 in my example). But I...

Question on formula of the continuous action

First, thank your for the code related to paper `Discrete and Continuous Action Representation for Practical RL in Video Games`. Second, according to your code, all of action spaces of...

Does this binding makes it feasible to run gym_env in true multi-threads?

I found this project when trying to find a way to truely run openai gym parallelly with algorithm written in rust. Is it feasible to run gym_env in true multi-threads...

I hope one day this could be used to create model and auto-topology

At present, when inputting "create a cat", the GPT3.5 would output something like this: ![image](https://user-images.githubusercontent.com/17372655/228202708-fbc37fac-8206-40e1-bc82-7cd46a064aa3.png) I know this weird output is not due to this extension. I just want to...