老董

Results 34 issues of 老董

I just finished reading your paper and noticed that it is an on-policy method. I am wondering whether anyone has tested it with an RL method that has...

As I understand it, if the CQL weight is tuned automatically with Lagrange, then "alpha_prime" and "min_q_weight" in the code should be the same thing, right?

It is a bit like the widget of the iOS touch assistant.

Currently, I am trying to merge the models for the discrete and continuous versions of SAC into a single model. According to the SAC discrete critic_model, it only needs to input state and output action...

I got an error like this. Does it mean I have to compile it manually? I just ran the example: ``` import random from MAMEToolkit.sf_environment import Environment # roms_path = "roms/sf2ceua"...

After reading this paper, I found that the biggest optimization is to predict the core neurons that affect the output most. So, if the network is small to begin with (say, only 1000 parameters),...

I currently work on Windows 10, and I know CMake is used to generate a C++ solution specific to a given OS (Windows 10 in my case). But I...

First, thank you for the code related to the paper `Discrete and Continuous Action Representation for Practical RL in Video Games`. Second, according to your code, all of the action spaces of...

I found this project while looking for a way to truly run OpenAI Gym in parallel with an algorithm written in Rust. Is it feasible to run gym_env in true multi-threads...

At present, when I input "create a cat", GPT-3.5 outputs something like this: ![image](https://user-images.githubusercontent.com/17372655/228202708-fbc37fac-8206-40e1-bc82-7cd46a064aa3.png) I know this weird output is not caused by this extension. I just want to...