Jiachen W

Results 10 comments of Jiachen W

Maybe simply enable `reanalyze`?

Kindly try [EfficientZero](https://github.com/YeWR/EfficientZero), which also controls the reanalyze part with a fraction argument.

Purely state-based RL should be much easier than pixel-based RL. You can still use it, just with different inputs.

> Hello, I think the situation you describe corresponds to a multi-agent setting. In this case, MuZero is not really suitable for this type of configuration. You could also...

same problem... so sad

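Version 2.12.14 still reports `invalid origin`. Manjaro Linux x86_64 5.4.195-1-MANJARO. I downloaded the latest release archive, extracted it, and ran it directly, and I also deleted the old version.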

Check here: https://spinningup.openai.com/en/latest/algorithms/trpo.html

Hi, may I ask whether it is possible for us to develop our own version of LOOT? It looks really cool!

Isn't SAC already available?

> Maybe the reason is that the texlive in the sharelatex container does not support xelatex. Referring to this issue, I ran these commands in the container to solve the problem: https://github.com/overleaf/overleaf/issues/703#issuecomment-571502872 > > `...