Aleksei Petrenko comments

Results 117 comments of


                                            Aleksei Petrenko

TypeError: cannot deepcopy this pattern object

I think this is better asked in the swarm-rl repo. Also, would be great if you could post the whole error trace. For now, I suggest changing `--replay_buffer_sample_prob=0.75` to `--replay_buffer_sample_prob=0`...

Run tasks on Docker failed. No module named 'megaverse.extension'

@GoingMyWay thank you for reporting! @BoyuanLong can you please take a look?

Run tasks on Docker failed. No module named 'megaverse.extension'

@erikwijmans is right. The loop is trying to iterate self.env_runners which is None. It is, of course, not supposed to be None. Very likely something happened earlier in the log,...

Sf2 swarm

This is not ready to merge yet, right?

Can we train maddpg-pytorch on sample-factory?

Hello! SampleFactory is not a simulator, but a reinforcement learning algorithm. If you're looking for fast simulators, check out our Megaverse: https://www.megaverse.info/ It supports multi-agent training at 10^5-10^6 samples per...

Can we train maddpg-pytorch on sample-factory?

Megaverse is the RL environment. I believe it should be similar to using any other RL environment with this algorithm. There is not documentation fort this specific algorithm

How to generate fixed, deterministic episode on dmlab30?

Hi! Can you explain what exactly you mean by a deterministic episode? I.e. you have a trained policy, and you want to evaluate it in a way that is consistent...

[question] How to run evals during training?

Hi @nathanlct I'd say the easiest way is to modify the actor worker class to make one of them "special" in some way. I.e. you can make actor_worker #0 into...

[question] How to run evals during training?

Hi @nathanlct ! Sorry for the delay > I am trying to put something together but am still struggling to understand how the code works. > > So from what...

Improvements to env info cache

@edbeeching maybe you can take a look if you have time! :)