sample-factory icon indicating copy to clipboard operation
sample-factory copied to clipboard

High throughput synchronous and asynchronous reinforcement learning

Results 48 sample-factory issues
Sort by recently updated
recently updated
newest added

Integration and documentation with quad-swarm-rl. - fixed video generation by ignoring frames that are none and making a copy of frames for when the env reuses the same object to...

- integrate megaverse into sf2 - report signal_queue size in heartbeat

Initial implementation of an automatic model builder. Adds a default mode, which will generate encoder(s) automatically based on the shape(s) of the input observation(s). Adds a decoder option after the...

@edbeeching I will leave this here so we don't forget. Notes: - cpu_affinity() returns only half of the cores (40 instead of 80) which leads to workers only using 50%...

Good results on Atari is a must for any popular RL framework. Scope of this work: 1) Create `train_atari.py` and `enjoy_atari.py` scripts similar to `train_vizdoom` or `train_isaacgym` that would register...

The env info caching should be improved too. First of all, we need to figure out env info before we can create any of the actual sf2 processes because we...

I am trying to figure out how I could run quick evals during training. I saw an old issue about creating an eval worker using the current infrastructure, but that...

* Make sure we have an example folder and we can run 6 standard Mujoco environments * Try to match results from OpenRLBenchmark in terms of sample efficiency * Try...

Hi, Let's say we have a bunch of features (> 100) and the model is trained and working (but took forever to train). Is there a way to tell which...

Is there a way to restrict sample-factory to only use a subset of the CPUs on the machine instead of splitting them all amongst the number of workers? Thanks!