Ronald Tourtellot
Ronald Tourtellot
I had many of my own updates to the original baselines and environment files, such as changing the logging output to show each of the cpus updating rewards in real...
> I haven't grabbed the new version yet, but I have been fiddling with a negative reward for making a movement that doesn't move the player to encourage better exploration...
> > Are you starting from init state or from the skip having a pokedex already? I'm starting at init and tried a couple of the things you mentioned but...