marley
A framework for multi-agent reinforcement learning.
There's a bug I observed: When there are a bunch of bullets one after another, and the player walks towards them, sometimes some of the bullets don't hit the player,...
Then get the reward from that. Maybe we need a few iterations of that.
Hello there, I'm very intrigued by the idea of this project. I was thinking that maybe its core logic is a bit too "biological" to be the basis of...
Add an action to build a wall next to the player. A wall can't be passed by a creature; walking into it results in collision damage. It must be shot...
Calling `predict` for each observation separately is a huge time-sink. We should batch these as much as possible. All creatures of the same strategy should batch their call to `predict`...
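A minimal sketch of the batching idea, under stated assumptions: each creature is represented as a `(strategy, observation)` pair, and a strategy's `predict` accepts a list of observations and returns one action per observation. `CountingStrategy` and `batched_actions` are hypothetical names used only for illustration, not part of marley's actual API.

```python
from collections import defaultdict


class CountingStrategy:
    """Hypothetical stand-in for a strategy; records how often predict is called."""

    def __init__(self, offset):
        self.offset = offset
        self.calls = 0

    def predict(self, observations):
        # Assumed interface: one call handles a whole batch of observations.
        self.calls += 1
        return [sum(obs) + self.offset for obs in observations]


def batched_actions(creatures):
    """Return one action per creature, calling predict once per distinct strategy.

    creatures: list of (strategy, observation) pairs (an assumed representation).
    """
    # Group creatures by strategy object, remembering each creature's position.
    groups = defaultdict(list)
    for index, (strategy, observation) in enumerate(creatures):
        groups[id(strategy)].append((index, strategy, observation))

    actions = [None] * len(creatures)
    for members in groups.values():
        strategy = members[0][1]
        batch = [observation for _, _, observation in members]
        results = strategy.predict(batch)  # single batched call
        # Scatter the results back to the creatures' original positions.
        for (index, _, _), action in zip(members, results):
            actions[index] = action
    return actions


if __name__ == "__main__":
    a, b = CountingStrategy(0), CountingStrategy(100)
    print(batched_actions([(a, [1, 2]), (b, [3, 4]), (a, [5, 6])]))
```

Grouping by strategy object keeps the number of `predict` calls equal to the number of distinct strategies rather than the number of creatures, and the saved indices preserve the original creature order.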