marley issues

There's a bug I observed: When there are a bunch of bullets one after another, and the player walks towards them, sometimes some of the bullets don't hit the player,...

cool-RR

Research temporal-difference learning

cool-RR

Observation should be used to predict next observation instead of reward

Then get the reward from that. Maybe we need a few iterations of that.

cool-RR

Research off-policy learning

cool-RR

Improve `measure` by including states in which the food isn't in a straight line

cool-RR

Core logic

4

Hello there Im very intrigued by the idea of this project. I was thinking that maybe the core logic of it is a bit too "biological" for being basis of...

arashmh

Make the algorithm actually produce smart behavior

1

cool-RR

Add an action to build a wall

Add an action to build a wall next to the player. A wall can't be passed by a creature, walking into it results in collision damage. It must be shot...

cool-RR

Optimization: Make all the creatures calculate their next step in parallel

Calling `predict` for each observation separately is a huge time-sink. We should batch these as much as possible. All creatures of the same strategy should batch their call to `predict`...

cool-RR

marley
marley copied to clipboard

Metadata

Bundle immutabledict instead of requiring it

Fix bug with player walking into oncoming bullets

Research temporal-difference learning

Observation should be used to predict next observation instead of reward

Research off-policy learning

Improve `measure` by including states in which the food isn't in a straight line

Core logic

Make the algorithm actually produce smart behavior

Add an action to build a wall

Optimization: Make all the creatures calculate their next step in parallel

← Metadata

Owner

Metadata

marley marley copied to clipboard

Metadata

← Metadata

Owner

Metadata

marley
marley copied to clipboard