2048-NN icon indicating copy to clipboard operation
2048-NN copied to clipboard

For training, what did you use for determining reward?

Open NullVoxPopuli opened this issue 3 years ago • 0 comments

I'm working on a similar project in my free time, and am curious on much info should go into the reward function, how heavy to weight certain actions or failures, etc

NullVoxPopuli avatar Jul 08 '20 18:07 NullVoxPopuli