2048-NN For training, what did you use for determining reward?

Open NullVoxPopuli opened this issue 3 years ago • 0 comments

I'm working on a similar project in my free time, and I'm curious how much info should go into the reward function, how heavily to weight certain actions or failures, etc.

Jul 08 '20 18:07 NullVoxPopuli
