Lawrence Knight
Results
1
issues of
Lawrence Knight
The example given in open_spiel/python/examples/deep_cfr_pytorch.py fails to converge to the known average policy value of 1/18. It looks like this was tested at one point and converged as expected.
contribution welcome