python_uct
Why is the denominator of Q "1 + self.child_number_visits"?
https://github.com/brilee/python_uct/blob/7e78e3db012118d661889af66eb78e63137234d0/numpy_impl.py#L33
This prevents division by zero for children that have not been visited yet.
It does, however, differ from the formula in the paper.
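To make the difference concrete, here is a minimal sketch in plain Python. It is not the repository's code: the function names and the sample statistics are hypothetical, and it assumes the paper defines Q as the plain mean W / N (total value over visit count). Only the `1 + visits` denominator is taken from the line linked above.

```python
def q_plus_one(total_value, visits):
    """Q with the 1 + N denominator quoted in the question."""
    return total_value / (1 + visits)


def q_plain_mean(total_value, visits):
    """Plain mean W / N (assumed paper form); undefined when N == 0."""
    if visits == 0:
        return None  # dividing here would raise ZeroDivisionError
    return total_value / visits


# Hypothetical (total_value, visits) pairs; the second child is unvisited.
children = [(3.0, 4), (0.0, 0), (-1.0, 2)]

q_safe = [q_plus_one(w, n) for w, n in children]
q_mean = [q_plain_mean(w, n) for w, n in children]

for (w, n), safe, mean in zip(children, q_safe, q_mean):
    print(f"W={w:+.1f} N={n}: W/(1+N)={safe:+.3f}  W/N={mean}")
```

With the extra 1, an unvisited child gets Q = 0 instead of an undefined value, and every visited child's Q is pulled toward 0 by a factor of N / (1 + N), which approaches 1 as the visit count grows. With NumPy float arrays, the plain 0/0 division would yield `nan` and a runtime warning rather than an exception.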