ptan icon indicating copy to clipboard operation
ptan copied to clipboard

pollicy functions should use torch functions instead of numpy

Open YiTanJang opened this issue 6 years ago • 0 comments

https://pytorch.org/docs/stable/distributions.html score functions and categorical sampling is already implemented in pytorch, using numpy should be discouraged. policy network should output a probability distribution

YiTanJang avatar Aug 07 '18 06:08 YiTanJang