mgail
Fixed std for policy or learned std?
Hi, I have a question regarding this line of code. https://github.com/itaicaspi/mgail/blob/b3b91aa5e0bd47923f726a27522f45146721940d/mgail.py#L109
It seems you are using a Gaussian policy with a fixed std rather than a learned one. Am I understanding this correctly?
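To make the question concrete, here is a minimal sketch of the two options I have in mind. The function names, the 0.1 default, and the NumPy sampling are my own illustration and are not taken from mgail.py.

```python
import numpy as np

def sample_fixed_std(mu, std=0.1, rng=None):
    """Gaussian policy whose std is a constant hyperparameter (hypothetical helper)."""
    rng = np.random.default_rng() if rng is None else rng
    # The noise scale never changes during training.
    return mu + std * rng.standard_normal(mu.shape)

def sample_learned_std(mu, log_std, rng=None):
    """Gaussian policy whose log-std is a trainable value (hypothetical helper)."""
    rng = np.random.default_rng() if rng is None else rng
    # log_std would be a trainable variable or a network output; exp keeps std positive.
    return mu + np.exp(log_std) * rng.standard_normal(mu.shape)
```

In the first case only the mean comes from the policy network, while in the second the optimizer can also adjust the amount of exploration noise.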