PCGrad
PCGrad copied to clipboard
Experiments on MetaWorld
Hello, Thank you for your work! I'm currently trying to reimplement your method on meta world and have a question regarding the multi-head SAC you in the paper. Are both q net and policy net multi-head models in your case or only the policy net is multi-head? Thanks!