assistive-gym ValueError when using SAC with co-optimization

ValueError when using SAC with co-optimization

Open zxhuang97 opened this issue 4 years ago • 1 comments

Thank you for sharing this wonderful repository. When I try to run experiments with co-optimization, PPO is fine. But when I try SAC there is a strange error.

ValueError: Have multiple policies {'human': <ray.rllib.policy.tf_policy_template.SACTFPolicy object at 0x7f8ec4436470>, 'robot': <ray.rllib.policy.tf_policy_template.SACTFPolicy object at 0x7f8ebc685ef0>}, but the env <NormalizeActionWrapper<FeedingSawyerHumanEnv instance>> is not a subclass of BaseEnv, MultiAgentEnv or ExternalMultiAgentEnv?

This seems to be related to this issue of RLlib.

Feb 15 '21 07:02 zxhuang97

Thanks for pointing out this issue! I have been looking into this recently, but haven't found the solution quite yet. I'll keep you updated.

Mar 05 '21 21:03 Zackory

assistive-gym assistive-gym copied to clipboard

ValueError when using SAC with co-optimization

assistive-gym
assistive-gym copied to clipboard