OMIGA
OMIGA copied to clipboard

→

Metadata

[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"

Reame
Issues

Results 1 OMIGA issues

Sort by recently updated

The confusing transformation about rewards to rtgs.

In the get_episode() function, the rewards have been turned into reward-to-gos, which is not describe in the paper. for agent_trajectory in episode: rtgs = 0 for i in reversed(range(len(agent_trajectory))): rtgs...

RZ-Q

← Metadata

Stars

Forks

Watchers

Owner

ZhengYinan-AIR

Metadata

[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"

Back

OMIGA OMIGA copied to clipboard

Metadata

The confusing transformation about rewards to rtgs.

← Metadata

Owner

Metadata

OMIGA
OMIGA copied to clipboard