OMIGA
OMIGA copied to clipboard
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"
Results
1
OMIGA issues
Sort by
recently updated
recently updated
newest added
In the get_episode() function, the rewards have been turned into reward-to-gos, which is not describe in the paper. for agent_trajectory in episode: rtgs = 0 for i in reversed(range(len(agent_trajectory))): rtgs...