OMIGA icon indicating copy to clipboard operation
OMIGA copied to clipboard

[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"

Results 1 OMIGA issues
Sort by recently updated
recently updated
newest added

In the get_episode() function, the rewards have been turned into reward-to-gos, which is not describe in the paper. for agent_trajectory in episode: rtgs = 0 for i in reversed(range(len(agent_trajectory))): rtgs...