寇谦

Results 6 issues of 寇谦

I want to know if there is any research on leveraging RLHF in MARL problems.

![training result of maddpg solve simple_tag_v2](https://github.com/Git-123-Hub/maddpg-pettingzoo-pytorch/assets/62536298/859b8f23-071d-48e7-aa74-46bae3bce123)

Hello! The class MediumLevelPlanner used in overcooked.py seems to be deprecated in the latest overcooked_ai version. Which class can be used instead?

In the get_episode() function, the rewards are converted into rewards-to-go, which is not described in the paper: `for agent_trajectory in episode: rtgs = 0 for i in reversed(range(len(agent_trajectory))): rtgs...`
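For readers unfamiliar with the term, here is a minimal sketch of what a reward-to-go transform usually looks like. It is not the repository's code: the function name `rewards_to_go`, the flat list-of-floats input, and the optional `gamma` discount are all assumptions made for illustration, since the original snippet is truncated and the trajectory layout is not shown.

```python
from typing import List


def rewards_to_go(rewards: List[float], gamma: float = 1.0) -> List[float]:
    """Replace each per-step reward with the (discounted) sum of rewards from that step on.

    Hypothetical helper: `rewards` is assumed to be one agent's per-step rewards,
    and gamma=1.0 gives the plain undiscounted sum.
    """
    rtgs = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step can reuse the total accumulated after it.
    for i in reversed(range(len(rewards))):
        running = rewards[i] + gamma * running
        rtgs[i] = running
    return rtgs


print(rewards_to_go([1.0, 0.0, 2.0]))  # [3.0, 2.0, 2.0]
```

Iterating in reverse keeps the transform linear in the episode length, which matches the backward loop visible in the quoted snippet.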

Could you please provide the api server.py for chunk conversion?