MADRaS icon indicating copy to clipboard operation
MADRaS copied to clipboard

Restructuring MADRaS into a Multi-agent MDP

Open Santara opened this issue 5 years ago • 1 comments

Pending work: [Do not merge with master until the list is empty]

  1. Add randomization features
  2. Add training code for rllib
  3. Add done handling for the case when the environment done is set when all the agents hit done.

Santara avatar Dec 15 '19 11:12 Santara

Updates:

For bringing the team up to speed with the latest developments and future actions.

Required:

  • [ ] Simplify code structure. Merge v1 and v2 version of the files (For eg., reward handler, obs handler, and done handler).
  • [ ] Add randomization and experiment automation features to v2.
  • [ ] Make single agent a special case of Multi-Agent structure.

Additional (To be covered in either this or a separate PR):

  • [ ] Unify agent architecture and implement agents with hierarchy (General = > Split (Traffic, Learning)).
  • [ ] Create a general controller interface.
  • [ ] Ground-Up rebuild of the communication buffer.

rudrasohan avatar Oct 05 '20 12:10 rudrasohan