RLs
RLs copied to clipboard
实现新的强化学习算法
-
MARL:
- [x] MADDPG
- [x] MASAC 1346949
- [x] IQL
- [x] VDN
- [x] Q-MIX
- [x] Qatten ad8be31
- [ ] MAPPO
- [ ] COMA
- [ ] QTRAN-alt
- [x] QTRAN-base 4c45ba0
- [x] QPLEX 92d4b9a
-
SARL:
- Model-free
- [ ] CEM
- [x] TRPO 67b8979
- [x] NPG 71115ea
- [ ] FQF
- Model-based:
- [x] Dreamer b7d88a1
- [x] MVE 14c9bfc
- [ ] STEVE
- [ ] MBPO
- [x] PlaNet 7965bcf
- [x] DreamerV2 7f988d4
- Offline:
- [ ] BC
- [x] CQL 026ba1d
- [x] BCQ d60741c
- [ ] AWR
- [ ] BRAC
- Model-free
- [ ] 优化MARL中的训练部分,避免繁多的键值索引