QMIX

Sep 10, 2025 Reinforcement Learning 28 min read

Reinforcement Learning (9): Multi-Agent Reinforcement Learning

A working tour of multi-agent RL: Markov games, the non-stationarity and credit-assignment problems, CTDE, value decomposition (VDN, QMIX), counterfactual baselines (COMA), MADDPG, communication topologies, and the …