<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>QMIX on Chen Kai Blog</title><link>https://www.chenk.top/en/tags/qmix/</link><description>Recent content in QMIX on Chen Kai Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Wed, 10 Sep 2025 09:00:00 +0000</lastBuildDate><atom:link href="https://www.chenk.top/en/tags/qmix/index.xml" rel="self" type="application/rss+xml"/><item><title>Reinforcement Learning (9): Multi-Agent Reinforcement Learning</title><link>https://www.chenk.top/en/reinforcement-learning/09-multi-agent-rl/</link><pubDate>Wed, 10 Sep 2025 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/reinforcement-learning/09-multi-agent-rl/</guid><description>&lt;p>Single-agent RL rests on one quiet but enormous assumption: the environment is stationary. The transition kernel does not change while the agent learns. The moment a second learner shares the world, that assumption collapses. Each agent now sees an environment whose dynamics shift as its peers update, rewards become entangled across agents, and the joint action space explodes combinatorially. These are not engineering nuisances. They are the reason multi-agent RL needs its own algorithms instead of just &lt;em>running DQN n times in parallel&lt;/em>.&lt;/p></description></item></channel></rss>