Dreamer

Aug 21, 2025 Reinforcement Learning 28 min read

Reinforcement Learning (5): Model-Based RL and World Models

From Dyna and MBPO to World Models, Dreamer, and MuZero -- how learning a model lets agents plan in imagination and reach expert performance with 10-100x fewer real interactions.