Reinforcement Learning (2): Q-Learning and Deep Q-Networks (DQN)

Wed, 06 Aug 2025 09:00:00 +0000

In December 2013, a small DeepMind team uploaded a paper to arXiv with a striking claim: a single neural network, trained only on raw pixels and the game score, learned to play seven Atari games and beat the previous best approaches on six of them. No game-specific features. No hand-coded heuristics. The same architecture for Pong, Breakout, and Space Invaders. The algorithm was the Deep Q-Network (DQN), and it kicked off the deep reinforcement learning era.

