Reinforcement Learning (12): RLHF and LLM Applications

Thu, 25 Sep 2025 09:00:00 +0000

GPT-3 (June 2020) and ChatGPT (November 2022) share most of their weights. The base model could write fluent prose, complete code, and continue any pattern you gave it. Yet, when asked a simple question, it might ramble, refuse for the wrong reasons, hallucinate citations, or produce toxic content. The two and a half years between GPT-3 and ChatGPT weren’t spent on larger transformers. Instead, they focused on how to make the model useful — a reinforcement-learning problem.

ChatGPT on Chen Kai Blog

Reinforcement Learning (12): RLHF and LLM Applications