<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Multi-Task Learning on Chen Kai Blog</title><link>https://www.chenk.top/en/tags/multi-task-learning/</link><description>Recent content in Multi-Task Learning on Chen Kai Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Thu, 25 Dec 2025 09:00:00 +0000</lastBuildDate><atom:link href="https://www.chenk.top/en/tags/multi-task-learning/index.xml" rel="self" type="application/rss+xml"/><item><title>Recommendation Systems (9): Multi-Task Learning and Multi-Objective Optimization</title><link>https://www.chenk.top/en/recommendation-systems/09-multi-task-learning/</link><pubDate>Thu, 25 Dec 2025 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/recommendation-systems/09-multi-task-learning/</guid><description>&lt;p>A live e-commerce ranker doesn&amp;rsquo;t optimize just one number. The same model that decides which product to show you also predicts, in the same forward pass, whether you will click, add it to your cart, pay for it, return it, or leave a positive review. Each prediction is a different &lt;em>task&lt;/em> with its own data distribution, scarcity, and incentives. These tasks are tightly coupled: a clicker is more likely to convert, a converter is more likely to write a review, and a high-CTR thumbnail can attract clicks that reduce watch time.&lt;/p></description></item><item><title>Transfer Learning (6): Multi-Task Learning</title><link>https://www.chenk.top/en/transfer-learning/06-multi-task-learning/</link><pubDate>Sat, 31 May 2025 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/transfer-learning/06-multi-task-learning/</guid><description>&lt;p>A self-driving car using a single camera needs to do three things simultaneously: detect cars and pedestrians, segment lanes and free space, and estimate depth at every pixel. 
Training three separate networks would triple the parameters, require three times as many forward passes at inference, and overlook the fact that all three tasks need the same low-level features (edges, surfaces, occlusion cues).&lt;/p>
&lt;p>Multi-task learning (MTL) is the alternative: one shared backbone, one task-specific head per output, all trained jointly. Done well, you can cut parameters by around 60% &lt;strong>and&lt;/strong> lift accuracy on every task, because each task acts as a regularizer for the others. Done badly, two of your three tasks regress and you waste a week wondering why.&lt;/p></description></item></channel></rss>