<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>GradNorm on Chen Kai Blog</title><link>https://www.chenk.top/en/tags/gradnorm/</link><description>Recent content in GradNorm on Chen Kai Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 31 May 2025 09:00:00 +0000</lastBuildDate><atom:link href="https://www.chenk.top/en/tags/gradnorm/index.xml" rel="self" type="application/rss+xml"/><item><title>Transfer Learning (6): Multi-Task Learning</title><link>https://www.chenk.top/en/transfer-learning/06-multi-task-learning/</link><pubDate>Sat, 31 May 2025 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/transfer-learning/06-multi-task-learning/</guid><description>&lt;p>A self-driving car using a single camera needs to do three things simultaneously: detect cars and pedestrians, segment lanes and free space, and estimate the distance of each pixel. Training three separate networks would triple the parameters, require three times as many forward passes at inference, and overlook the fact that all three tasks need the same low-level features (edges, surfaces, occlusion cues).&lt;/p>
&lt;p>Multi-task learning (MTL) is the alternative: one shared backbone, one task-specific head per output, all trained jointly. Done well, you cut parameters by 60% &lt;strong>and&lt;/strong> lift accuracy on every task because each task acts as a regularizer for the others. Done badly, two of your three tasks regress and you waste a week wondering why.&lt;/p></description></item></channel></rss>