Aliyun PAI (4): PAI-EAS — Model Serving, Cold Starts, and the TPS Lie

Sun, 08 Mar 2026 09:00:00 +0000

EAS is where the money goes. DSW costs a few hundred RMB a month for development. DLC costs spike. EAS bills 24/7 because someone might call your endpoint. That makes the “minimum replica count” in the autoscaler config the most critical setting in the entire platform. This article covers what I wish I’d known before shipping our first production endpoint.

