Alibaba Cloud Full Stack (10): Bailian and DashScope — The LLM Layer

Thu, 07 May 2026 09:00:00 +0000

When I first needed an LLM API for a production app in China, my options were limited and expensive. Most international providers had no mainland endpoint, billing required a foreign credit card, and latency from calling US-based APIs was 800ms+ before a single token came back. Then Qwen showed up on DashScope with an OpenAI-compatible endpoint, and suddenly building AI products in China became as straightforward as anywhere else. Same SDK, same request shape, same streaming protocol — just a different base_url and a key from the Bailian console. I have been running production workloads against it for over a year now, and this article is the comprehensive walkthrough I wish I had on day one.

Aliyun Bailian (2): The Qwen LLM API in Production

Thu, 26 Feb 2026 09:00:00 +0000

This article in the series covers most of the production wins. While the other models are interesting, the LLMs are what every product I’ve shipped on Bailian calls every minute of every day. The official Qwen API reference is dense and complete; this article is the readable companion that guides you through it.

Pick the right Qwen variant for the workload#

The Qwen family is large. Some teams overspend by defaulting to qwen-max everywhere; others underspend on quality by defaulting to qwen-turbo. The right answer is “match variant to job”:

Qwen on Chen Kai Blog

Alibaba Cloud Full Stack (10): Bailian and DashScope — The LLM Layer

Aliyun Bailian (2): The Qwen LLM API in Production

Pick the right Qwen variant for the workload#