Tagged: Function Calling

Feb 26, 2026 · Aliyun Bailian · 6 min read

Aliyun Bailian (2): The Qwen LLM API in Production

Picking a Qwen variant by latency and cost, function calling done right, JSON mode without tears, and the enable_thinking + streaming requirement that the docs gloss over.