<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Agents on Chen Kai Blog</title><link>https://www.chenk.top/en/tags/agents/</link><description>Recent content in Agents on Chen Kai Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Thu, 02 Apr 2026 09:00:00 +0000</lastBuildDate><atom:link href="https://www.chenk.top/en/tags/agents/index.xml" rel="self" type="application/rss+xml"/><item><title>LLM Engineering (7): Function Calling and Tool Use</title><link>https://www.chenk.top/en/llm-engineering/07-function-calling/</link><pubDate>Thu, 02 Apr 2026 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/llm-engineering/07-function-calling/</guid><description>&lt;p>Function calling connects an LLM to the world outside its weights. It combines chat-template details (&lt;a href="https://www.chenk.top/en/llm-engineering/02-tokenization/">Chapter 2&lt;/a>
), structured-output kernels (&lt;a href="https://www.chenk.top/en/llm-engineering/05-inference/">Chapter 5&lt;/a>
), and prompt engineering (&lt;a href="https://www.chenk.top/en/llm-engineering/09-prompting/">Chapter 9&lt;/a>
). This chapter explores what happens under the hood, the guarantees you can rely on, and the agent-loop patterns that handle real workloads.&lt;/p>
&lt;p>The intellectual lineage matters. Tool use as an LLM capability traces back to two near-simultaneous papers in 2022: &lt;strong>MRKL Systems&lt;/strong> (Karpas et al., AI21) which proposed expert-routing among neuro-symbolic modules, and &lt;strong>ReAct&lt;/strong> (&lt;a href="https://arxiv.org/abs/2210.03629" target="_blank" rel="noopener noreferrer">Yao et al., 2022 &lt;span aria-hidden="true" style="font-size:0.75em; opacity:0.55; margin-left:2px;">↗&lt;/span>&lt;/a>
) which interleaved chain-of-thought reasoning with tool actions. &lt;strong>Toolformer&lt;/strong> (&lt;a href="https://arxiv.org/abs/2302.04761" target="_blank" rel="noopener noreferrer">Schick et al., 2023 &lt;span aria-hidden="true" style="font-size:0.75em; opacity:0.55; margin-left:2px;">↗&lt;/span>&lt;/a>
) showed self-supervised teaching of tool use, generating training data by having a model insert tool-call markers into existing text. By 2024 every frontier model had post-training data structured around the tool-use format, and tool calling moved from &amp;ldquo;research demo&amp;rdquo; to &amp;ldquo;API feature.&amp;rdquo;&lt;/p></description></item><item><title>NLP (12): Frontiers and Practical Applications</title><link>https://www.chenk.top/en/nlp/frontiers-applications/</link><pubDate>Tue, 25 Nov 2025 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/nlp/frontiers-applications/</guid><description>&lt;p>We have spent eleven chapters climbing from raw text to multimodal foundation models. This twelfth and final chapter sits at the frontier and at the runway. It is where research stops being a paper and starts being a service: an LLM that calls tools, writes and debugs code, reasons through hundred-step problems, ingests a 200K-token contract, and serves a thousand concurrent users behind a FastAPI endpoint with p95 latency under 300 ms.&lt;/p></description></item></channel></rss>