Portfolio · 2026

陈锴 / Chen Kai

Engineer · Researcher · Builder of agent systems

I build systems on top of long-running agents — marketing automation, autonomous research, agent control planes. I care most about the unglamorous parts that quietly hold everything together: observability, token economics, skill evolution, and shared memory.

9 active projects 5 public 4 internal
Beijing · Remote
Replies usually within 24h
Filter
public 2026
controller email creative video voice

AI4Marketing

Product · Architecture · Full-stack

Operators only ever speak to the controller. Everything else — email blasts, ad creative, video, TTS — is handled by sub-agents that own their own domain. The whole platform is built so a single sentence can become a campaign.

Qwen3-Max DashScope Wanxiang Video Qwen-TTS OSS Nginx
11Aliyun products
170+test cases
20verify modules
llm4marketing.com
public 2024–now

chenk.top

Writing · Theme · Image pipeline

The site you're reading. Hugo with a hand-rolled theme, ~400 bilingual long-form pieces, ~1500 matplotlib figures pushed through a homegrown pipeline to Aliyun OSS. Designed to feel like a magazine, not a blog template.

Hugo Custom theme matplotlib OSS CDN
~400articles
~1500figures
14+series
internal 2025–now

AI4Science

Architecture · Agent control plane

A control plane for long-running research agents that read papers, run experiments, and write reports. The design hinges on a shared memory architecture plus harness-based skill evolution — failed attempts compress into lessons that feed the next run.

Claude Code Gemini DingTalk Stream SQLite A2A orchestration
Design principles
  • Shared memory as the inter-agent protocol
  • Lesson compression instead of explicit supervision
  • Cross-provider token-economics dashboard
In production · internal use
internal 2025–now

Research Agent

Pipeline design · Writer sub-process

An autonomous pipeline that takes a question and returns a finished report. The writer sub-pipeline is decomposed into small, reentrant agents tied to a skill-evolution loop — every failure feeds back into the harness and shared memory, raising the floor each run.

harness shared memory writer pipeline skill evolution
Design principles
  • One door in: question to report
  • Writer split into reentrant micro-agents
  • Failures compressed into reusable skills
Internal research system
internal 2025

CK Planet

Agent control plane · Live dashboard

A live dashboard for agent operations: session timelines, rate-limit budgets, and token economics across providers. The premise is to treat agents as real systems with cost and failure modes — not as a prompt.

provider-agnostic rate limiting log streaming session timelines
Design principles
  • A single abstraction across providers
  • Defensive rate limiting before performance
  • Logs as a subscribable, replayable stream
Internal infrastructure
internal 2025

Memory System

Design pattern · Personal use

File-based persistent memory across Claude Code sessions. User profile, feedback, project state, and references each get their own type. MEMORY.md acts as an index; concrete fragments load lazily on demand.

typed memory MEMORY.md index lazy semantic recall
Design principles
  • Memory typed by purpose, never mixed
  • Index always loaded, body loaded on demand
  • Files are the source of truth — greppable, diffable
Pattern in long-term personal use
public series · 5 parts

Aliyun PAI Practical Guide

From PAI-Designer visual modelling to EAS elastic inference — covering every critical node of a machine-learning production chain.

PAI-Designer DSW EAS
public series · 5 parts

Aliyun Bailian Practical Guide

From model selection to agent orchestration — translating Bailian's raw capabilities into shippable product surfaces.

Qwen RAG Agent app
public series · 8 parts

Terraform for AI Agents

Provisioning agent systems with Terraform: a plan-validate-apply loop that turns natural-language intent into auditable, reversible real-world resources.

Terraform IaC plan-validate-apply
Currently exploring

A few things on my desk

01

Agent token economics

How should a long-running agent split its budget across providers? Which steps are worth the most expensive model?

02

Limits of skill evolution

Which failures can compress into reusable skills, and which can only be discarded?

03

Schemas for shared memory

When multiple agents share one memory, how strict should the type system be?

04

Prompt to product

The last mile from agent demo to shipped product — observability, replay, governance, user trust.

Get in touch

Interesting projects, collaborations, or just ideas — all welcome.