Recommendation-Systems on Chen Kai Blog

Recommendation Systems (16): Industrial Architecture and Best Practices

Thu, 15 Jan 2026 09:00:00 +0000

The hardest part of a production recommendation system isn’t the model. It’s the system around the model: the feature store that prevents training/serving skew, the canary deployment that catches regressions before they hit 100M users, and the orchestration that meets a 100ms p95 latency budget while running four ML models in sequence. This final article describes the architecture that every major tech company has converged on — and the trade-offs within each layer.

Recommendation Systems (15): Real-Time Recommendation and Online Learning

Mon, 12 Jan 2026 09:00:00 +0000

A user opens your app at 14:02 and searches for ’trail running shoes’. By 15:30, they’ve moved on to reading kitchen reviews. A model that retrains nightly still shows them Salomon ads at 16:00 — and that gap is exactly the bug a real-time system fixes. The interesting part isn’t ‘make it faster’ but ‘what should be fast’ — most features add nothing to AUC even when made real-time, and the wrong design point wastes money without improving performance.

Recommendation Systems (14): Cross-Domain Recommendation and Cold-Start Solutions

Fri, 09 Jan 2026 09:00:00 +0000

When Netflix launches in a new country, it inherits millions of users with no history and a catalog with no local ratings. Amazon faces the same issue each time it opens a new product category. Pure collaborative filtering, the workhorse of warm-state recommendations, has nothing to compute. Techniques that make recommendations work in this scenario include: bootstrap heuristics for the first request, meta-learning after a few interactions, cross-domain transfer when a related domain is rich, and bandits to keep exploring once the model is confident. This post walks through these techniques, anchored to the papers they come from.

Recommendation Systems (13): Fairness, Debiasing, and Explainability

Tue, 06 Jan 2026 09:00:00 +0000

A user opens Spotify and the same fifty songs keep appearing. They open Amazon and the top results are always the items they have already considered. They open YouTube and every recommendation is one click away from a rabbit hole they cannot remember asking for. Each of these symptoms has a name, a cause, and a fix. This article is about all three.

Recommendation Systems (12): Large Language Models and Recommendation

Sat, 03 Jan 2026 09:00:00 +0000

A user opens a movie app and types: “Something like Inception, but less depressing.” A traditional recommender — collaborative filtering, two-tower DNN, even DIN — sees zero useful tokens here. It has no like button to count, no co-watch graph to traverse, no user ID with history. The query has to be turned into IDs before the system can do anything.

A Large Language Model has the opposite problem: it has too much world knowledge but doesn’t know who this user is. It knows Inception is a Christopher Nolan film with non-linear narrative and a hopeful-but-ambiguous ending; it knows what “depressing” means in cinema; it can name twenty films that fit. But it can’t tell you which of those twenty the current user has already seen, rated badly, or left half-watched.

Recommendation Systems (11): Contrastive Learning and Self-Supervised Learning

Wed, 31 Dec 2025 09:00:00 +0000

Classical recommenders learn from one signal: did a user click, watch, or buy? That signal is precious, but it is also brutally sparse. Most users touch fewer than 1% of the catalogue, most items are touched by fewer than 0.1% of users, and a brand-new item or user has nothing at all. Optimising a model directly against such sparse labels almost guarantees overfitting on the head and silence on the tail.

Recommendation Systems (10): Deep Interest Networks and Attention Mechanisms

Sun, 28 Dec 2025 09:00:00 +0000

A good chef doesn’t cook the same dish for every guest. She watches you walk in, notes the wine you order, and glances at how you eye the chalkboard — then decides whether tonight’s special should be the steak or the risotto. Your past visits matter, but only the parts that fit this mood.

A recommendation model used to be a worse chef. It would take everything the user had ever clicked, average it into a single vector, and serve the same dish to everyone in the room. The vintage leather jacket you viewed last week and the random phone charger you clicked six months ago carried equal weight, regardless of what you’re looking at now.

Recommendation Systems (9): Multi-Task Learning and Multi-Objective Optimization

Thu, 25 Dec 2025 09:00:00 +0000

A live e-commerce ranker doesn’t optimize just one number. The same model that decides which product to show you also predicts, in the same forward pass, whether you will click, add it to your cart, pay for it, return it, or leave a positive review. Each prediction is a different task with its own data distribution, scarcity, and incentives. These tasks are tightly coupled: a clicker is more likely to convert, a converter is more likely to write a review, and a high-CTR thumbnail can attract clicks that reduce watch time.

Recommendation Systems (8): Knowledge Graph-Enhanced Recommendation

Mon, 22 Dec 2025 09:00:00 +0000

When you search for The Dark Knight on a streaming platform, the system doesn’t just log that you watched it. It knows that Christian Bale played Batman, Christopher Nolan directed it, it’s part of the Batman trilogy, and it shares cinematic DNA with other cerebral action films. This rich semantic web is a knowledge graph (KG) — a structured network of entities (movies, actors, directors, genres) connected by typed relations (acted_in, directed_by, part_of).

Recommendation Systems (7): Graph Neural Networks and Social Recommendation

Fri, 19 Dec 2025 09:00:00 +0000

When Netflix decides what to recommend next, it does not look at your watch history in isolation. Behind the scenes there is a web of relationships: movies that share actors, users with overlapping taste, ratings that ripple through the catalogue. The “graph” view is not a metaphor — every interaction matrix is a graph, and treating it as one unlocks ideas that flat user/item embeddings cannot express.

Graph neural networks (GNNs) are the tool that lets us reason over that graph. Instead of learning each user and each item in isolation, a GNN says: your representation is shaped by the company you keep. That single shift powers Pinterest’s billion-node PinSage, the strikingly simple LightGCN that beats heavier baselines on collaborative filtering, and the social-recommendation systems that fuse “what you watched” with “what your friends watched.”

Recommendation Systems (6): Sequential Recommendation and Session-based Modeling

Tue, 16 Dec 2025 09:00:00 +0000

When you scroll TikTok, every recommendation feels eerily on-point — not because the system reads your mind, but because it reads the order of what you just watched. A cooking video followed by a travel vlog tells a different story than the same two clips in reverse. That ordering is exactly the signal that sequential recommenders are built to exploit.

Compare two friends recommending shows. The first knows your favourite genres but never asks what you watched last week. The second says, “You just finished three sci-fi thrillers in a row — try this one.” Traditional collaborative filtering is friend one. Sequential recommendation is friend two.

Recommendation Systems (5): Embedding and Representation Learning

Sat, 13 Dec 2025 09:00:00 +0000

When Netflix suggests Inception to someone who just finished The Dark Knight, the magic is not a hand-crafted “if-watched-Nolan-then” rule. It is geometry. Both films sit close together in a 128-dimensional embedding space that the model has learned from billions of viewing events. Geometry replaces enumeration: instead of comparing a movie to fifteen thousand others through brittle similarity rules, the system asks a single question — how far apart are these two vectors?

Recommendation Systems (4): CTR Prediction and Click-Through Rate Modeling

Wed, 10 Dec 2025 09:00:00 +0000

Every time you scroll through a social-media feed, click a product recommendation, or watch a suggested video, a CTR (click-through rate) model decides what to show you. These models answer one deceptively small question:

“What is the probability that this specific user will click on this specific item, right now?”

Behind that question lies one of the most economically valuable problems in machine learning. A 1% lift in CTR translates into millions of dollars at the scale of Google, Amazon, or Alibaba — and the same models also drive video feeds, app stores, news apps, and dating apps. CTR prediction sits at the heart of the ranking stage: candidate generation gives you a few thousand items, and the CTR model decides which dozen actually reach the user.

Recommendation Systems (3): Deep Learning Foundations

Sun, 07 Dec 2025 09:00:00 +0000

In June 2016, Google published a one-page paper that quietly redrew the map of recommendation systems. The paper described Wide & Deep Learning, the model then powering app recommendations inside Google Play — a billion-user product. Within a year, every major tech company had a deep model in production. By 2019, the industry standard had shifted: matrix factorization was a baseline, not a system.

What changed? Multi-layer neural networks brought four capabilities classical methods could not deliver:

Recommendation Systems (2): Collaborative Filtering and Matrix Factorization

Thu, 04 Dec 2025 09:00:00 +0000

You finish The Shawshank Redemption and want something with the same feeling. A genre filter would surface every prison drama ever made, most of them awful. Collaborative filtering takes a different route: it never looks at the movie itself. It looks at people who watched what you watched and asks what else they loved.

That single idea — let the crowd’s behaviour speak — powers Amazon, YouTube, Spotify and every modern feed. This article unpacks the algorithms behind it, from the neighbourhood methods of the 1990s to the matrix-factorization models that won the Netflix Prize.

Recommendation Systems (1): Fundamentals and Core Concepts

Mon, 01 Dec 2025 09:00:00 +0000

Open Netflix and the homepage somehow knows you. Scroll TikTok and the next video is the one you didn’t realise you wanted. Drop into Spotify on a Monday morning and Discover Weekly serves up thirty songs you’ve never heard of, and you save half of them.

None of this is magic. It is one of the most commercially successful applications of machine learning, quietly running behind almost every consumer product you use: the recommendation system.