<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>LSTM on Chen Kai Blog</title><link>https://www.chenk.top/en/tags/lstm/</link><description>Recent content in LSTM on Chen Kai Blog</description><generator>Hugo</generator><language>en</language><lastBuildDate>Sat, 11 Oct 2025 09:00:00 +0000</lastBuildDate><atom:link href="https://www.chenk.top/en/tags/lstm/index.xml" rel="self" type="application/rss+xml"/><item><title>NLP (3): RNN and Sequence Modeling</title><link>https://www.chenk.top/en/nlp/rnn-sequence-modeling/</link><pubDate>Sat, 11 Oct 2025 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/nlp/rnn-sequence-modeling/</guid><description>&lt;p>Open Google Translate, swipe-type a message, or dictate a memo to your phone — all these systems consume an ordered stream of tokens and produce another. A feed-forward network processes each input independently, but language is fundamentally &lt;strong>sequential&lt;/strong>: the meaning of &amp;ldquo;mat&amp;rdquo; in &lt;em>the cat sat on the mat&lt;/em> depends on every word that came before. Recurrent Neural Networks (RNNs) handle this by maintaining a &lt;strong>hidden state&lt;/strong> that evolves as they process each token. The hidden state is the network&amp;rsquo;s running summary of the past — its memory.&lt;/p></description></item><item><title>Time Series Forecasting (2): LSTM — Gate Mechanisms and Long-Term Dependencies</title><link>https://www.chenk.top/en/time-series/lstm/</link><pubDate>Mon, 16 Sep 2024 09:00:00 +0000</pubDate><guid>https://www.chenk.top/en/time-series/lstm/</guid><description>&lt;p>The first RNN I ever trained, back in 2017, was a small sales forecaster: 50 days in, the next day out. The forward pass ran cleanly, the loss went down, and yet the model had near-total amnesia about anything older than three days. The data had a clear monthly cycle. The model couldn&amp;rsquo;t see it. 
I assumed I needed more data and a bigger model, so I added rows and layers, then watched the training loss jump to NaN halfway through epoch two.&lt;/p></description></item></channel></rss>