Tagged

LSTM

Oct 11, 2025 NLP 8 min read

NLP Part 3: RNN and Sequence Modeling

How RNNs, LSTMs, and GRUs process sequences with memory. We derive vanishing gradients from first principles, build a character-level text generator, and implement a Seq2Seq translator in PyTorch.

Sep 16, 2024 Time Series Forecasting 9 min read

Time Series Forecasting (2): LSTM -- Gate Mechanisms and Long-Term Dependencies

How LSTM's forget, input, and output gates solve the vanishing gradient problem. Complete PyTorch code for time series forecasting with practical tuning tips.