Tags

Text Preprocessing

Oct 1, 2025 NLP 34 min read

NLP (1): Introduction and Text Preprocessing

A first-principles introduction to NLP and text preprocessing. We trace the four eras of the field, build the cleaning to vectorization pipeline by hand, and unpack the math behind tokenization, TF-IDF, n-grams, and …