
History of Language AI
How We Taught Machines to Read, Write, and Reason Through a Hundred Years of Discovery
A journey through the history of language AI, from the early days of information theory to modern large language models. Discover the key breakthroughs, influential figures, and technological advances that shaped how machines understand and generate human language.
For
Historians, researchers, students, AI enthusiasts, and anyone interested in understanding how language AI evolved from theoretical concepts to the transformative technology of today.
Table of Contents
Part I: Signals & Symbols
16 chapters
Shannon's N-gram Model (1948)
Claude Shannon's foundational work on information theory that introduced n-gram models, laying the groundwork for statistical language processing
The Turing Test (1950)
Alan Turing's foundational challenge for Language AI: Can a machine engage in conversations indistinguishable from those of a human?
Georgetown-IBM Machine Translation Demo (1954)
The first public demonstration of machine translation, where an IBM system automatically translated Russian sentences into English, spurring early interest in computational language processing
The Perceptron (1957)
Frank Rosenblatt's revolutionary perceptron algorithm—the first artificial neural network that could learn to classify patterns, establishing the foundation for modern deep learning
Chomsky's Syntactic Structures (1957)
Noam Chomsky's generative grammar introduced formal models of syntax, revolutionizing linguistic theory and establishing computational approaches to understanding language structure
MADALINE Neural Networks (1962)
Bernard Widrow and Marcian Hoff's MADALINE demonstrated how multiple adaptive linear elements could solve practical engineering problems in signal processing and pattern recognition
ELIZA (1966)
Joseph Weizenbaum's groundbreaking chatbot that simulated a Rogerian psychotherapist using pattern matching, marking one of the first practical attempts at the Turing Test
Viterbi Algorithm (1967)
Andrew Viterbi's dynamic-programming decoding algorithm, originally developed for convolutional codes and later adopted as the standard decoder for HMMs, became foundational for speech recognition and part-of-speech tagging
SHRDLU (1968)
Terry Winograd's revolutionary system that demonstrated genuine language understanding through action in a simulated blocks world
Vector Space Model & TF-IDF (1968)
Gerard Salton's foundational work on statistical information retrieval using vector representations and term frequency-inverse document frequency weighting, laying foundations for distributional semantics and modern search
Conceptual Dependency Theory (1969)
Roger Schank's semantic representation using primitive actions to capture sentence meaning independent of syntax
The Transition to Statistical Methods (1970s)
How NLP began moving from hand-crafted rules toward statistical methods, with the rise of corpus linguistics and early statistical language models
Hidden Markov Models (1970s)
How HMMs revolutionized speech recognition through probabilistic modeling of hidden states and observable outputs, establishing data-driven approaches in NLP
Augmented Transition Networks (1970)
William Woods's procedural parsing formalism that extended finite-state machines with registers, recursion, and actions, enabling natural language parsing with integrated syntactic and semantic processing
Montague Semantics (1973)
Richard Montague's formal semantics bridged logic and natural language, establishing compositional approaches to meaning that influenced computational semantics
Chinese Room Argument (1980)
John Searle's famous thought experiment challenged the notion that syntactic symbol-manipulation alone could yield true understanding, shaping debates about meaning and machine intelligence
Part II: The Statistical Turn
15 chapters
Lesk Algorithm (1986)
Michael Lesk's word sense disambiguation algorithm used dictionary definition overlaps to resolve ambiguous word meanings, establishing early approaches to semantic disambiguation
Backpropagation (1986)
Rumelhart, Hinton, and Williams' backpropagation algorithm solved the credit assignment problem, enabling training of deep neural networks and modern language AI
Katz Back-off (1987)
Slava Katz's elegant solution to handling unseen word sequences by backing off to shorter n-grams, making statistical language modeling practical for real-world applications
Time Delay Neural Networks (1987)
Alex Waibel's TDNN introduced weight sharing across time and temporal convolutions, revolutionizing sequential data processing and laying the groundwork for modern CNNs and RNNs
Convolutional Neural Networks (1988)
Yann LeCun's CNN revolutionized feature learning with automatic pattern detection, translation invariance, and parameter sharing, establishing principles that would later transform language AI through text CNNs and attention mechanisms
IBM Statistical Machine Translation (1991)
IBM researchers revolutionized translation by introducing statistical approaches that learned from parallel text data, establishing data-driven learning, word alignment, and probabilistic modeling that transformed all of NLP
Penn Treebank (1993)
The full Penn Treebank release provided large-scale syntactic annotations that became the standard benchmark for parsing, enabling data-driven approaches to dominate syntactic analysis
BM25 (1994)
The Okapi BM25 probabilistic retrieval scoring function became the gold standard for information retrieval and remains a crucial baseline in modern RAG systems
WordNet (1995)
Princeton's WordNet represented words as an interconnected semantic network of synsets and relationships, establishing that meaning is relational and influencing everything from word sense disambiguation to modern embeddings
Recurrent Neural Networks (1995)
Recurrent neural networks maintained memory across time steps through recurrent connections, enabling speech recognition and language modeling and establishing the sequential processing paradigm that would lead to LSTMs and transformers
Maximum Entropy & SVMs in NLP (1996)
Feature-based discriminative models including Maximum Entropy and Support Vector Machines became dominant for NER, POS tagging, and parsing, establishing supervised learning as the standard approach
Long Short-Term Memory (1997)
Hochreiter and Schmidhuber solved the vanishing gradient problem with LSTMs, introducing gated memory mechanisms that could selectively remember and forget information, enabling practical sequence modeling and establishing principles that would influence all future architectures
Statistical Parsers (1997)
Collins and Charniak's head-driven statistical parsers marked the end of purely rule-based dominance in syntactic analysis, demonstrating that data-driven methods could achieve superior accuracy
FrameNet (1998)
The FrameNet project introduced frame semantics resources that expanded beyond WordNet's synsets, capturing richer semantic relationships and event structures in language
LSA & Topic Models (1999)
Latent Semantic Analysis, PMI-based methods, and later LDA (2003) introduced distributional and topic-based semantics, establishing unsupervised approaches to meaning before neural embeddings
Part III: Structured Learning & Benchmarks
8 chapters
Conditional Random Fields (2001)
Lafferty and colleagues introduced CRFs, revolutionizing structured prediction by modeling entire sequences jointly through conditional probability and feature functions, establishing that outputs are interdependent and should be predicted together rather than independently
BLEU Metric (2002)
IBM researchers introduced BLEU, revolutionizing machine translation evaluation by providing the first widely adopted automatic metric that correlated with human judgments, enabling rapid iteration and establishing automatic evaluation as fundamental to language AI development
Phrase-based SMT & MERT (2003)
Phrase-based statistical machine translation extended IBM word-based models to phrase-level learning, capturing idioms and collocations, while Minimum Error Rate Training optimized feature weights to directly maximize BLEU scores, establishing the dominant statistical MT paradigm
Neural Probabilistic Language Model (2003)
Bengio et al.'s first neural LM learned distributed word representations, foreshadowing modern embeddings and deep NLP
Latent Dirichlet Allocation (2003)
Blei, Ng, and Jordan's probabilistic topic model enabled unsupervised discovery of thematic structure in large document collections
ROUGE & METEOR (2004)
ROUGE and METEOR automatic evaluation metrics expanded beyond BLEU to better assess summarization and capture semantic similarity in MT evaluation
PropBank (2005)
Added semantic role labels to the Penn Treebank, enabling statistical systems to learn 'who did what to whom'
Freebase (2007)
Freebase launched as a collaborative knowledge base, providing structured data that would later feed retrieval and grounding systems for language models
Part IV: Deep Learning Arrives
15 chapters
IBM Watson on Jeopardy! (2011)
IBM's Watson question-answering system defeated top human champions on the quiz show Jeopardy!, showcasing that AI could comprehend and answer natural-language questions at a human-expert level
Deep Learning for Speech Recognition (2012)
Geoffrey Hinton and colleagues applied deep neural networks to speech recognition, significantly outperforming the then-dominant GMM-HMM acoustic models and dramatically reducing transcription error rates
Wikidata (2012)
Wikidata emerged as a comprehensive collaborative knowledge base, becoming a crucial resource for grounding language models and enabling structured knowledge access
Word2Vec (2013)
Mikolov's word2vec introduced efficient distributional word embeddings trained on large corpora, establishing vector similarity and the modern era of neural NLP representations
GloVe & Adam Optimizer (2014)
GloVe combined global co-occurrence statistics with local context, while the Adam optimizer enabled stable training of neural networks, both becoming foundational tools
Seq2Seq for MT (2014)
Sutskever's sequence-to-sequence encoder-decoder framework revolutionized neural machine translation and established the template for text generation tasks
Memory Networks (2014)
Weston et al. introduced neural models with an explicit external memory for QA, prefiguring retrieval-augmented methods
Attention Mechanism (2015)
Bahdanau's attention mechanism introduced differentiable alignment in neural MT, enabling models to focus on relevant parts of input and dramatically improving translation quality
Residual Connections (2015)
ResNet's residual connections from computer vision became standard in deep NLP architectures, enabling training of much deeper networks without degradation
Layer Normalization (2016)
Ba et al.'s layer normalization stabilized training of recurrent and deep networks, becoming a crucial component in transformer and modern LLM architectures
Subword Tokenization & FastText (2016)
Byte Pair Encoding (BPE) enabled open-vocabulary modeling, while FastText provided robust word vectors with subword information, solving out-of-vocabulary problems
SQuAD (2016)
The Stanford Question Answering Dataset established reading comprehension as a flagship benchmark, driving research in language understanding and spawning many QA variants
Neural Information Retrieval
Neural information retrieval learned semantic representations of queries and documents, enabling meaning-based matching beyond keyword overlap and transforming search systems with dual encoder architectures and dense retrieval methods
Google Neural Machine Translation (2016)
Google Translate switched from phrase-based methods to a neural machine translation system, an end-to-end LSTM-based encoder-decoder that produced far more fluent, natural translations than previous statistical models
WaveNet (2016)
DeepMind's WaveNet model generated raw audio waveforms for text-to-speech, producing remarkably natural-sounding speech and outperforming prior synthesis systems by modeling audio directly with a neural network
Part V: Transformers & Pretraining
11 chapters
Transformer Architecture (2017)
Vaswani et al.'s 'Attention Is All You Need' introduced the transformer, replacing recurrence with self-attention and establishing the architecture that would dominate all of NLP
RLHF Foundations (2017)
Christiano et al.'s work on learning from human preferences established foundations for reinforcement learning from human feedback, later crucial for aligning language models
ELMo & ULMFiT (2018)
Context-sensitive embeddings from ELMo and transfer learning from ULMFiT demonstrated that pretraining on large corpora dramatically improved downstream tasks, launching the transfer learning era
BERT (2018)
Devlin et al.'s BERT with masked language modeling and bidirectional pretraining revolutionized NLP, producing overnight jumps on leaderboards across a wide range of benchmarks
GPT-1 & GPT-2 (2018)
OpenAI's GPT models demonstrated that autoregressive pretraining could produce powerful generative models, with GPT-2 showing surprising zero-shot capabilities
GLUE & SuperGLUE (2018)
The General Language Understanding Evaluation benchmarks established standardized multi-task evaluation, enabling systematic comparison of language understanding systems
XLNet, RoBERTa, ALBERT (2019)
Refinements to BERT including permutation language modeling (XLNet), optimized training (RoBERTa), and parameter efficiency (ALBERT) pushed pretraining performance further
XLM (2019)
Cross-lingual pretraining with translation language modeling enabled strong zero-/few-shot transfer across languages
T5 & Text-to-Text Framework (2019)
Google's T5 unified all NLP tasks as text-to-text transformations, simplifying model architecture and training while achieving strong performance across diverse tasks
Transformer-XL (2019)
Transformer-XL introduced segment-level recurrence and relative positional encodings, enabling transformers to process longer sequences more effectively
BERT for IR (2019)
BERT-based cross-encoder re-rankers revolutionized information retrieval, dramatically improving ranking quality and establishing neural reranking as standard practice
Part VI: Scaling & Retrieval
3 chapters
Scaling Laws (2020)
Kaplan et al. discovered power-law scaling relationships between model size, data, compute, and performance, enabling prediction of model capabilities and optimal resource allocation
GPT-3 & In-Context Learning (2020)
GPT-3, with 175 billion parameters, demonstrated emergent few-shot learning, showing that sufficiently large models could perform tasks from a handful of examples without fine-tuning
Dense Passage Retrieval & RAG (2020)
DPR, REALM, and RAG established dense retrieval and retrieval-augmented generation, combining neural search with language generation for grounded, knowledge-intensive tasks
Part VII: Multimodal & Instruction Era
20 chapters
Mixture-of-Experts at Scale (2021)
GShard and Switch Transformer demonstrated that sparse mixture-of-experts architectures could scale to trillions of parameters with efficient computation through conditional routing
CLIP (2021)
OpenAI's CLIP trained vision and language encoders jointly on image-text pairs, enabling zero-shot image classification and launching the multimodal foundation model era
Codex (2021)
OpenAI's Codex demonstrated that language models fine-tuned on code could generate functional programs from natural language descriptions, powering GitHub Copilot
Instruction Tuning (2021)
Fine-tuning technique that trained language models to follow explicit natural language instructions, enabling zero-shot generalization and making models practical for real-world use
Multi-Vector Retrievers (2021)
Token-level contextualized matching systems like ColBERT that encoded queries and documents as collections of token vectors, enabling fine-grained matching that combined semantic understanding with lexical precision
The Pile (2021)
EleutherAI's diverse 825GB training dataset became a crucial open resource for training large language models, democratizing access to high-quality pretraining data
DALL·E (2021)
OpenAI's DALL·E, the first large text-to-image transformer, generated novel, coherent images directly from text prompts
Foundation Models Report (2021)
Stanford's Center for Research on Foundation Models (CRFM) formalized the term 'foundation models' and framed their opportunities and risks, shaping discourse and research agendas
InstructGPT & RLHF (2022)
InstructGPT applied reinforcement learning from human feedback at scale, aligning GPT-3 with human preferences and establishing RLHF as the standard alignment approach
Chinchilla Scaling Laws (2022)
DeepMind's Chinchilla showed that models should be trained on far more data than previously thought, establishing that compute-optimal training requires balanced scaling of parameters and data
HELM (2022)
Stanford's Holistic Evaluation of Language Models framework assessed models across accuracy, robustness, bias, toxicity, and efficiency, establishing comprehensive evaluation standards
Chain-of-Thought Prompting (2022)
Wei et al. showed that prompting models to generate reasoning steps dramatically improved performance on complex tasks, establishing prompting as a crucial capability
ChatGPT (2022)
OpenAI's ChatGPT, a conversational AI interface built on GPT-3.5, was released to the public and quickly gained millions of users, demonstrating the practicality and widespread appeal of large language model chatbots in everyday tasks
BLOOM (2022)
The BigScience collaboration released BLOOM, a 176-billion-parameter open-access multilingual language model, marking the first time a model of that scale was made openly available to researchers and the public as an alternative to proprietary LLMs
PaLM (2022)
Google's 540B Pathways model demonstrated powerful few-shot reasoning, multilinguality, and code abilities at unprecedented scale
Flamingo (2022)
DeepMind's few-shot vision-language model used gated cross-attention to set state-of-the-art results across many image-text tasks without task-specific fine-tuning
DALL·E 2 (2022)
A diffusion decoder conditioned on CLIP image embeddings delivered high-quality text-to-image synthesis with editing (inpainting) and variations
Stable Diffusion (2022)
Open-source latent diffusion democratized text-to-image generation on consumer GPUs
Whisper (2022)
Large-scale, multilingual ASR trained on ~680k hours delivered robust transcription and speech-to-text translation across 90+ languages
FlashAttention (2022)
IO-aware exact attention made long-context training/inference far faster and more memory-efficient
Part VIII: Open Models & Alignment
8 chapters
LLaMA (2023)
Meta's LLaMA family of efficient open models democratized large language model research, enabling academic and small-scale experimentation with state-of-the-art architectures
Open LLM Wave (2023)
MPT, Falcon, Mistral, and other open models created a competitive ecosystem of high-quality base models, accelerating innovation and reducing dependence on proprietary systems
QLoRA (2023)
QLoRA enabled efficient fine-tuning of quantized models using 4-bit precision, making it possible to adapt large language models on consumer GPUs with limited memory
Function Calling & Tool Use (2023)
Models gained ability to reliably call functions and APIs with structured outputs, enabling practical agent systems that interact with external tools and environments
Multimodal LLMs (2023)
GPT-4V, LLaVA, and other vision-language models unified text and image understanding, enabling models to reason about and generate descriptions of visual content
Constitutional AI (2023)
Anthropic's Constitutional AI systematized safety training through principle-based self-critique, offering an alternative approach to alignment beyond pure preference learning
BIG-bench & MMLU (2023)
Expanded evaluation suites tested broader reasoning, knowledge, and specialized capabilities, revealing strengths and limitations across diverse domains
GPT-4 (2023)
Multimodal LLM with markedly improved reliability and reasoning, achieving top-percentile performance on professional and academic exams
Part IX: Agents, Long Context & Real-Time AI
14 chapters
Mixtral & Sparse MoE (2024)
Mistral's Mixtral family demonstrated that sparse mixture-of-experts models could achieve better quality per unit of compute through efficient expert routing
Long Context at Scale (2024)
Models supporting 1M+ token contexts emerged, with techniques combining extended attention mechanisms, recursive retrieval, and efficient memory management
Structured Outputs (2024)
JSON mode and constrained decoding became standard features, ensuring models generate valid structured data for reliable integration with production systems
Hybrid Retrieval (2024)
Hybrid systems combined sparse retrieval for fast candidate generation with dense retrieval for semantic reranking, leveraging the complementary strengths of both paradigms
PEFT Beyond LoRA (2024)
Advanced parameter-efficient fine-tuning methods including AdaLoRA, DoRA, VeRA, and other innovations extended LoRA with adaptive rank allocation, magnitude-direction decomposition, and parameter sharing for improved efficiency and performance
Continuous Post-Training (2025)
Incremental model updates using parameter-efficient fine-tuning and continual learning techniques, enabling models to stay current and adapt continuously without expensive full retraining
Mixture of Experts at Scale (2024)
Major advances in MoE architectures enabled efficient scaling of model capacity by dynamically routing inputs to specialized expert subnetworks
Agentic AI Systems (2024)
AI systems gained the ability to act autonomously, plan multi-step tasks, and use tools to achieve complex goals without human intervention
Multimodal Integration (2024)
Breakthroughs in processing and understanding text, images, audio, and video within unified model architectures
DeepSeek R1 (2025)
DeepSeek's open-weights reasoning model achieved competitive performance on complex logical and mathematical tasks despite hardware constraints
GPT-4o (2024)
OpenAI's natively multimodal model unified real-time speech, vision, text, and memory, enabling near-human latency and expressiveness in AI interactions
V-JEPA 2 (2025)
Meta's vision-based joint embedding predictive architectures moved toward embodied, world-modeling AI that learns through interaction and prediction
AI Co-Scientist Systems (2025)
Autonomous AI systems capable of independent hypothesis generation, experimental design, and scientific discovery
Specialized LLMs for Low-Resource Languages (2025)
Advanced training pipelines brought African, Indigenous, and regional languages closer to English-level performance, expanding digital inclusion for billions of speakers
Stay Updated
Get notified when new chapters are published.