
Machine Learning from Scratch
A Complete Guide to Machine Learning, Optimization and AI: Mathematical Foundations and Practical Implementations

For data scientists, ML engineers, AI engineers, researchers, students, quants, and anyone serious about understanding machine learning at a fundamental level.
About This Book
What separates a data scientist who truly understands their craft from one who merely applies black-box tools? The answer lies in mastering the mathematics and intuition behind every algorithm. This comprehensive handbook bridges the gap between theoretical foundations and production-ready implementations, giving you the deep understanding that transforms good practitioners into exceptional ones.
From the elegant simplicity of linear regression to the sophisticated power of gradient boosting and neural networks, every concept is built from first principles. You won't just learn how to use scikit-learn. You'll understand exactly what happens under the hood when you call fit() and predict(). Each algorithm is derived mathematically, explained intuitively, and implemented in clean, production-quality Python code.
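To make that concrete, here is a minimal sketch, illustrative rather than excerpted from the book, of what a from-scratch estimator can look like: an ordinary least squares linear regressor exposing the familiar fit() and predict() methods, solved via the normal equations. The class name and the synthetic data are assumptions for this example.

import numpy as np

class LinearRegressionScratch:
    """Ordinary least squares: solve X^T X w = X^T y for the weights w."""

    def fit(self, X, y):
        X = np.asarray(X, dtype=float)
        y = np.asarray(y, dtype=float)
        # Prepend a column of ones so the first weight acts as the intercept.
        Xb = np.hstack([np.ones((X.shape[0], 1)), X])
        # Least-squares solve; more numerically stable than an explicit inverse.
        self.weights_, *_ = np.linalg.lstsq(Xb, y, rcond=None)
        return self

    def predict(self, X):
        X = np.asarray(X, dtype=float)
        Xb = np.hstack([np.ones((X.shape[0], 1)), X])
        return Xb @ self.weights_

# Usage: recover y = 1 + 2x from noisy synthetic data.
rng = np.random.default_rng(0)
X = rng.uniform(0, 10, size=(100, 1))
y = 1.0 + 2.0 * X[:, 0] + rng.normal(0, 0.5, size=100)
model = LinearRegressionScratch().fit(X, y)
print(model.weights_)  # approximately [1.0, 2.0]

Solving with np.linalg.lstsq rather than explicitly inverting X^T X is the numerically stable choice, and it still makes every step of the estimation visible.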
Table of Contents
Part I: Statistics 101
11 chapters
Types of Data
Complete guide to data classification: quantitative, qualitative, discrete, and continuous
Descriptive Statistics
Complete guide to summarizing and understanding data with measures of central tendency, variability, and distribution shape
Probability Basics
Foundation of statistical reasoning covering random variables, probability distributions, expected value, variance, and conditional probability
Central Limit Theorem
Foundation of statistical inference covering convergence behavior, sample size requirements, and practical applications in data science
Data Sampling
Complete guide to sampling theory and methods covering simple random sampling, stratified sampling, cluster sampling, sampling error, and uncertainty quantification
Variable Relationships
Complete guide to covariance, correlation, and regression analysis covering how to measure, model, and interpret variable associations
Probability Distributions
Complete guide to the normal, t, binomial, Poisson, exponential, and log-normal distributions with practical applications
Data Visualization
Complete guide to histograms, box plots, and scatter plots for exploratory data analysis
Data Quality
Complete guide to data quality and outliers covering measurement error, bias, missing data, and imputation
Statistical Inference
Complete guide to drawing conclusions from data covering point and interval estimation, confidence intervals, hypothesis testing, and p-values
Statistical Modeling
Complete guide to building and evaluating predictive models covering model fit metrics, bias-variance tradeoff, and cross-validation
Part II: Foundations
6 chapters
Sum of Squared Errors (SSE)
The fundamental metric for measuring regression model performance and prediction accuracy
R-squared
Understanding the coefficient of determination and model fit metrics
Standardization
Scaling features to zero mean and unit variance for fair comparison in machine learning algorithms
Normalization
Min-max scaling to transform features to a common [0, 1] range for neural networks and distance-based algorithms
Gauss-Markov Assumptions
Foundation of linear regression and OLS estimation covering linearity, independence, homoscedasticity, normality, and practical testing methods
Multicollinearity
Understanding the impact of multicollinearity on regression models
Part III: Regression Models
12 chapters
Simple Linear Regression
Mathematical foundations, formulas, and step-by-step implementation
Ordinary Least Squares (OLS)
Vector notation and matrix operations for regression
Multiple Linear Regression
Extending linear regression to multiple predictors
Lasso Regularization (L1 Regularization)
L1 penalty for feature selection and overfitting prevention
Ridge Regularization (L2 Regularization)
L2 penalty for handling multicollinearity and overfitting
Elastic Net Regularization
Combining L1 and L2 penalties for optimal regularization
Polynomial Regression
Modeling non-linear relationships with polynomial features
Generalized Linear Models
Extending linear regression to non-normal response distributions
Logistic Regression
Binary classification using the logistic function
Spline Regression
Flexible non-parametric regression using piecewise polynomials
Poisson Regression
Modeling count data with the Poisson distribution
Multinomial Logistic Regression
Multi-class classification extension of logistic regression
Part IV: Tree-Based Models
7 chapters
CART (Classification and Regression Trees)
Decision trees with greedy splitting algorithms
Random Forest
Ensemble method combining multiple decision trees with bagging
Boosted Trees
Gradient boosting for improved predictive performance
XGBoost
Optimized gradient boosting with advanced regularization techniques
LightGBM
Fast gradient boosting with leaf-wise tree growth
CatBoost
Gradient boosting with categorical feature handling
Isolation Forest
Unsupervised anomaly detection using random trees
Part V: Explainability
5 chapters
SHAP (SHapley Additive exPlanations)
Unified framework for model interpretability
LIME (Local Interpretable Model-agnostic Explanations)
Local model explanations for individual predictions
PCA (Principal Component Analysis)
Dimensionality reduction and feature extraction
UMAP (Uniform Manifold Approximation and Projection)
Non-linear dimensionality reduction preserving local and global structure
t-SNE (t-Distributed Stochastic Neighbor Embedding)
Non-linear visualization technique
Part VI: Unsupervised Learning
4 chapters
K-means Clustering
Partitioning data into k clusters using a centroid-based approach
DBSCAN (Density-Based Spatial Clustering)
Density-based clustering for arbitrarily shaped clusters
HDBSCAN (Hierarchical DBSCAN)
Hierarchical density-based clustering that handles clusters of varying density
Hierarchical Clustering
Tree-based clustering with agglomerative or divisive methods
Part VII: Time Series
5 chapters
ETS (Exponential Smoothing)
Classical time series forecasting with trend and seasonality
SARIMA (Seasonal ARIMA)
Autoregressive integrated moving average with seasonal components
Prophet
Facebook's forecasting tool for business time series with holidays
N-BEATS
Neural basis expansion analysis for interpretable time series forecasting
N-HiTS
Neural hierarchical interpolation for time series forecasting
Part VIII: Optimization
5 chapters
CP-SAT Rostering
Constraint programming for employee scheduling and rostering
MILP Factory
Mixed integer linear programming for production planning
Min Cost Flow Slotting
Network flow optimization for resource allocation
VRPTW Routing
Vehicle routing problem with time windows for logistics
QP Portfolio
Quadratic programming for portfolio optimization and risk management
Stay Updated
Get notified when new chapters are published.