Data Science Handbook Cover
In Progress

Data Science Handbook

A Complete Guide to Machine Learning, Optimization and AI — Mathematical Foundations & Practical Implementations

A comprehensive guide covering the mathematical foundations and practical implementations of machine learning, optimization, and artificial intelligence. From fundamental concepts to advanced techniques, this handbook provides both theoretical depth and real-world applications.

In Progress

For

Data scientists, ML engineers, researchers, students, quants, math enthusiasts and anyone interested in the mathematical foundations of machine learning, optimization, and artificial intelligence.

Table of Contents

Part I: Statistics 101

11 chapters
1

Types of Data

Complete guide to data classification - quantitative, qualitative, discrete & continuous

2

Descriptive Statistics

Complete guide to summarizing and understanding data with measures of central tendency, variability, and distribution shape

3

Probability Basics

Foundation of statistical reasoning covering random variables, probability distributions, expected value, variance, and conditional probability

4

Central Limit Theorem

Foundation of statistical inference covering convergence behavior, sample size requirements, and practical applications in data science

5

Data Sampling

Complete guide to sampling theory and methods covering simple random sampling, stratified sampling, cluster sampling, sampling error, and uncertainty quantification

6

Variable Relationships

Complete guide to covariance, correlation, and regression analysis covering how to measure, model, and interpret variable associations

7

Probability Distributions

Complete guide to normal, t-distribution, binomial, Poisson, exponential, and log-normal distributions with practical applications

8

Data Visualization

Complete guide to histograms, box plots, and scatter plots for exploratory data analysis

9

Data Quality

Complete guide to data quality and outliers covering measurement error, bias, missing data, and imputation

10

Statistical Inference

Complete guide to drawing conclusions from data covering point and interval estimation, confidence intervals, hypothesis testing, and p-values

11

Statistical Modelling

Complete guide to building and evaluating predictive models covering model fit metrics, bias-variance tradeoff, and cross-validation

Part II: Foundations

6 chapters

Part III: Regression Models

12 chapters

Part IV: Tree-Based Models

7 chapters

Part V: Explainability

5 chapters

Part VI: Unsupervised Learning

4 chapters
42

K-means Clustering

Partitioning data into k clusters using centroid-based approach

43

DBSCAN (Density-Based Spatial Clustering)

Density-based clustering for arbitrary shaped clusters

44

HDBSCAN (Hierarchical DBSCAN)

Coming Soon

Hierarchical density-based clustering with varying density

45

Hierarchical Clustering

Coming Soon

Tree-based clustering with agglomerative or divisive methods

Part VII: Time Series

5 chapters
46

ETS (Exponential Smoothing)

Coming Soon

Classical time series forecasting with trend and seasonality

47

SARIMA (Seasonal ARIMA)

Coming Soon

Autoregressive integrated moving average with seasonal components

48

Prophet

Coming Soon

Facebook's forecasting tool for business time series with holidays

49

N-BEATS

Coming Soon

Neural basis expansion analysis for interpretable time series forecasting

50

N-HiTS

Coming Soon

Neural hierarchical interpolation for time series forecasting

Part VIII: Optimization

5 chapters
51

CP-SAT Rostering

Coming Soon

Constraint programming for employee scheduling and rostering

52

MILP Factory

Coming Soon

Mixed integer linear programming for production planning

53

Min Cost Flow Slotting

Coming Soon

Network flow optimization for resource allocation

54

VRPTW Routing

Coming Soon

Vehicle routing problem with time windows for logistics

55

QP Portfolio

Coming Soon

Quadratic programming for portfolio optimization and risk management

Coming Soon

This comprehensive handbook is currently in development. Each chapter will be published as it's completed, with mathematical foundations, practical examples, and real-world applications.

Reference

BIBTEXAcademic
@book{datasciencehandbook, author = {Michael Brenndoerfer}, title = {Data Science Handbook}, year = {2025}, url = {https://mbrenndoerfer.com/books/data-science-handbook}, publisher = {mbrenndoerfer.com}, note = {Accessed: 2025-11-12} }
APAAcademic
Michael Brenndoerfer (2025). Data Science Handbook. Retrieved from https://mbrenndoerfer.com/books/data-science-handbook
MLAAcademic
Michael Brenndoerfer. "Data Science Handbook." 2025. Web. 11/12/2025. <https://mbrenndoerfer.com/books/data-science-handbook>.
CHICAGOAcademic
Michael Brenndoerfer. "Data Science Handbook." Accessed 11/12/2025. https://mbrenndoerfer.com/books/data-science-handbook.
HARVARDAcademic
Michael Brenndoerfer (2025) 'Data Science Handbook'. Available at: https://mbrenndoerfer.com/books/data-science-handbook (Accessed: 11/12/2025).
SimpleBasic
Michael Brenndoerfer (2025). Data Science Handbook. https://mbrenndoerfer.com/books/data-science-handbook

Stay Updated

Get notified when new chapters are published.

Stay updated

Get notified when I publish new articles on data and AI, private equity, technology, and more.