Skip to content
@DSML-book

DSML-book

Data Science and Machine Learning: Mathematical and Statistical Methods

Welcome to the GitHub site for:

Data Science and Machine Learning: Mathematical and Statistical Methods
by Dirk P. Kroese, Zdravko I. Botev, Thomas Taimre, and Radislav Vaisman

The purpose of this book is to provide an accessible, yet comprehensive textbook intended for students interested in gaining a better understanding of the mathematics and statistics that underpin the rich variety of ideas and machine learning algorithms in data science.

Key Features:

  • Focuses on mathematical understanding.
  • Presentation is self-contained, accessible, and comprehensive.
  • Extensive list of exercises and worked-out examples.
  • Many concrete algorithms with Python code.
  • Full color throughout.

📢 Good news!

  • The first edition of this book is available free of charge for educational purposes or self-study. You are welcome to download the PDF, provided that you give due acknowledgement to the source.
  • The second edition is out now (20 November, 2025)! Click here for details and updates

Citation

In bibtex, please cite the first edition as follows:

@book{kroese2019dsml,
  title     = {Data Science and Machine Learning: Mathematical and Statistical Methods},
  author    = {D.P. Kroese and Z.I. Botev and T. Taimre and R. Vaisman},
  year      = {2019},
  publisher = {Chapman & Hall/CRC},
  address = {Boca Raton}
}

and the second edition as:

@book{botev2025dsml,
  title     = {Data Science and Machine Learning: Mathematical and Statistical Methods},
  author    = {Z.I. Botev and D.P. Kroese and T. Taimre},
  edition   = {Second},
  year      = {2025},
  publisher = {Chapman & Hall/CRC},
  address = {Boca Raton}
}

Repositories

💻 Programs — Python code for the first edition

💻 Programs2e — Python code for the second edition

🎓 Lectures - Lecture slides for the first edition

PDFs

📖 Book - Complete first edition

🐞 Errata - Corrections for the first edition

✅ Solutions - Solutions for odd-numbered questions of the first edition


🏠 Home Page - Additional resources



How to Download a Repository

1. Install Git

If you don’t have Git installed, download and install it from: https://git-scm.com/downloads

  • Windows: During installation, select "Git Bash" when prompted. This will allow you to use Git commands easily.
  • macOS: You can install Git via Homebrew with:
    brew install git
  • Linux: Use your package manager, for example:
    sudo apt-get install git

2. Open Git Bash (Windows) or Terminal (macOS/Linux)

After installation:

  • On Windows, search for Git Bash in the Start menu and open it.
  • On macOS/Linux, open your Terminal.

3. Clone the Repository

In the terminal or Git Bash, run:

git clone https://github.com/DSML-book/Programs2e.git

This will create a folder named Programs2e with all the files for the second edition situated in the Programs2e repository.


Alternative: Download ZIP File

If you don’t want to use Git, you can also download the ZIP file:

Popular repositories Loading

  1. Programs Programs Public

    Programs

    Python 116 79

  2. Odd-numbered-Solutions Odd-numbered-Solutions Public

    Solutions manual with odd-numbered solutions

    26 14

  3. Lectures Lectures Public

    21 18

  4. Errata Errata Public

    11 5

  5. Programs2e Programs2e Public

    Programs for the 2nd edition

    Python

  6. .github .github Public

Repositories

Showing 6 of 6 repositories

Top languages

Loading…

Most used topics

Loading…