O'Reilly Live Course: Open Source Reasoning Language Models

Repository for the course with all material.

Presentation

The slides contain additional background and theroretical information.

Python setup

uv

If possible, work with uv. Clone the repository and run uv sync. However, there are some challenges:

Most of the notebooks work with pyproject.toml.
Currently, the lastest transformers is not compatible with both vllm and unsloth. I recommend using a different kernel for that.
Some notebooks are specifically suited for MacOS using the mlx-lm package. It is only useful to install that with a Mac.

anaconda

Create an venv or conda environment and install the following packages for the normal notebooks:

accelerate
datasets
flash-attn
ipython
jupyter
kernels
liger-kernel
peft
transformers
triton
trl

flash-attn should be installed with the option --no-build-isolation.

For unsloth and vllm, you can use:

datasets
ipykernel
jupyter
transformers
trl
unsloth
vllm

For MacOS notebooks, the following packages are recommended:

jupyter
mlx-lm

I have not provided a requirements.txt as dependencies tend to get outdated faster that I can update.

runpod

You can also use runpod. uv is already preinstalled there. Of course, the MacOS notebooks won't work there.

Notebooks

You can either try to run the notebooks directly or try to follow how I run them and use it as a documentation (or run it later).

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
11-deepseek-distill-qwen3-8.ipynb		11-deepseek-distill-qwen3-8.ipynb
12-qwen3-8.ipynb		12-qwen3-8.ipynb
13-nanbeige-3.ipynb		13-nanbeige-3.ipynb
14-nanbeige-3-tool.ipynb		14-nanbeige-3-tool.ipynb
15-mimo-7.ipynb		15-mimo-7.ipynb
16-gpt-oss-20.ipynb		16-gpt-oss-20.ipynb
17-qwen3-0.6.ipynb		17-qwen3-0.6.ipynb
2026-02-24-Reasoning.pdf		2026-02-24-Reasoning.pdf
21-qwen3-8-awq-vllm.ipynb		21-qwen3-8-awq-vllm.ipynb
31-mlx-deepseek-qwen3-8b.ipynb		31-mlx-deepseek-qwen3-8b.ipynb
32-mlx-qwen-30-3.ipynb		32-mlx-qwen-30-3.ipynb
33-openrouter.ipynb		33-openrouter.ipynb
41-finetune-numinamath-grpo-trl-qwen-complete.ipynb		41-finetune-numinamath-grpo-trl-qwen-complete.ipynb
41-finetune-numinamath-grpo-trl-qwen.ipynb		41-finetune-numinamath-grpo-trl-qwen.ipynb
42-finetune-gsm8k-grpo-trl.ipynb		42-finetune-gsm8k-grpo-trl.ipynb
42-unsloth-qwen3-4-base-complete.ipynb		42-unsloth-qwen3-4-base-complete.ipynb
42-unsloth-qwen3-4-base.ipynb		42-unsloth-qwen3-4-base.ipynb
README.md		README.md
macos.toml		macos.toml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
unsloth.toml		unsloth.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

O'Reilly Live Course: Open Source Reasoning Language Models

Presentation

Python setup

uv

anaconda

runpod

Notebooks

Running different models with GPUs

Running different models on MacOS

Finetuning with GRPO

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

O'Reilly Live Course: Open Source Reasoning Language Models

Presentation

Python setup

uv

anaconda

runpod

Notebooks

Running different models with GPUs

Running different models on MacOS

Finetuning with GRPO

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages