O'Reilly Live Course: Finetuning Open Source Large Language Models

Repository for the course with all material.

Presentation

The slides contain additional background and theoretical information.

Python setup

uv

If possible, work with uv. Clone the repository and run uv sync.
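The uv route can be sketched in two commands (the repository URL is inferred from the GitHub page; adjust the clone path to your setup):

```shell
# clone the course repository and install all dependencies with uv
git clone https://github.com/datanizing/oreilly-finetuning-llm.git
cd oreilly-finetuning-llm

# creates a .venv and installs the locked dependencies
uv sync
```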

anaconda

Create a venv or conda environment and install the following packages:

  • ipykernel
  • ipython
  • ipywidgets
  • jupyter
  • tqdm
  • transformers
  • sentence-transformers
  • bitsandbytes
  • datasets
  • flash-attn
  • liger-kernel
  • peft
  • trl
  • unsloth

flash-attn should be installed with the option --no-build-isolation.

You can also use the supplied requirements.txt, but some of its dependencies might be outdated.
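The conda route above might look like this; the environment name and Python version are my assumptions, not the course's settings:

```shell
# create and activate a fresh environment (name and Python version assumed)
conda create -n finetuning python=3.11 -y
conda activate finetuning

# install the course packages listed above
pip install ipykernel ipython ipywidgets jupyter tqdm transformers \
    sentence-transformers bitsandbytes datasets liger-kernel peft trl unsloth

# flash-attn needs the --no-build-isolation flag, as noted above
pip install flash-attn --no-build-isolation
```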

runpod

You can also use runpod. uv is already preinstalled there.

Notebooks

You can either run the notebooks directly, or follow along as I run them and use them as documentation (and run them later).

Classification

Similarity (embedding) finetuning

Generative model finetuning

Full finetune of a Qwen SLM with 700 million parameters

LoRA finetune of a Llama model with 1.7 billion parameters

LoRA finetune of a SmolLM model from Hugging Face

LoRA finetune of a Phi 3.5 model with ~4 billion parameters
