This repository demonstrates practical implementations of language model alignment techniques, including Supervised Fine-Tuning (SFT) and Odds Ratio Preference Optimization (ORPO). The code shows how to fine-tune Llama-3.2-1B models to follow instructions more reliably and better match human preferences.
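As a rough orientation, ORPO-style training is typically driven by preference pairs (a prompt with a chosen and a rejected response). The sketch below shows what such a run might look like using TRL's `ORPOTrainer`; the dataset name, `beta` value, and other hyperparameters are illustrative assumptions, not this repository's exact configuration.

```python
# Minimal ORPO fine-tuning sketch using TRL.
# Dataset and hyperparameters are illustrative, not this repo's exact setup.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

model_name = "meta-llama/Llama-3.2-1B"  # base model aligned in this repo
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# ORPO expects preference data with "prompt", "chosen", and "rejected" fields.
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")  # assumed dataset

config = ORPOConfig(
    output_dir="orpo-llama-3.2-1b",
    beta=0.1,  # weight of the odds-ratio penalty term (illustrative value)
    per_device_train_batch_size=2,
    num_train_epochs=1,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    processing_class=tokenizer,
)
trainer.train()
```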
For a detailed explanation of the methods, experiments, and findings, check out the accompanying blog post.
Both fine-tuned models are open-sourced on Hugging Face:
- SFTLlama-3.2-1B: https://huggingface.co/KickItLikeShika/SFTLlama-3.2-1B
- ORPOLlama-3.2-1B: https://huggingface.co/KickItLikeShika/ORPOLlama-3.2-1B
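Since both checkpoints are published on the Hugging Face Hub, they can be loaded directly with `transformers`. The snippet below is a quick way to try one of them; the prompt and generation settings are illustrative, and the exact prompt format may differ from what the models were trained on.

```python
# Quick inference with one of the released checkpoints.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "KickItLikeShika/ORPOLlama-3.2-1B"  # or "KickItLikeShika/SFTLlama-3.2-1B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "Explain what preference optimization means in one sentence."
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```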