WNQzhu / SGDPO Public

Notifications You must be signed in to change notification settings
Fork 0
Star 1

SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment [ACL25-Finding]

1 star 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
code		code
README.md		README.md

Repository files navigation

SGDPO-ACL25

Source code for our paper:

SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment

https://arxiv.org/abs/2505.12435

About

SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment [ACL25-Finding]

Report repository

Releases

No releases published

Packages

No packages published

Languages