Skip to content

Replace duplicated hyperparams in SDPOLossInput with TrainingConfig

8530a65
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Merged

Fix PEFT base-model contamination and tune SDPO defaults #37

Replace duplicated hyperparams in SDPOLossInput with TrainingConfig
8530a65
Select commit
Loading
Failed to load commit list.
lint-and-test
succeeded Feb 24, 2026 in 57s