Skip to content

Pull requests: allenai/open-instruct

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add DPO OLMo-core support with MFU improvements
#1440 opened Jan 30, 2026 by finbarrtimbers Loading…
3 tasks
Fixes SFT checkpointing
#1435 opened Jan 28, 2026 by finbarrtimbers Loading…
Runs the benchmarks with the hybrid model
#1425 opened Jan 26, 2026 by finbarrtimbers Loading…
[WIP] Add generic RL environment support
#1419 opened Jan 25, 2026 by hamishivi Loading…
4 of 7 tasks
Add tool docs
#1410 opened Jan 21, 2026 by hamishivi Loading…
Validates artifacts
#1407 opened Jan 21, 2026 by finbarrtimbers Loading…
Add GRPO main entry point and scripts (GRPO olmo-core: PR 5 of 5)
#1399 opened Jan 20, 2026 by finbarrtimbers Loading…
1 of 3 tasks
Add OLMo-core Ray actor (GRPO olmo-core: PR 4 of 5)
#1398 opened Jan 20, 2026 by finbarrtimbers Loading…
1 of 2 tasks
Use simple-parsing for DPO argument parsing
#1393 opened Jan 20, 2026 by finbarrtimbers Loading…
3 tasks
Refactor DPO config: move fields and remove duplicates
#1392 opened Jan 20, 2026 by finbarrtimbers Loading…
3 tasks
Add OLMo-core GRPO trainer implementation
#1389 opened Jan 20, 2026 by finbarrtimbers Loading…
Add fp32 LM head option for GRPO
#1387 opened Jan 19, 2026 by natolambert Loading…
3 tasks done
Update dependencies for OLMo-core trainer
#1378 opened Jan 16, 2026 by finbarrtimbers Loading…
2 of 3 tasks
smolzero
#1330 opened Jan 9, 2026 by mnoukhov Draft
ProTip! no:milestone will show everything without a milestone.