Pull requests: allenai/open-instruct
Add DPO OLMo-core support with MFU improvements
#1440
opened Jan 30, 2026 by
finbarrtimbers
3 tasks
Significantly improves dpo.py performance: ~40% MFU
#1430
opened Jan 27, 2026 by
finbarrtimbers
•
Draft
Add vllm_dtype parameter to create_vllm_engines
#1426
opened Jan 26, 2026 by
finbarrtimbers
[WIP] Add generic RL environment support
#1419
opened Jan 25, 2026 by
hamishivi
4 of 7 tasks
Add GRPOTrainModule subclassing TransformerTrainModule (GRPO olmo-core: PR 2 of 5)
#1412
opened Jan 22, 2026 by
finbarrtimbers
Now, the GPU tests CI action automatically appends the result to prevent it from re-running.
#1409
opened Jan 21, 2026 by
finbarrtimbers
Add optional wandb system metrics logging for generator process
#1403
opened Jan 20, 2026 by
jacob-morrison
•
Draft
Add GRPO main entry point and scripts (GRPO olmo-core: PR 5 of 5)
#1399
opened Jan 20, 2026 by
finbarrtimbers
1 of 3 tasks
Add OLMo-core Ray actor (GRPO olmo-core: PR 4 of 5)
#1398
opened Jan 20, 2026 by
finbarrtimbers
1 of 2 tasks
Add GRPO callbacks for OLMo-core Trainer (GRPO olmo-core: PR 3 of 5)
#1397
opened Jan 20, 2026 by
finbarrtimbers
Use simple-parsing for DPO argument parsing
#1393
opened Jan 20, 2026 by
finbarrtimbers
3 tasks
Refactor DPO config: move fields and remove duplicates
#1392
opened Jan 20, 2026 by
finbarrtimbers
3 tasks
Update dependencies for OLMo-core trainer
#1378
opened Jan 16, 2026 by
finbarrtimbers
2 of 3 tasks
Bumps vllm version to 0.13.0 and Dockerfile to CUDA 12.9.
#1372
opened Jan 15, 2026 by
finbarrtimbers
Adds a new GRPO implementation that uses Olmo-core
#1329
opened Jan 8, 2026 by
finbarrtimbers
•
Draft