Skip to content

Add verifiers-based RL training#15

Open
tomoqt wants to merge 1 commit intovanilla_inner_loopingfrom
codex/refactor-training-loop-for-async-generation
Open

Add verifiers-based RL training#15
tomoqt wants to merge 1 commit intovanilla_inner_loopingfrom
codex/refactor-training-loop-for-async-generation

Conversation

@tomoqt
Copy link
Copy Markdown
Owner

@tomoqt tomoqt commented Jun 4, 2025

Summary

  • create train_verifiers_nmr.py training script using the verifiers library
  • document usage in README

Testing

  • python -m py_compile training/train_verifiers_nmr.py

https://chatgpt.com/codex/tasks/task_b_68403b91087083279442f77326e563a3

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant