Add reward function for SynLogic dataset #123

LiqunMa · 2025-08-14T16:46:39Z

Checklist Before Starting

Search for similar PR(s).

What does this PR do?

Add one-line overview of what this PR aims to achieve or accomplish.

Add reward functions for SynLogic dataset.

High-Level Design

Demonstrate the high-level design if this PR is complex.

None

Specific Changes

List the specific changes.

Add files at verl/utils/reward_score

API

Demonstrate how the API changes if any.

None

Usage Example

Provide usage example(s) for easier usage.

# bash scripts/train/example_singlenode_rl_qwen7b_synlogic.sh

Test

For changes that can not be tested by CI (e.g., algorithm implementation, new model support), validate by experiment(s) and show results like training curve plots, evaluatuion results, etc.

Additional Info.

Issue Number: Fixes issue # or discussion # if any. None
Training: [Note which backend this PR will affect: FSDP, Megatron, both, or none] None
Inference: [Note which backend this PR will affect: vLLM, SGLang, both, or none] None

Checklist Before Submitting

[Y] Read the Contribute Guide.
[Y] Apply pre-commit checks.
[Y] Add [BREAKING] to the PR title if it breaks any API.
[Y] Update the documentation about your changes in the docs.
[Y] New CI unit test(s) are added to cover the code path.
[Y] Rely on existing unit tests on CI that covers the code path.

Jianshu-She and others added 6 commits July 25, 2025 12:23

[data] Add IFBench dataset (#113)

bf5c3dc

add synlogic

3a830ea

del

fe2e413

add all synlogic verifier

9bcf1d4

add train scripts

de34d41

del the code for debug

b0b6347

LiqunMa requested a review from haonan-li August 14, 2025 16:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add reward function for SynLogic dataset #123

Add reward function for SynLogic dataset #123

Uh oh!

LiqunMa commented Aug 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add reward function for SynLogic dataset #123

Are you sure you want to change the base?

Add reward function for SynLogic dataset #123

Uh oh!

Conversation

LiqunMa commented Aug 14, 2025

Checklist Before Starting

What does this PR do?

High-Level Design

Specific Changes

API

Usage Example

Test

Additional Info.

Checklist Before Submitting

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants