We provide instructions to reproduce our results below. As you run the .sh and .ipynb files, fill in your local paths where appropriate.
We run our experiments on Qwen3-4B-Instruct-2507.
We provide a dataset of very difficult problems drawn from several source datasets, mostly selected by filtering for problems that receive zero reward across 64 rollouts.
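The zero-reward filter can be sketched as follows. This is a minimal sketch, not the actual selection script: it assumes correctness for each of a problem's 64 rollouts has already been graded (e.g. with math-verify), and `is_hard_problem` is a hypothetical helper name.

```python
def is_hard_problem(rollout_correctness, n_rollouts=64):
    """Keep a problem only if none of its n_rollouts sampled solutions
    was graded correct, i.e. the problem has zero reward (pass@64 == 0)."""
    assert len(rollout_correctness) == n_rollouts
    return not any(rollout_correctness)
```

For example, a problem where all 64 attempts failed is kept, while one with even a single correct rollout is dropped.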
Install vllm, datasets, transformers, math-verify, and tqdm in a Python 3.10 environment.
Run sample_rollouts.sh then filter_rollouts.ipynb.
It'll sample rollouts, filter for those that are incorrect, and construct a dataset that has the following columns: problem, answer, reference_solution, incorrect_attempt.
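The filtering step can be sketched in pure Python. This is an illustrative sketch, assuming each rollout dict already carries a math-verify correctness flag; the field names (`attempt`, `is_correct`) and the helper name are assumptions, not necessarily what filter_rollouts.ipynb uses.

```python
def build_incorrect_attempt_rows(rollouts):
    """Keep only rollouts graded incorrect and build rows with the four
    dataset columns: problem, answer, reference_solution, incorrect_attempt."""
    rows = []
    for r in rollouts:
        if r["is_correct"]:
            continue  # we only want incorrect attempts
        rows.append({
            "problem": r["problem"],
            "answer": r["answer"],
            "reference_solution": r["reference_solution"],
            "incorrect_attempt": r["attempt"],
        })
    return rows
```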
The end result should look like this.
Run propose_interventions.sh then parse_interventions.ipynb.
It'll ask the base model to (i) verify the incorrect attempts and (ii) propose an intervention, and finally parse the interventions.
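If the model is prompted to wrap its proposed intervention in tags, parsing could look like the sketch below. The `<intervention>...</intervention>` tag convention here is an assumption for illustration, not necessarily the format parse_interventions.ipynb expects.

```python
import re

def parse_intervention(model_output):
    """Extract the proposed intervention from the model's verification
    response; returns None if no tagged intervention is found."""
    m = re.search(r"<intervention>(.*?)</intervention>", model_output, re.DOTALL)
    return m.group(1).strip() if m else None
```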
The end result should look like this.
Run sample_guided_rollouts.sh then filter_guided_rollouts.ipynb.
It'll select for interventions that actually lead to correct outcomes, which will serve as the SFT data.
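The selection step amounts to keeping only interventions whose guided rollout was graded correct. A minimal sketch, with assumed field names (`guided_attempt`, `is_correct`):

```python
def select_effective_interventions(guided_rollouts):
    """Keep only rollouts whose intervention actually led to a correct
    outcome; these (problem, guided attempt) pairs become the SFT data."""
    return [
        {
            "problem": g["problem"],
            "intervention_guided_attempt": g["guided_attempt"],
        }
        for g in guided_rollouts
        if g["is_correct"]
    ]
```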
The end result should look like this.
Perform SFT with Llama Factory. Add the following in dataset_info.json and run sft.sh:
"int_train": {
"hf_hub_url": "https://huggingface.co/datasets/CMU-AIRe/InT-SFT",
"split": "train",
"columns": {
"prompt": "problem",
"response": "intervention_guided_attempt"
}
},
It'll perform SFT on (correct prefix + intervention | problem).
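Conceptually, each SFT pair maps the problem to the attempt's still-correct prefix followed by the proposed intervention. A hypothetical sketch (the helper name and argument names are ours, not from the repo):

```python
def make_sft_example(problem, correct_prefix, intervention):
    """Build one SFT pair: the prompt is the problem, and the target is
    the incorrect attempt's correct prefix followed by the intervention."""
    return {
        "prompt": problem,
        "response": correct_prefix + intervention,
    }
```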
Run online RL on the same deduplicated set of problems. We use Pipeline RL with int.yaml as the config. Add the following to pipelinerl/domains/math/load_datasets.py:
def process_int(dataset, dataset_name):
    for item in dataset:
        yield {
            "dataset": item['source'],
            "task": item['problem'],
            "answer": "\\boxed{" + item['answer'] + "}",
        }
and in the load_datasets function, add:

if "int" in dataset_names:
    dataset = load_dataset("CMU-AIRe/InT-RL", split="train", trust_remote_code=True)
    samples = [s for s in process_int(dataset, "int") if s is not None]
    logger.info(f"Loading InT dataset: {len(samples)} samples")
    datasets += add_ids(samples)
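As a quick sanity check, the process_int generator can be exercised on a made-up item (the generator is repeated here so the snippet runs standalone):

```python
def process_int(dataset, dataset_name):
    # same generator as above, repeated for a self-contained check
    for item in dataset:
        yield {
            "dataset": item["source"],
            "task": item["problem"],
            "answer": "\\boxed{" + item["answer"] + "}",
        }

sample = [{"source": "toy", "problem": "What is 1+1?", "answer": "2"}]
rows = list(process_int(sample, "int"))
print(rows[0]["answer"])  # \boxed{2}
```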
Run eval.sh.
It'll evaluate the model on our hard eval dataset.
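Aggregating per-problem results into an accuracy number is straightforward; a sketch assuming each result dict carries an `is_correct` flag from math-verify grading:

```python
def accuracy(results):
    """Fraction of hard eval problems solved; results is a list of dicts
    with a boolean 'is_correct' per problem."""
    if not results:
        return 0.0
    return sum(r["is_correct"] for r in results) / len(results)
```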
If you run into any problems reproducing our results, please open a GitHub issue!