Train batch generic #724

HosseinKaviani-H · 2026-01-22T23:59:05Z

Summary

Adds TrainBatch dataclass that separates model_inputs from loss_inputs, enabling any training paradigm without type changes.

Motivation

The current TextTrainBatch has limitations:

Hardcoded fields require changes for each new training mode
Text-only naming doesn't support multimodal
Every new paradigm (DPO, distillation, etc.) needs type updates

Solution

@dataclass
class TrainBatch:
    model_inputs: dict[str, Any]
    loss_inputs: dict[str, Any]
    meta: dict[str, Any] = field(default_factory=dict)

# Usage:
logits = model(**batch.model_inputs)
loss = loss_fn(logits, **batch.loss_inputs)

Files Changed
File: src/forge/types.py
Change: Added TrainBatch dataclass
────────────────────────────────────────
File: src/forge/rl/collate.py
Change: Updated to return list[TrainBatch] with model_inputs/loss_inputs
────────────────────────────────────────
File: src/forge/actors/trainer/titan.py
Change: Updated train_step() to accept list[TrainBatch] and unpack fields
────────────────────────────────────────
File: apps/grpo/main.py
Change: Updated to pass batch directly: trainer.train_step.call(batch)

Test Plan

Core implementation: types.py, collate.py, titan.py, main.py
Update test files (tests/sandbox/)
Update documentation (docs/)

src/forge/api/trainer.py

felipemello1

i dont think that this class should be in trainer.py. Probably in types.py or something like that. Are you also going to add it to collate and test it in this PR?

joecummings · 2026-01-23T18:08:33Z

i dont think that this class should be in trainer.py. Probably in types.py or something like that. Are you also going to add it to collate and test it in this PR?

Why wouldn't this be in the trainer.py file under api? It defines the training API of which this is part. I would vote to keep it in the trainer API.

felipemello1 · 2026-01-23T20:01:15Z

Why wouldn't this be in the trainer.py file under api?

this is also used collate_fn. Not sure if it may be used in other places. I think we would be exposed to circular dependencies.

e.g. collate imports from train
train imports from X
X imports from collate

Also, thats what other frameworks do, like tinker: https://github.com/thinking-machines-lab/tinker/blob/ad03d44978096b1dcae662e469293e70f509d5a8/src/tinker/types/datum.py#L25

joecummings · 2026-01-23T20:30:43Z

e.g. collate imports from train
train imports from X
X imports from collate

What would X be here? I will not hold up the PR on this point but am curious b/c I have a hard time imagining what that would be.

felipemello1 · 2026-01-23T21:18:35Z

What would X be here?

I will leave that as an exercise for the reader

jk, i guess it cannot happen if collate is its own file and doesnt really import from anywhere. It just makes more sense to me, given the patterns i have seen. But no big deal either way. Worst case we refactor later.

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Jan 22, 2026

HosseinKaviani-H requested a review from felipemello1 January 23, 2026 00:42

felipemello1 reviewed Jan 23, 2026

View reviewed changes

src/forge/api/trainer.py Outdated Show resolved Hide resolved

felipemello1 reviewed Jan 23, 2026

View reviewed changes

Hossein Kavianihamedani added 3 commits January 23, 2026 09:37

Add TrainBatch dataclass for universal training batches

f6e493c

More consice examples

8eb1f77

Move TrainBatch to types.py and update collate imports

e6f1ff8

Update TitanTrainer and GRPO main to use TrainBatch

34af55b

HosseinKaviani-H force-pushed the TrainBatch_Generic branch from 81e475d to 34af55b Compare January 23, 2026 18:32

Hossein Kavianihamedani added 2 commits January 23, 2026 16:11

Update test scripts to use TrainBatch

8cf0f1b

Update test scripts to use TrainBatch

0a988c2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Train batch generic #724

Train batch generic #724

Uh oh!

HosseinKaviani-H commented Jan 22, 2026 •

edited

Loading

Uh oh!

Uh oh!

felipemello1 left a comment

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Train batch generic #724

Are you sure you want to change the base?

Train batch generic #724

Uh oh!

Conversation

HosseinKaviani-H commented Jan 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Solution

Test Plan

Uh oh!

Uh oh!

felipemello1 left a comment

Choose a reason for hiding this comment

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026

Uh oh!

joecummings commented Jan 23, 2026

Uh oh!

felipemello1 commented Jan 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

HosseinKaviani-H commented Jan 22, 2026 •

edited

Loading

felipemello1 commented Jan 23, 2026 •

edited

Loading