fix: pass full sequence to stopping criteria in generation loop #104

Jameswlepage · 2026-01-26T05:47:43Z

What:

Bug Fix
New Feature

Description:

The stopping criteria in the generation loop was incorrectly receiving $generatedInputIds (only the newly generated token from the current step) instead of $allInputIds (the full sequence including prompt and all generated tokens).

This caused MaxLengthCriteria to never trigger based on sequence length, because it was always checking a sequence of length 1, which would never exceed max_length. As a result, text generation would run indefinitely until hitting memory limits or an EOS token.

The Fix:

- $stop = $stoppingCriteria($generatedInputIds, $scores);
+ $stop = $stoppingCriteria($allInputIds, $scores);

Testing:

Verified that maxNewTokens parameter now correctly limits generation length.

The stopping criteria was incorrectly receiving $generatedInputIds (only the newly generated token from the current step) instead of $allInputIds (the full sequence including prompt and all generated tokens). This caused MaxLengthCriteria to never trigger because it was always checking a sequence of length 1, which would never exceed max_length.

Jameswlepage closed this by deleting the head repository Jan 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: pass full sequence to stopping criteria in generation loop #104

fix: pass full sequence to stopping criteria in generation loop #104

Uh oh!

Jameswlepage commented Jan 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix: pass full sequence to stopping criteria in generation loop #104

fix: pass full sequence to stopping criteria in generation loop #104

Uh oh!

Conversation

Jameswlepage commented Jan 26, 2026

What:

Description:

The Fix:

Testing:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant