Skip to content

Fix dataset bug and tokenizer type hint#34

Merged
SYSTEMS-OPERATOR merged 2 commits intomainfrom
codex/fix-bug-and-evaluate-ci-errors
Jun 27, 2025
Merged

Fix dataset bug and tokenizer type hint#34
SYSTEMS-OPERATOR merged 2 commits intomainfrom
codex/fix-bug-and-evaluate-ci-errors

Conversation

@SYSTEMS-OPERATOR
Copy link
Owner

Summary

  • fix negative index bug in TokenIDDataset.__iter__
  • correct return type annotation for BytePairTokenizer.train_bpe

Testing

  • pytest -q

https://chatgpt.com/codex/tasks/task_e_685e1106c86c8324b67c6dc8244463d8

@SYSTEMS-OPERATOR SYSTEMS-OPERATOR merged commit 5a0f517 into main Jun 27, 2025
1 check passed
@SYSTEMS-OPERATOR SYSTEMS-OPERATOR deleted the codex/fix-bug-and-evaluate-ci-errors branch June 27, 2025 03:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant