Fix WordStoppingCriteria device mismatch #3

Copilot · 2025-10-30T09:03:09Z

Summary

Successfully addressed review feedback:

Removed intermediate variable: Changed from device = input_ids.device followed by device=device to directly using device=input_ids.device inline
Moved tests to dedicated file: Created tests/test_word_stopping_criteria.py with all 5 WordStoppingCriteria tests and MockTokenizer class

All 51 tests pass, linting passes with no issues.

Original prompt

This section details on the original issue you should resolve

<issue_title>[BUG] WordStoppingCriteria does not respect input device</issue_title>
<issue_description>### Current Behavior

In the call method, regardless of the device of input_ids, the output is on the CPU. This results in device mismatch error in downstream tasks.
def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor, **kwargs) -> torch.BoolTensor:
        texts = [self.tokenizer.decode(ids) for ids in input_ids]
        is_done = [is_word_complete(text) for text in texts]
        return torch.BoolTensor(is_done)
Expected Behavior
@add_start_docstrings(STOPPING_CRITERIA_INPUTS_DOCSTRING)
    def __call__(self, input_ids: torch.LongTensor, scores: torch.FloatTensor, **kwargs) -> torch.BoolTensor:
        device = input_ids.device
        texts = self.tokenizer.batch_decode(
            input_ids.tolist(),  
            skip_special_tokens=False  # 
        )
        is_done_list = [is_word_complete(t) for t in texts]  
        return torch.tensor(is_done_list, dtype=torch.bool, device=device)
Convert the boolean into input_ids device when returning. Also add batch decoding for faster performance.

</issue_description>

<agent_instructions>make a PR only adding the device to the bool tensor</agent_instructions>

Comments on the Issue (you are @copilot in this section)

Fixes [BUG] WordStoppingCriteria does not respect input device #2

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: AmitMY <5757359+AmitMY@users.noreply.github.com>

AmitMY

@copilot please change based on my notes

AmitMY · 2025-10-30T09:20:12Z

words_segmentation/pretokenizer.py

        texts = [self.tokenizer.decode(ids) for ids in input_ids]
        is_done = [is_word_complete(text) for text in texts]
-        return torch.BoolTensor(is_done)
+        return torch.tensor(is_done, dtype=torch.bool, device=device)


use device=input_ids.device directly

Done in commit 58d73a7. Removed the intermediate device variable and now using device=input_ids.device directly in the tensor constructor.

AmitMY · 2025-10-30T09:20:43Z

tests/test_pretokenizer.py

    assert not is_word_complete("こんにちは")


+def test_word_stopping_criteria_basic():


move these tests to a tests/test_word_stopping_criteria.py

Done in commit 58d73a7. Moved all WordStoppingCriteria tests and the MockTokenizer class to tests/test_word_stopping_criteria.py.

… file Co-authored-by: AmitMY <5757359+AmitMY@users.noreply.github.com>

Initial plan

2bed423

Copilot AI assigned Copilot and AmitMY Oct 30, 2025

Copilot started work on behalf of AmitMY October 30, 2025 09:03 View session

Copilot AI and others added 3 commits October 30, 2025 09:07

Initial plan for fixing WordStoppingCriteria device issue

5df751d

Co-authored-by: AmitMY <5757359+AmitMY@users.noreply.github.com>

Add tests for WordStoppingCriteria before implementing device fix

4f34582

Co-authored-by: AmitMY <5757359+AmitMY@users.noreply.github.com>

Fix WordStoppingCriteria to respect input device

1ab5b4a

Co-authored-by: AmitMY <5757359+AmitMY@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix WordStoppingCriteria device handling for input_ids~~ Fix WordStoppingCriteria device mismatch Oct 30, 2025

Copilot AI requested a review from AmitMY October 30, 2025 09:16

Copilot finished work on behalf of AmitMY October 30, 2025 09:16

AmitMY requested changes Oct 30, 2025

View reviewed changes

Copilot started work on behalf of AmitMY October 30, 2025 09:21 View session

Address review feedback: use device inline and move tests to separate…

58d73a7

… file Co-authored-by: AmitMY <5757359+AmitMY@users.noreply.github.com>

Copilot AI requested a review from AmitMY October 30, 2025 09:28

Copilot finished work on behalf of AmitMY October 30, 2025 09:28

AmitMY approved these changes Oct 30, 2025

View reviewed changes

AmitMY marked this pull request as ready for review October 30, 2025 09:31

AmitMY merged commit df69c6e into main Oct 30, 2025
4 checks passed

AmitMY deleted the copilot/fix-word-stopping-criteria branch October 30, 2025 09:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix WordStoppingCriteria device mismatch #3

Fix WordStoppingCriteria device mismatch #3

Uh oh!

Copilot AI commented Oct 30, 2025 •

edited

Loading

Uh oh!

AmitMY left a comment

Uh oh!

AmitMY Oct 30, 2025

Uh oh!

Copilot AI Oct 30, 2025

Uh oh!

AmitMY Oct 30, 2025

Uh oh!

Copilot AI Oct 30, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		assert not is_word_complete("こんにちは")


		def test_word_stopping_criteria_basic():

Uh oh!

Fix WordStoppingCriteria device mismatch #3

Fix WordStoppingCriteria device mismatch #3

Uh oh!

Conversation

Copilot AI commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Expected Behavior

Comments on the Issue (you are @copilot in this section)

Uh oh!

AmitMY left a comment

Choose a reason for hiding this comment

Uh oh!

AmitMY Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

AmitMY Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Oct 30, 2025 •

edited

Loading