Skip to content

Remove sentence-end from search tree and handle its scoring in the search algorithms#183

Open
larissakl wants to merge 2 commits intomultiple_label_scorersfrom
new-sentence-end-handling
Open

Remove sentence-end from search tree and handle its scoring in the search algorithms#183
larissakl wants to merge 2 commits intomultiple_label_scorersfrom
new-sentence-end-handling

Conversation

@larissakl
Copy link
Contributor

The sentence-end symbol is removed from the search tree again. In the timesync-searches, all involved label scorers score sentence-end at the end of a sequence in finalizeHypotheses(), which is then added to the overall hypothesis score. In the tree search, this is only done for hypotheses that are in a final state of the search tree (currently all root nodes). In this case, the word-level LM score for the sentence-end token is also included.

Even though it is no longer necessary, I kept the logic that handles two consecutive exits in the tree search. I think it won't have any negative effects and could maybe be useful in the future, but if you prefer, I would also agree with removing it.

Depends on #172.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant