Open
Description
I want to use Greynir-Correct to correct text that is not a whole sentence, in the extreme case a single word. Which method or options should I use to make that possible?
Currently, calling tokenize() with the option only_ci=True reports the following error for each input (the message translates to "Word should begin with an uppercase letter"):
Maðurin Z002 Orð á að byrja á hástaf: 'maðurin'
Maðurinn Z002 Orð á að byrja á hástaf: 'maðurinn'
Sample code:
from reynir_correct import tokenize

texts = ["maðurin", "maðurinn"]
for text in texts:
    # Each text is checked on its own, as if it were a complete input
    for tok in tokenize(text, only_ci=True):
        # Skip tokens without text (e.g. sentence markers)
        if tok.txt:
            print(f"{tok.txt:12} {tok.error_code:8} {tok.error_description}")
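Until there is an option for this, one possible workaround is to filter out the sentence-start annotation after the fact. The sketch below is only an illustration, not part of the library: `relevant_errors`, `IGNORED_CODES` and `StandInToken` are hypothetical names, and `StandInToken` merely mimics the three attributes used in the sample code so the filter can be tried without the library. It assumes Z002 is the only annotation that is unwanted for fragments. Note that it only hides the Z002 report; it does not make the checker report other problems (such as the misspelling in "maðurin") that Z002 may be masking.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator

# Error codes to ignore for fragment input. Z002 is the code seen in the
# output above; extending this set is an assumption, not library behavior.
IGNORED_CODES = frozenset({"Z002"})


@dataclass
class StandInToken:
    """Hypothetical stand-in with the attributes used in the sample code."""
    txt: str
    error_code: str = ""
    error_description: str = ""


def relevant_errors(tokens: Iterable, ignored=IGNORED_CODES) -> Iterator:
    """Yield only tokens that carry an error code outside `ignored`."""
    for tok in tokens:
        if tok.txt and tok.error_code and tok.error_code not in ignored:
            yield tok


# Demo with stand-in tokens; "X999" is a made-up code used for illustration.
demo = [
    StandInToken("Maðurin", "Z002", "Orð á að byrja á hástaf: 'maðurin'"),
    StandInToken("dæmi", "X999", "made-up error"),
    StandInToken("orð"),
]
for tok in relevant_errors(demo):
    print(f"{tok.txt:12} {tok.error_code:8} {tok.error_description}")
```

With the real library, the same filter could presumably be applied to the generator returned by tokenize(text, only_ci=True), since it relies only on the txt and error_code attributes already used above.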