Chinese text extraction is not correct #1

@haifenghuang

As the title suggests, keyword extraction does not work for Chinese text. Consider the code below:

keyword_processor := NewKeywordProcessor()
keyword_processor.AddKeywords("欢迎")
keyword_processor.AddKeywords("来")
keyword_processor.AddKeywords("北京")

result := keyword_processor.ExtractKeywords("欢迎来北京")

for _, v := range result {
    e := ExtractResult(v)
    fmt.Printf("return : %s\n", e.Keyword)
}

Nothing is printed, because len(result) is 0.

If we change the keywords above to English:

keyword_processor := NewKeywordProcessor()
keyword_processor.AddKeywords("welcome")
keyword_processor.AddKeywords("to")
keyword_processor.AddKeywords("beijing")

result := keyword_processor.ExtractKeywords("welcome to beijing")

for _, v := range result {
    e := ExtractResult(v)
    fmt.Printf("return : %s\n", e.Keyword)
}

The result is:

return : welcome
return : to
return : beijing
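
The issue does not identify the cause. One thing that may be worth checking, purely as an assumption, is whether the input is scanned byte by byte rather than character by character: in Go, each of these Chinese characters takes three bytes in UTF-8, so a byte-indexed loop never sees a whole character. The sketch below is self-contained and does not use the library; runeUnits is a hypothetical helper introduced only for illustration.

```go
package main

import (
	"fmt"
	"unicode/utf8"
)

// runeUnits is a hypothetical helper (not part of the library in this issue).
// It splits s into one string per Unicode code point.
func runeUnits(s string) []string {
	units := make([]string, 0, utf8.RuneCountInString(s))
	for _, r := range s { // ranging over a string decodes UTF-8 runes
		units = append(units, string(r))
	}
	return units
}

func main() {
	s := "欢迎来北京"

	fmt.Println(len(s))                    // number of bytes: 15
	fmt.Println(utf8.RuneCountInString(s)) // number of characters: 5
	fmt.Println(runeUnits(s))              // [欢 迎 来 北 京]
}
```

If the extraction loop indexes the string as s[i], it walks 15 single bytes instead of 5 characters, which could explain why no Chinese keyword is matched while ASCII keywords work. This is only a guess until the library code is checked.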
