Word segmentation results are not very accurate #91

@kalxd

I am currently running on the Docker postgres:17 image, using a new image built from the Dockerfile that zhparser provides.

When I test the sentence 我来看望大家, the segmentation result does not look right:

# select * from ts_debug('china', '我来看望大家');

 alias |  description   | token | dictionaries | dictionary | lexemes 
-------+----------------+-------+--------------+------------+---------
 r     | pronoun,代词   | 我    | {}           |            | 
 u     | auxiliary,助词 | 来看  | {}           |            | 
 v     | verb,动词      | 望    | {simple}     | simple     | {望}
 r     | pronoun,代词   | 大家  | {}           |            | 
(4 rows)

It splits 来看望 into 来看 and 望.
Is there any way to fix this kind of problem?
