Skip to content

对于生僻字的拆分会失败 #1049

@qixurulin

Description

@qixurulin

jieba.add_word('大黄䓬虫',freq=1000,tag='rr')
jieba.suggest_freq('大黄䓬虫', tune=True)

str='大黄䓬虫'
words= pseg.cut(str,HMM=True)
print(list(words))

输出:[pair('大黄', 'rr'), pair('䓬', 'x'), pair('虫', 'n')]

盐酸地尔硫䓬,同样也会失败,不知道有没有人遇到和我一样的问题?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions