In Chinese word segmentation, only a single word is separated

Execute the following code (tabooSegmentCustomDicList there are more than 2000 words)
`
for _, tabooSegmentCustomDic := range tabooSegmentCustomDicList {
		lowerCaseWord := strings.ToLower(tabooSegmentCustomDic.Word)
		segmentutil.AddWord(lowerCaseWord)
	}


func AddWord(word string) bool {
	defer recoverPanic(word)
	err := seg.AddToken(word, 100)
	if err != nil {
		logger.Errorf("Error when AddWord,%s", word, err)
		return false
	}
	return true
}


func TextSegment(text string) []string {
	defer recoverPanic(text)
	return seg.Cut(text)
}

`

TextSegment("api发送文本loumès 𝘾𝘼𝙍𝙏𝙄𝙀𝙍")

the result is ["api","发","送","文","本","lou","mès"," ","𝘾𝘼𝙍𝙏𝙄𝙀𝙍"]


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

In Chinese word segmentation, only a single word is separated #176

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

In Chinese word segmentation, only a single word is separated #176

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions