Add docs for the Unicode version implemented by each segmenter #7300

@sffc

Description

The docs of each segmenter type (WordSegmenter, LineSegmenter, SentenceSegmenter, GraphemeClusterSegmenter) should include a sentence such as:

This segmenter conforms to Unicode 17.0

And we should update that doc string whenever we update the underlying data.
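
As a rough illustration of the request, here is a minimal sketch of what such a doc string could look like. `ExampleSegmenter`, `UNICODE_VERSION`, and `conformance_sentence` are hypothetical names made up for this example and are not part of the real segmenter API; the constant is only one possible way to keep the version next to the doc string so that both get updated together.

```rust
/// Hypothetical stand-in for a segmenter type (not the real API).
///
/// This segmenter conforms to Unicode 17.0
pub struct ExampleSegmenter;

impl ExampleSegmenter {
    /// Assumed helper: the (major, minor) Unicode version of the underlying
    /// data. Update this together with the doc string above whenever the
    /// data is regenerated.
    pub const UNICODE_VERSION: (u8, u8) = (17, 0);

    /// Builds the sentence proposed in this issue from the constant above.
    pub fn conformance_sentence() -> String {
        format!(
            "This segmenter conforms to Unicode {}.{}",
            Self::UNICODE_VERSION.0,
            Self::UNICODE_VERSION.1
        )
    }
}
```

With something like this, a unit test could compare `ExampleSegmenter::conformance_sentence()` against the documented sentence, which might help catch a doc string that was not updated along with the data.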

CC @aethanyc @makotokato

Metadata

Assignees

No one assigned

    Labels

    C-segmentation (Component: Segmentation)
    T-docs-tests (Type: Code change outside core library)
    milestone-non-blocking (These issues are in a milestone, but do not block the milestone and can be removed if necessary)
