UnicodeSet parser does not support all code points

The unescaping code in `icu_unicodeset_parser` only works for scalar values (Rust `char`'s), when all code points should be supported (any `u32` below or equal `char::MAX`). Should be relatively straightforward to fix by replacing chars with u32s and a `val <= char::MAX as u32` check instead of `char::try_from` in `parse_escaped_char`.

This currently fails, but should pass: `icu_unicodeset_parser::parse(r"[^\uD800-\uE0FF]")`

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

UnicodeSet parser does not support all code points #3893

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

UnicodeSet parser does not support all code points #3893

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions