Skip to content

Conversation

@eggrobin
Copy link
Member

@eggrobin eggrobin commented Jan 9, 2026

Checklist

  • Required: Issue filed: ICU-23307
  • Required: The PR title must be prefixed with a JIRA Issue number. Example: "ICU-NNNNN Fix xyz"
  • Required: Each commit message must be prefixed with a JIRA Issue number. Example: "ICU-NNNNN Fix xyz"
  • Issue accepted (done by Technical Committee after discussion)
  • Tests included, if applicable
  • API docs and/or User Guide docs changed or added, if applicable
  • Approver: Feel free to merge on my behalf

@eggrobin eggrobin marked this pull request as draft January 9, 2026 17:22
@eggrobin
Copy link
Member Author

eggrobin commented Jan 9, 2026

Somehow this is affecting Han-Latin/Names; Latin-Bopomofo, even though neither appears to contain string literals in UnicodeSets.

@eggrobin
Copy link
Member Author

eggrobin commented Jan 9, 2026

Latin-NumericPinyin.xml:

($tone) ( [i o n u {o n} {n g}]) → $2 &Pinyin-NumericPinyin($1);

Whyyy??!

@eggrobin
Copy link
Member Author

eggrobin commented Jan 9, 2026

de-ASCII also does this:

$AE = [Ä {A \u0308}];
$OE = [Ö {O \u0308}];
$UE = [Ü {U \u0308}];

[ä {a \u0308}] → ae;
[ö {o \u0308}] → oe;
[ü {u \u0308}] → ue;

and blt-fonipa-t-blt:

$DIGRAPHS = [{ꪹ  ꪸ} {ꪹ  ꪷ} {ꪹ ꪱ}];

These seem to be the only users of space-insensitivity of string literals.

@macchiati
Copy link
Member

macchiati commented Jan 9, 2026 via email

@eggrobin
Copy link
Member Author

eggrobin commented Jan 9, 2026

An artifact, that we just never noticed before the change

I added something to the CLDR-design agenda. (Not sure I can make it Monday, I expect this week to be chaotic—I am moving Friday—, but I’m sure you have enough context to discuss this topic.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants