CLDR-17223 Add nestedBracketReplacement for use in display names #5240

sffc · 2025-12-17T01:22:47Z

CLDR-17223

This PR completes the ticket.

NOTE: The CLDR implementation of locale display names needs to be updated with the new algorithm.

ALLOW_MANY_COMMITS=true

sffc · 2025-12-17T02:22:58Z

Do I need to add this to any of the following files? These are the ones where I find moreInformation referenced:

TestCoverageLevel.txt
missingOk.txt
prettyPath.txt
PathHeader.txt
PathDescriptions.md
IdToPath.java
ExampleDependencies.java
TestXPathTable.java

Also, about this test failure:

Error:  (TestExampleGenerator.java:327)  Error: No example:	<[>	"//ldml/characters/nestedBracketReplacement[@source=\"([^\"]*+)\"]",

I want this to not appear in the Survey Tool, because it's easy to get wrong and it hardly ever changes. How should I do this? Should I add it to TestExampleGenerator::DELIBERATE_EXCLUDED_EXAMPLES, as was done for moreInformation?

conradarcturus · 2025-12-19T19:41:37Z

I was worried that not every language uses parentheses (in which case a substitution could fail), but parentheses are nearly universal for localePattern: https://www.unicode.org/cldr/charts/48/by_type/locale_display_names.locale_name_patterns.html#4c3249dbb329101c

Can you test out the algorithm that would implement this on RTL locales? I want to make sure it doesn't cause problems.

Naming-wise, I like "innerBracket" (a bracket that is in a bracket) over "nestedBracketReplacement", but I defer to you.

Per usual, that's my 2 cents but I am happy to follow your lead. I'm deferring the accept to someone who is a more frequent Design group member.

macchiati · 2026-01-13T19:21:13Z

The stuff to update is in https://cldr.unicode.org/development/updating-dtds.

The way to hide stuff in the Survey Tool is not obvious: add a "; HIDE" suffix to the path's entry in PathHeader.txt.

Example:
//ldml/dates/timeZoneNames/zone[@type="Etc/(GMT|UTC)(.*)"]/exemplarCity ; Special ; Suppress ; Etc/$1$2 ; exemplarCity ; HIDE

I'll file a ticket to fix that.
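
For readers unfamiliar with the file, here is a minimal sketch of how the trailing HIDE field in the example line could be detected. It assumes only what the example shows: fields separated by semicolons, with HIDE as the last field. The helper names `split_fields` and `is_hidden` are hypothetical and are not part of the CLDR tooling, which has its own PathHeader parser.

```python
def split_fields(line: str) -> list[str]:
    """Split a PathHeader.txt-style line on semicolons and trim whitespace."""
    return [part.strip() for part in line.split(";")]


def is_hidden(line: str) -> bool:
    """Return True when the last field of the line is the HIDE marker."""
    return split_fields(line)[-1] == "HIDE"


# The example line quoted in the comment above, copied verbatim.
EXAMPLE = (
    r'//ldml/dates/timeZoneNames/zone[@type="Etc/(GMT|UTC)(.*)"]/exemplarCity'
    r" ; Special ; Suppress ; Etc/$1$2 ; exemplarCity ; HIDE"
)

if __name__ == "__main__":
    print(split_fields(EXAMPLE))
    print(is_hidden(EXAMPLE))
```

Dropping the final "; HIDE" from the same line makes `is_hidden` return False, which is the only distinction this sketch models.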

macchiati

The ticket has been accepted, so this is not blocked by that.

I have one open question; otherwise this looks good.

macchiati · 2026-01-13T19:10:22Z

common/supplemental/coverageLevels.xml


 <!-- Moved up as part of change to moderate -->
 		<coverageLevel value="moderate" match="characters/ellipsis[@type='%ellipsisTypes']"/>
+		<coverageLevel value="moderate" match="characters/nestedBracketReplacement"/>


I think the coverage match might need to be complete, that is, include the source attribute. The attribute's value can be '%A'.

(this comment is on an old version of the PR before I figured out how to do it)

macchiati · 2026-01-13T19:13:02Z

docs/ldml/tr35-general.md

-| hi-u-nu-latn-t-en-h0-hybrid   | Hindi (Hybrid: English, Western Digits) |
-| en-u-nu-deva-t-de             | English (Transform: German, Devanagari Digits) |
-| fr-z-zz-zzz-v-vv-vvv-u-uu-uuu-t-ru-Cyrl-s-ss-sss-a-aa-aaa-x-u-x | French (Transform: Russian \[Cyrillic\], uu: uuu, a: aa-aaa, s: ss-sss, v: vv-vvv, x: u-x, z: zz-zzz) |
+| hi-u-nu-latn-t-en-h0-hybrid   | Hindi (Western Digits, Hybrid: English) |


Interesting. The question I have is, when there are both T and U, which is "most important" for the user? For example, I think that for hi-Latn, the fact that it is a hybrid might be more important.

Also need to check; I think we might have a specialized name for hi-Latn, namely Hinglish.

I changed the order to make it clearer that the Unicode extension keywords apply to the main language and not the transform language. Example:

my-IN-t-en-MM-u-nu-latn

Current Spec: Burmese (India, Transform: English [Myanmar [Burma]], Latin Digits)

Flatten, current order: Burmese (India, Transform: English, Myanmar [Burma], Latin Digits)

Flatten, new order: Burmese (India, Latin Digits, Transform: English, Myanmar [Burma])

With "Flatten, current order", it's not clear that "Latin Digits" applies to "Burmese (India)" and not "English (Myanmar [Burma])"

Ok, makes sense
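
To make the flattened form concrete, the following is a rough sketch of bracket substitution inside an outer pattern. It is not the CLDR implementation: the replacement table, the "{0} ({1})" pattern, the ", " separator, the hand-ordered qualifier list, and the function names are all assumptions chosen to reproduce the "Flatten, new order" string for my-IN-t-en-MM-u-nu-latn.

```python
# Assumed data: each outer bracket maps to the bracket used when nested.
NESTED_BRACKET_REPLACEMENT = {"(": "[", ")": "]"}


def replace_nested_brackets(text: str, table=NESTED_BRACKET_REPLACEMENT) -> str:
    """Swap every source bracket in text for its nested replacement."""
    return text.translate(str.maketrans(table))


def format_display_name(base: str, qualifiers: list[str],
                        pattern: str = "{0} ({1})",
                        separator: str = ", ") -> str:
    """Join the qualifiers in one flat list inside the outer brackets.

    Brackets already present in a qualifier (such as a territory name)
    are replaced first so they do not clash with the outer pair.
    """
    inner = separator.join(replace_nested_brackets(q) for q in qualifiers)
    return pattern.replace("{0}", base).replace("{1}", inner)


if __name__ == "__main__":
    # Qualifiers are supplied by hand in the "new order" from the comment;
    # "Myanmar (Burma)" is assumed to be the unmodified territory name.
    name = format_display_name(
        "Burmese",
        ["India", "Latin Digits", "Transform: English", "Myanmar (Burma)"],
    )
    print(name)
```

The sketch leaves the question of ordering to the caller; it only illustrates that the bracket swap happens on each qualifier before the outer pattern is applied.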

sffc · 2026-01-15T11:14:07Z

> Can you test out the algorithm that would implement this on RTL locales? I want to make sure it doesn't cause problems.

I don't see how this would adversely impact RTL locales relative to not performing the replacement. We're swapping one bracket for another, with the same bidi classes.
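
The bidi claim can be spot-checked with Python's standard `unicodedata` module. This sketch only covers the ASCII pairs "(" / "[" and ")" / "]", which are assumed here to be the source and replacement characters; a locale using other bracket characters would need its own check. The helper name `bidi_profile` is hypothetical.

```python
import unicodedata


def bidi_profile(ch: str) -> tuple[str, bool]:
    """Return the Bidi_Class and Bidi_Mirrored property of one character."""
    return unicodedata.bidirectional(ch), bool(unicodedata.mirrored(ch))


# Assumed source -> replacement pairs for the nested brackets.
PAIRS = [("(", "["), (")", "]")]

if __name__ == "__main__":
    for source, replacement in PAIRS:
        same = bidi_profile(source) == bidi_profile(replacement)
        print(source, bidi_profile(source),
              replacement, bidi_profile(replacement),
              "same" if same else "different")
```

All four characters report the same class (ON, Other Neutral) and are mirrored, which is consistent with the point that the swap does not change how the bidi algorithm treats the text. The module does not expose the paired-bracket property, so that part is not covered by this check.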

> Naming-wise, I like "innerBracket" (a bracket that is in a bracket) over "nestedBracketReplacement", but I defer to you.

Were you thinking something like:

<innerBracket bracket="(">[</innerBracket>

I feel very neutral on the name of the XML tag.

See unicode-org#5240

jira-pull-request-webhook · 2026-01-17T09:09:22Z

Hooray! The files in the branch are the same across the force-push. 😃

~ Your Friendly Jira-GitHub PR Checker Bot

sffc · 2026-01-17T09:10:03Z

@conradarcturus did you wish to reply to my reply above, or shall I merge this?

github-actions bot assigned sffc Dec 17, 2025

sffc mentioned this pull request Dec 17, 2025

CLDR-17223 Use new menu attribute in territory display names #5225

Draft

1 task

sffc requested a review from macchiati December 17, 2025 02:23

sffc marked this pull request as ready for review December 17, 2025 02:23

macchiati reviewed Jan 14, 2026

macchiati approved these changes Jan 15, 2026

CLDR-17223 Add nestedBracketReplacement for use in display names

5bf384b

See unicode-org#5240

sffc force-pushed the nestedBracketReplacement branch from 09c4419 to 5bf384b Compare January 17, 2026 09:09
