Skip to content

Commit b4ba46c

Browse files
authored
Subscript wyzɣ (#955)
* UnicodeData.txt lines from L2/24-219 * lb=AL * Latin * Regenerate UCD * Failing test * Other_Lowercase * Regenerate UCD * Make the test pass * Ignore IDNA2008_Category
1 parent b11408d commit b4ba46c

20 files changed

+153
-99
lines changed

unicodetools/data/ucd/dev/DerivedAge.txt

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedAge-18.0.0.txt
2-
# Date: 2025-11-27, 16:49:00 GMT
2+
# Date: 2025-11-27, 17:33:04 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -2128,6 +2128,7 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG
21282128
0984 ; 18.0 # BENGALI SIGN COMBINING ANUSVARA ABOVE
21292129
1ADE..1ADF ; 18.0 # [2] COMBINING GRAVE-DOT..COMBINING DOT-ACUTE
21302130
1AEC..1AF0 ; 18.0 # [5] COMBINING CARON-ACUTE..COMBINING DOUBLE COMMA ABOVE
2131+
209D..209F ; 18.0 # [3] LATIN SUBSCRIPT SMALL LETTER W..LATIN SUBSCRIPT SMALL LETTER Z
21312132
20C2..20C3 ; 18.0 # [2] RUFIYAA SIGN..UAE DIRHAM SIGN
21322133
107BB..107BF ; 18.0 # [5] MODIFIER LETTER SMALL TURNED T..MODIFIER LETTER SMALL ESH WITH DOUBLE BAR
21332134
10ED9..10EEE ; 18.0 # [22] ARABIC CROWN LETTER BEH..ARABIC CROWN LETTER YEH
@@ -2142,12 +2143,12 @@ FDC8..FDCE ; 17.0 # [7] ARABIC LIGATURE RAHIMAHU ALLAAH TAAALAA..ARABIC LIG
21422143
18D1F..18D20 ; 18.0 # [2] TANGUT IDEOGRAPH-18D1F..TANGUT IDEOGRAPH-18D20
21432144
1DF1F..1DF24 ; 18.0 # [6] LATIN SMALL LETTER D-ETH DIGRAPH..LATIN SMALL LETTER T-THETA DIGRAPH
21442145
1DF2B..1DF56 ; 18.0 # [44] LATIN SMALL LETTER DEZH DIGRAPH WITH CURL..LATIN LETTER GLOTTAL STOP WITH DOUBLE STROKE
2145-
1DFD1..1DFFF ; 18.0 # [47] MODIFIER LETTER SMALL CAPITAL P..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
2146+
1DFD0..1DFFF ; 18.0 # [48] LATIN SUBSCRIPT SMALL LETTER GAMMA..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
21462147
1F7DB ; 18.0 # BULLET IN DOUBLE CIRCLE
21472148
1F7F1..1F7FF ; 18.0 # [15] CIRCLE WITH DOUBLE VERTICAL AND HORIZONTAL LINE..RHOMBUS
21482149
2B81E ; 18.0 # CJK UNIFIED IDEOGRAPH-2B81E
21492150
3D000..3FC3F ; 18.0 # [11328] SEAL CHARACTER-3D000..SEAL CHARACTER-3FC3F
21502151

2151-
# Total code points: 11855
2152+
# Total code points: 11859
21522153

21532154
# EOF

unicodetools/data/ucd/dev/DerivedCoreProperties.txt

Lines changed: 28 additions & 28 deletions
Large diffs are not rendered by default.

unicodetools/data/ucd/dev/DerivedNormalizationProps.txt

Lines changed: 20 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# DerivedNormalizationProps-18.0.0.txt
2-
# Date: 2025-11-27, 16:49:28 GMT
2+
# Date: 2025-11-27, 17:33:31 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1385,7 +1385,7 @@ FB46..FB4E ; NFC_QC; N # Lo [9] HEBREW LETTER TSADI WITH DAGESH..HEBREW LET
13851385
208A..208C ; NFKD_QC; N # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
13861386
208D ; NFKD_QC; N # Ps SUBSCRIPT LEFT PARENTHESIS
13871387
208E ; NFKD_QC; N # Pe SUBSCRIPT RIGHT PARENTHESIS
1388-
2090..209C ; NFKD_QC; N # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
1388+
2090..209F ; NFKD_QC; N # Lm [16] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER Z
13891389
20A8 ; NFKD_QC; N # Sc RUPEE SIGN
13901390
2100..2101 ; NFKD_QC; N # So [2] ACCOUNT OF..ADDRESSED TO THE SUBJECT
13911391
2102 ; NFKD_QC; N # L& DOUBLE-STRUCK CAPITAL C
@@ -1710,7 +1710,7 @@ FFED..FFEE ; NFKD_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
17101710
1D7C3 ; NFKD_QC; N # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC PARTIAL DIFFERENTIAL
17111711
1D7C4..1D7CB ; NFKD_QC; N # L& [8] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD SMALL DIGAMMA
17121712
1D7CE..1D7FF ; NFKD_QC; N # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
1713-
1DFD1..1DFFF ; NFKD_QC; N # Lm [47] MODIFIER LETTER SMALL CAPITAL P..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
1713+
1DFD0..1DFFF ; NFKD_QC; N # Lm [48] LATIN SUBSCRIPT SMALL LETTER GAMMA..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
17141714
1E030..1E06D ; NFKD_QC; N # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
17151715
1EE00..1EE03 ; NFKD_QC; N # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
17161716
1EE05..1EE1F ; NFKD_QC; N # Lo [27] ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL DOTLESS QAF
@@ -1757,7 +1757,7 @@ FFED..FFEE ; NFKD_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
17571757
1FBF0..1FBF9 ; NFKD_QC; N # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
17581758
2F800..2FA1D ; NFKD_QC; N # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
17591759

1760-
# Total code points: 17141
1760+
# Total code points: 17145
17611761

17621762
# ================================================
17631763

@@ -1888,7 +1888,7 @@ FFED..FFEE ; NFKD_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
18881888
208A..208C ; NFKC_QC; N # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
18891889
208D ; NFKC_QC; N # Ps SUBSCRIPT LEFT PARENTHESIS
18901890
208E ; NFKC_QC; N # Pe SUBSCRIPT RIGHT PARENTHESIS
1891-
2090..209C ; NFKC_QC; N # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
1891+
2090..209F ; NFKC_QC; N # Lm [16] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER Z
18921892
20A8 ; NFKC_QC; N # Sc RUPEE SIGN
18931893
2100..2101 ; NFKC_QC; N # So [2] ACCOUNT OF..ADDRESSED TO THE SUBJECT
18941894
2102 ; NFKC_QC; N # L& DOUBLE-STRUCK CAPITAL C
@@ -2124,7 +2124,7 @@ FFED..FFEE ; NFKC_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
21242124
1D7C3 ; NFKC_QC; N # Sm MATHEMATICAL SANS-SERIF BOLD ITALIC PARTIAL DIFFERENTIAL
21252125
1D7C4..1D7CB ; NFKC_QC; N # L& [8] MATHEMATICAL SANS-SERIF BOLD ITALIC EPSILON SYMBOL..MATHEMATICAL BOLD SMALL DIGAMMA
21262126
1D7CE..1D7FF ; NFKC_QC; N # Nd [50] MATHEMATICAL BOLD DIGIT ZERO..MATHEMATICAL MONOSPACE DIGIT NINE
2127-
1DFD1..1DFFF ; NFKC_QC; N # Lm [47] MODIFIER LETTER SMALL CAPITAL P..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
2127+
1DFD0..1DFFF ; NFKC_QC; N # Lm [48] LATIN SUBSCRIPT SMALL LETTER GAMMA..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
21282128
1E030..1E06D ; NFKC_QC; N # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
21292129
1EE00..1EE03 ; NFKC_QC; N # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
21302130
1EE05..1EE1F ; NFKC_QC; N # Lo [27] ARABIC MATHEMATICAL WAW..ARABIC MATHEMATICAL DOTLESS QAF
@@ -2171,7 +2171,7 @@ FFED..FFEE ; NFKC_QC; N # So [2] HALFWIDTH BLACK SQUARE..HALFWIDTH WHITE CI
21712171
1FBF0..1FBF9 ; NFKC_QC; N # Nd [10] SEGMENTED DIGIT ZERO..SEGMENTED DIGIT NINE
21722172
2F800..2FA1D ; NFKC_QC; N # Lo [542] CJK COMPATIBILITY IDEOGRAPH-2F800..CJK COMPATIBILITY IDEOGRAPH-2FA1D
21732173

2174-
# Total code points: 5020
2174+
# Total code points: 5024
21752175

21762176
# ================================================
21772177

@@ -4132,6 +4132,9 @@ FFE3 ; Expands_On_NFKC # Sk FULLWIDTH MACRON
41324132
209A ; NFKC_CF; 0070 # Lm LATIN SUBSCRIPT SMALL LETTER P
41334133
209B ; NFKC_CF; 0073 # Lm LATIN SUBSCRIPT SMALL LETTER S
41344134
209C ; NFKC_CF; 0074 # Lm LATIN SUBSCRIPT SMALL LETTER T
4135+
209D ; NFKC_CF; 0077 # Lm LATIN SUBSCRIPT SMALL LETTER W
4136+
209E ; NFKC_CF; 0079 # Lm LATIN SUBSCRIPT SMALL LETTER Y
4137+
209F ; NFKC_CF; 007A # Lm LATIN SUBSCRIPT SMALL LETTER Z
41354138
20A8 ; NFKC_CF; 0072 0073 # Sc RUPEE SIGN
41364139
2100 ; NFKC_CF; 0061 002F 0063 # So ACCOUNT OF
41374140
2101 ; NFKC_CF; 0061 002F 0073 # So ADDRESSED TO THE SUBJECT
@@ -8274,6 +8277,7 @@ FFF0..FFF8 ; NFKC_CF; # Cn [9] <reserved-FFF0>..<reserved-FF
82748277
1DF4A ; NFKC_CF; 1DF4B # L& LATIN CAPITAL LETTER BARRED M
82758278
1DF4D ; NFKC_CF; 1DF4E # L& LATIN CAPITAL LETTER BARRED N
82768279
1DF51 ; NFKC_CF; 1DF52 # L& LATIN CAPITAL LETTER BARRED V
8280+
1DFD0 ; NFKC_CF; 0263 # Lm LATIN SUBSCRIPT SMALL LETTER GAMMA
82778281
1DFD1 ; NFKC_CF; 1D18 # Lm MODIFIER LETTER SMALL CAPITAL P
82788282
1DFD2 ; NFKC_CF; 0180 # Lm MODIFIER LETTER SMALL B WITH STROKE
82798283
1DFD3 ; NFKC_CF; 0111 # Lm MODIFIER LETTER SMALL D WITH STROKE
@@ -9244,7 +9248,7 @@ E0080..E00FF ; NFKC_CF; # Cn [128] <reserved-E0080>..<reserved-E
92449248
E0100..E01EF ; NFKC_CF; # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
92459249
E01F0..E0FFF ; NFKC_CF; # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
92469250

9247-
# Total code points: 10643
9251+
# Total code points: 10647
92489252

92499253
# ================================================
92509254

@@ -10355,6 +10359,9 @@ E01F0..E0FFF ; NFKC_CF; # Cn [3600] <reserved-E01F0>..<reserved-
1035510359
209A ; NFKC_SCF; 0070 # Lm LATIN SUBSCRIPT SMALL LETTER P
1035610360
209B ; NFKC_SCF; 0073 # Lm LATIN SUBSCRIPT SMALL LETTER S
1035710361
209C ; NFKC_SCF; 0074 # Lm LATIN SUBSCRIPT SMALL LETTER T
10362+
209D ; NFKC_SCF; 0077 # Lm LATIN SUBSCRIPT SMALL LETTER W
10363+
209E ; NFKC_SCF; 0079 # Lm LATIN SUBSCRIPT SMALL LETTER Y
10364+
209F ; NFKC_SCF; 007A # Lm LATIN SUBSCRIPT SMALL LETTER Z
1035810365
20A8 ; NFKC_SCF; 0072 0073 # Sc RUPEE SIGN
1035910366
2100 ; NFKC_SCF; 0061 002F 0063 # So ACCOUNT OF
1036010367
2101 ; NFKC_SCF; 0061 002F 0073 # So ADDRESSED TO THE SUBJECT
@@ -14497,6 +14504,7 @@ FFF0..FFF8 ; NFKC_SCF; # Cn [9] <reserved-FFF0>..<reserved-F
1449714504
1DF4A ; NFKC_SCF; 1DF4B # L& LATIN CAPITAL LETTER BARRED M
1449814505
1DF4D ; NFKC_SCF; 1DF4E # L& LATIN CAPITAL LETTER BARRED N
1449914506
1DF51 ; NFKC_SCF; 1DF52 # L& LATIN CAPITAL LETTER BARRED V
14507+
1DFD0 ; NFKC_SCF; 0263 # Lm LATIN SUBSCRIPT SMALL LETTER GAMMA
1450014508
1DFD1 ; NFKC_SCF; 1D18 # Lm MODIFIER LETTER SMALL CAPITAL P
1450114509
1DFD2 ; NFKC_SCF; 0180 # Lm MODIFIER LETTER SMALL B WITH STROKE
1450214510
1DFD3 ; NFKC_SCF; 0111 # Lm MODIFIER LETTER SMALL D WITH STROKE
@@ -15467,7 +15475,7 @@ E0080..E00FF ; NFKC_SCF; # Cn [128] <reserved-E0080>..<reserved-
1546715475
E0100..E01EF ; NFKC_SCF; # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1546815476
E01F0..E0FFF ; NFKC_SCF; # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
1546915477

15470-
# Total code points: 10605
15478+
# Total code points: 10609
1547115479

1547215480
# ================================================
1547315481

@@ -16005,7 +16013,7 @@ E01F0..E0FFF ; NFKC_SCF; # Cn [3600] <reserved-E01F0>..<reserved
1600516013
208A..208C ; Changes_When_NFKC_Casefolded # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
1600616014
208D ; Changes_When_NFKC_Casefolded # Ps SUBSCRIPT LEFT PARENTHESIS
1600716015
208E ; Changes_When_NFKC_Casefolded # Pe SUBSCRIPT RIGHT PARENTHESIS
16008-
2090..209C ; Changes_When_NFKC_Casefolded # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
16016+
2090..209F ; Changes_When_NFKC_Casefolded # Lm [16] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER Z
1600916017
20A8 ; Changes_When_NFKC_Casefolded # Sc RUPEE SIGN
1601016018
2100..2101 ; Changes_When_NFKC_Casefolded # So [2] ACCOUNT OF..ADDRESSED TO THE SUBJECT
1601116019
2102 ; Changes_When_NFKC_Casefolded # L& DOUBLE-STRUCK CAPITAL C
@@ -16442,7 +16450,7 @@ FFF0..FFF8 ; Changes_When_NFKC_Casefolded # Cn [9] <reserved-FFF0>..<reserv
1644216450
1DF4A ; Changes_When_NFKC_Casefolded # L& LATIN CAPITAL LETTER BARRED M
1644316451
1DF4D ; Changes_When_NFKC_Casefolded # L& LATIN CAPITAL LETTER BARRED N
1644416452
1DF51 ; Changes_When_NFKC_Casefolded # L& LATIN CAPITAL LETTER BARRED V
16445-
1DFD1..1DFFF ; Changes_When_NFKC_Casefolded # Lm [47] MODIFIER LETTER SMALL CAPITAL P..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
16453+
1DFD0..1DFFF ; Changes_When_NFKC_Casefolded # Lm [48] LATIN SUBSCRIPT SMALL LETTER GAMMA..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
1644616454
1E030..1E06D ; Changes_When_NFKC_Casefolded # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
1644716455
1E900..1E921 ; Changes_When_NFKC_Casefolded # L& [34] ADLAM CAPITAL LETTER ALIF..ADLAM CAPITAL LETTER SHA
1644816456
1EE00..1EE03 ; Changes_When_NFKC_Casefolded # Lo [4] ARABIC MATHEMATICAL ALEF..ARABIC MATHEMATICAL DAL
@@ -16497,6 +16505,6 @@ E0080..E00FF ; Changes_When_NFKC_Casefolded # Cn [128] <reserved-E0080>..<reser
1649716505
E0100..E01EF ; Changes_When_NFKC_Casefolded # Mn [240] VARIATION SELECTOR-17..VARIATION SELECTOR-256
1649816506
E01F0..E0FFF ; Changes_When_NFKC_Casefolded # Cn [3600] <reserved-E01F0>..<reserved-E0FFF>
1649916507

16500-
# Total code points: 10643
16508+
# Total code points: 10647
1650116509

1650216510
# EOF

unicodetools/data/ucd/dev/EastAsianWidth.txt

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# EastAsianWidth-18.0.0.txt
2-
# Date: 2025-11-27, 16:49:32 GMT
2+
# Date: 2025-11-27, 17:33:35 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -973,7 +973,7 @@
973973
208A..208C ; N # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
974974
208D ; N # Ps SUBSCRIPT LEFT PARENTHESIS
975975
208E ; N # Pe SUBSCRIPT RIGHT PARENTHESIS
976-
2090..209C ; N # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
976+
2090..209F ; N # Lm [16] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER Z
977977
20A0..20A8 ; N # Sc [9] EURO-CURRENCY SIGN..RUPEE SIGN
978978
20A9 ; H # Sc WON SIGN
979979
20AA..20AB ; N # Sc [2] NEW SHEQEL SIGN..DONG SIGN
@@ -2501,7 +2501,7 @@ FFFD ; A # So REPLACEMENT CHARACTER
25012501
1DF00..1DF09 ; N # Ll [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
25022502
1DF0A ; N # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
25032503
1DF0B..1DF56 ; N # L& [76] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN LETTER GLOTTAL STOP WITH DOUBLE STROKE
2504-
1DFD1..1DFFF ; N # Lm [47] MODIFIER LETTER SMALL CAPITAL P..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
2504+
1DFD0..1DFFF ; N # Lm [48] LATIN SUBSCRIPT SMALL LETTER GAMMA..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
25052505
1E000..1E006 ; N # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
25062506
1E008..1E018 ; N # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
25072507
1E01B..1E021 ; N # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI

unicodetools/data/ucd/dev/LineBreak.txt

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# LineBreak-18.0.0.txt
2-
# Date: 2025-11-27, 16:49:33 GMT
2+
# Date: 2025-11-27, 17:33:36 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -961,7 +961,7 @@
961961
208A..208C ; AL # Sm [3] SUBSCRIPT PLUS SIGN..SUBSCRIPT EQUALS SIGN
962962
208D ; OP # Ps SUBSCRIPT LEFT PARENTHESIS
963963
208E ; CL # Pe SUBSCRIPT RIGHT PARENTHESIS
964-
2090..209C ; AL # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
964+
2090..209F ; AL # Lm [16] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER Z
965965
20A0..20A6 ; PR # Sc [7] EURO-CURRENCY SIGN..NAIRA SIGN
966966
20A7 ; PO # Sc PESETA SIGN
967967
20A8..20B5 ; PR # Sc [14] RUPEE SIGN..CEDI SIGN
@@ -3413,7 +3413,7 @@ FFFD ; AI # So REPLACEMENT CHARACTER
34133413
1DF00..1DF09 ; AL # Ll [10] LATIN SMALL LETTER FENG DIGRAPH WITH TRILL..LATIN SMALL LETTER T WITH HOOK AND RETROFLEX HOOK
34143414
1DF0A ; AL # Lo LATIN LETTER RETROFLEX CLICK WITH RETROFLEX HOOK
34153415
1DF0B..1DF56 ; AL # L& [76] LATIN SMALL LETTER ESH WITH DOUBLE BAR..LATIN LETTER GLOTTAL STOP WITH DOUBLE STROKE
3416-
1DFD1..1DFFF ; AL # Lm [47] MODIFIER LETTER SMALL CAPITAL P..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
3416+
1DFD0..1DFFF ; AL # Lm [48] LATIN SUBSCRIPT SMALL LETTER GAMMA..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
34173417
1E000..1E006 ; CM # Mn [7] COMBINING GLAGOLITIC LETTER AZU..COMBINING GLAGOLITIC LETTER ZHIVETE
34183418
1E008..1E018 ; CM # Mn [17] COMBINING GLAGOLITIC LETTER ZEMLJA..COMBINING GLAGOLITIC LETTER HERU
34193419
1E01B..1E021 ; CM # Mn [7] COMBINING GLAGOLITIC LETTER SHTA..COMBINING GLAGOLITIC LETTER YATI

unicodetools/data/ucd/dev/NormalizationTest.txt

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# NormalizationTest-18.0.0.txt
2-
# Date: 2025-11-27, 16:49:40 GMT
2+
# Date: 2025-11-27, 17:33:43 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1233,6 +1233,9 @@ FEFA 0334;FEFA 0334;FEFA 0334;0644 0625 0334;0644 0627 0334 0655; # (ﻺ◌̴;
12331233
209A;209A;209A;0070;0070; # (ₚ; ₚ; ₚ; p; p; ) LATIN SUBSCRIPT SMALL LETTER P
12341234
209B;209B;209B;0073;0073; # (ₛ; ₛ; ₛ; s; s; ) LATIN SUBSCRIPT SMALL LETTER S
12351235
209C;209C;209C;0074;0074; # (ₜ; ₜ; ₜ; t; t; ) LATIN SUBSCRIPT SMALL LETTER T
1236+
209D;209D;209D;0077;0077; # (₝; ₝; ₝; w; w; ) LATIN SUBSCRIPT SMALL LETTER W
1237+
209E;209E;209E;0079;0079; # (₞; ₞; ₞; y; y; ) LATIN SUBSCRIPT SMALL LETTER Y
1238+
209F;209F;209F;007A;007A; # (₟; ₟; ₟; z; z; ) LATIN SUBSCRIPT SMALL LETTER Z
12361239
20A8;20A8;20A8;0052 0073;0052 0073; # (₨; ₨; ₨; Rs; Rs; ) RUPEE SIGN
12371240
2100;2100;2100;0061 002F 0063;0061 002F 0063; # (℀; ℀; ℀; a/c; a/c; ) ACCOUNT OF
12381241
2101;2101;2101;0061 002F 0073;0061 002F 0073; # (℁; ℁; ℁; a/s; a/s; ) ADDRESSED TO THE SUBJECT
@@ -16293,6 +16296,7 @@ FFEE;FFEE;FFEE;25CB;25CB; # (○; ○; ○; ○; ○; ) HALFWIDTH WHITE CIRCLE
1629316296
1D7FD;1D7FD;1D7FD;0037;0037; # (𝟽; 𝟽; 𝟽; 7; 7; ) MATHEMATICAL MONOSPACE DIGIT SEVEN
1629416297
1D7FE;1D7FE;1D7FE;0038;0038; # (𝟾; 𝟾; 𝟾; 8; 8; ) MATHEMATICAL MONOSPACE DIGIT EIGHT
1629516298
1D7FF;1D7FF;1D7FF;0039;0039; # (𝟿; 𝟿; 𝟿; 9; 9; ) MATHEMATICAL MONOSPACE DIGIT NINE
16299+
1DFD0;1DFD0;1DFD0;0263;0263; # (𝿐; 𝿐; 𝿐; ɣ; ɣ; ) LATIN SUBSCRIPT SMALL LETTER GAMMA
1629616300
1DFD1;1DFD1;1DFD1;1D18;1D18; # (𝿑; 𝿑; 𝿑; ᴘ; ᴘ; ) MODIFIER LETTER SMALL CAPITAL P
1629716301
1DFD2;1DFD2;1DFD2;0180;0180; # (𝿒; 𝿒; 𝿒; ƀ; ƀ; ) MODIFIER LETTER SMALL B WITH STROKE
1629816302
1DFD3;1DFD3;1DFD3;0111;0111; # (𝿓; 𝿓; 𝿓; đ; đ; ) MODIFIER LETTER SMALL D WITH STROKE

unicodetools/data/ucd/dev/PropList.txt

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# PropList-18.0.0.txt
2-
# Date: 2025-11-27, 16:49:45 GMT
2+
# Date: 2025-11-27, 17:33:47 GMT
33
# © 2025 Unicode®, Inc.
44
# Unicode and the Unicode Logo are registered trademarks of Unicode, Inc. in the U.S. and other countries.
55
# For terms of use and license, see https://www.unicode.org/terms_of_use.html
@@ -1240,7 +1240,7 @@ FF70 ; Extender # Lm HALFWIDTH KATAKANA-HIRAGANA PROLONGED SOUND
12401240
1D9B..1DBF ; Other_Lowercase # Lm [37] MODIFIER LETTER SMALL TURNED ALPHA..MODIFIER LETTER SMALL THETA
12411241
2071 ; Other_Lowercase # Lm SUPERSCRIPT LATIN SMALL LETTER I
12421242
207F ; Other_Lowercase # Lm SUPERSCRIPT LATIN SMALL LETTER N
1243-
2090..209C ; Other_Lowercase # Lm [13] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER T
1243+
2090..209F ; Other_Lowercase # Lm [16] LATIN SUBSCRIPT SMALL LETTER A..LATIN SUBSCRIPT SMALL LETTER Z
12441244
2170..217F ; Other_Lowercase # Nl [16] SMALL ROMAN NUMERAL ONE..SMALL ROMAN NUMERAL ONE THOUSAND
12451245
24D0..24E9 ; Other_Lowercase # So [26] CIRCLED LATIN SMALL LETTER A..CIRCLED LATIN SMALL LETTER Z
12461246
2C7C..2C7D ; Other_Lowercase # Lm [2] LATIN SUBSCRIPT SMALL LETTER J..MODIFIER LETTER CAPITAL V
@@ -1254,10 +1254,10 @@ AB69 ; Other_Lowercase # Lm MODIFIER LETTER SMALL TURNED W
12541254
10783..10785 ; Other_Lowercase # Lm [3] MODIFIER LETTER SMALL AE..MODIFIER LETTER SMALL B WITH HOOK
12551255
10787..107B0 ; Other_Lowercase # Lm [42] MODIFIER LETTER SMALL DZ DIGRAPH..MODIFIER LETTER SMALL V WITH RIGHT HOOK
12561256
107B2..107BF ; Other_Lowercase # Lm [14] MODIFIER LETTER SMALL CAPITAL Y..MODIFIER LETTER SMALL ESH WITH DOUBLE BAR
1257-
1DFD1..1DFFF ; Other_Lowercase # Lm [47] MODIFIER LETTER SMALL CAPITAL P..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
1257+
1DFD0..1DFFF ; Other_Lowercase # Lm [48] LATIN SUBSCRIPT SMALL LETTER GAMMA..MODIFIER LETTER SMALL T WITH HOOK AND RETROFLEX HOOK
12581258
1E030..1E06D ; Other_Lowercase # Lm [62] MODIFIER LETTER CYRILLIC SMALL A..MODIFIER LETTER CYRILLIC SMALL STRAIGHT U WITH STROKE
12591259

1260-
# Total code points: 367
1260+
# Total code points: 371
12611261

12621262
# ================================================
12631263

0 commit comments

Comments
 (0)