For tags, switch from Unicode “letter” to “alphabetic” for allowed characters? #64
jotaen
started this conversation in
File Format
Replies: 1 comment
-
|
tldr; I’m closing this for now, as I think the current definition is sufficient, especially considering that the “letter” definition is only relevant for tags, but not for any other text. The Unicode categories are defined as:
So, “alphabetic” is a superset of “letter”, that adds
For tags, I’m not sure that any of these additional categories would be useful. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
A “letter” (the allowed character class for tags) is currently defined as “a character from the Unicode Letter category (L)”.
This comment suggests that it would be better to use the “alphabetic” category instead, which appears to be a superset of “letter”.
A brief Google search didn’t yield too much substantial insight into the practical differences between the two categories, so I need to look into this more closely.
Beta Was this translation helpful? Give feedback.
All reactions