Skip to content

Clairty around intended reading of "character" in section 4.2 and elsewhere (v1.6) #89

@aragilar

Description

@aragilar

The width attribute is meant to indicate the number of characters to be used for display of the quantity. This has special significance when used with character data, see below.

seems to imply "character" is the number of graphemes in a string (especially since this is in a section discussing display not storage), whereas later

When used with datatype="char" arrays the width attribute is interpreted as an upper bound for the array length in characters, i.e. Unicode code points.

implies "character" is code points.

In the case of multi-dimensional char arrays the value refers to the number of characters in each string element.

could mean in bytes, code points or graphemes (I think the implied reading is code points, but I'm not sure).

I think maybe the best thing to do is find every instance of character and replace it with the appropriate unicode term (see https://stackoverflow.com/questions/27331819/whats-the-difference-between-a-character-a-code-point-a-glyph-and-a-grapheme). I can try to find some time to make a PR with the suggested changes?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions