
Bug: ASCII range mentioned as U+0001..U+007F, rather than U+0000..U+007F #2674

Open
@ezequiel-garzon

Describe the bug

Section 2.3, "Identifiers and keywords", of the Python language reference on python.org states:

Within the ASCII range (U+0001..U+007F), the valid characters for identifiers include the uppercase and lowercase letters A through Z, the underscore _ and, except for the first character, the digits 0 through 9.

But the ASCII range is U+0000..U+007F, not U+0001..U+007F. The documentation points to PEP 3131, where the same range is used.
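A quick sanity check, offered only as a sketch: it relies on the built-in `str.isascii()` and `str.isidentifier()` methods (Python 3.7+), and the variable names are illustrative, not taken from the documentation or the PEP. It suggests that U+0000 belongs to the ASCII range even though it is not valid in an identifier.

```python
import string

# Collect every code point below U+0100 that Python itself treats as ASCII.
ascii_range = [cp for cp in range(0x100) if chr(cp).isascii()]
first, last = ascii_range[0], ascii_range[-1]

# ASCII characters allowed as the first character of an identifier.
start_chars = {chr(cp) for cp in ascii_range if chr(cp).isidentifier()}

# ASCII characters allowed after the first character
# (checked by prefixing a known-valid start character).
continue_chars = {chr(cp) for cp in ascii_range if ("a" + chr(cp)).isidentifier()}

print(f"ASCII range: U+{first:04X}..U+{last:04X} ({len(ascii_range)} code points)")
print("U+0000 is ASCII:", chr(0).isascii())
print("U+0000 is valid in an identifier:", chr(0) in continue_chars)
print("start characters are letters and underscore:",
      start_chars == set(string.ascii_letters + "_"))
print("later characters also allow digits:",
      continue_chars == set(string.ascii_letters + string.digits + "_"))
```

Whether the range starts at U+0000 or U+0001 makes no difference to which characters are valid in identifiers, since NUL is excluded either way; the issue is only with how the range itself is labelled.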


To Reproduce

Visit the referenced webpage.

Expected behavior

Unicode's first block, listed with "Range: 0000..007F" (second line of the U0000 code chart), is named Basic Latin, or C0 Controls and Basic Latin. The official charts index page lists it as Basic Latin (ASCII), emphasizing the historical connection with the older standard.

Therefore, if it is deemed necessary to clarify the ASCII range at all, it should be U+0000..U+007F, not U+0001..U+007F.

URL to the issue

https://docs.python.org/3/reference/lexical_analysis.html#identifiers

Screenshots

No response

Browsers

Chrome

Operating System

macOS

Browser Version

No response

Relevant log output

No response

Additional context

No response

Labels

bug (This is a bug!)
