Skip to content

Add UTF-8 encoded string to TextStringObject #3493

@j-t-1

Description

@j-t-1

Explanation

The text string type shall be used for character strings that shall be encoded in PDFDocEncoding, the UTF-16BE Unicode character encoding scheme, or (PDF 2.0) the UTF-8 Unicode character encoding scheme.

PDF 2.0 specification.

TextStringObject does not have UTF-8, introduced in PDF 2.0.

Adding this will update the classTextStringObject to the PDF 2.0 standard. Currently the class ignores UTF-8, and may be categorizing it as PDFDocEncoding.

Metadata

Metadata

Assignees

No one assigned

    Labels

    needs-example-codeThe issue needs a minimal and complete (e.g. all imports) example showing the problem

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions