MonlamUniOuChan1 maps to Tai Tham script instead of Tibetan F/IE4CZ369298 #4

@10zYmon

Description

Folder name: F/IE4CZ369298
File name: TI906-01-001.pdf
Fonts: MonlamUniOuChan1

When pdf-cmap-fix is run on this PDF, the tool detects the font and reports 2588 GID upgrades, but the text output remains identical to the raw extraction (no changes).

Before extraction: refer to the screenshot.
Image

After extraction:
༸ᩃ᩶་ᩲ᪱᫥་ᩖ᩶་᨜ᨑ་ᩂ᫞་᪍ᨕ་᩶᪍᫞་ᨵ᫥ᩏ་᪠་᫕᩹᫥་᪍᫕ᨑ་
᪠ᨪᨓ᩶་᫙ᨑ᪱་ᨹᨑ་᪇᪱་᪠᫡᩶་᩶ᩏᨕ᫥་ᨼ᪠་᨜ᨑ་
ᩂ་᫄᪱ᨕ᫕ᨑ་ᩥᨑᩏ་᪍ᨕ་ᫍᨓ᫥་བྱ་བའི་
དཀར་ཆག
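
The symptom can be checked programmatically by counting how many extracted characters fall in the Tibetan block (U+0F00 to U+0FFF) versus the Tai Tham block (U+1A20 to U+1AAF). The sketch below is a standalone diagnostic, not part of pdf-cmap-fix; the function name `count_scripts` is made up for illustration.

```python
# Unicode block ranges (inclusive).
TIBETAN = (0x0F00, 0x0FFF)
TAI_THAM = (0x1A20, 0x1AAF)


def count_scripts(text):
    """Count non-whitespace characters per Unicode block."""
    counts = {"tibetan": 0, "tai_tham": 0, "other": 0}
    for ch in text:
        if ch.isspace():
            continue
        cp = ord(ch)
        if TIBETAN[0] <= cp <= TIBETAN[1]:
            counts["tibetan"] += 1
        elif TAI_THAM[0] <= cp <= TAI_THAM[1]:
            counts["tai_tham"] += 1
        else:
            # Anything outside both blocks, e.g. neighbouring ranges.
            counts["other"] += 1
    return counts
```

For the last output line, `count_scripts("དཀར་ཆག")` reports 6 Tibetan characters and nothing else. A correctly mapped page should look like that throughout, so a non-zero `tai_tham` or `other` count flags text that still needs remapping.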

Subtasks

  • build custom lookup dictionary
  • run test conversion
  • document the output result
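
The first two subtasks could be prototyped roughly as follows: a lookup dictionary from wrongly extracted code points to Tibetan ones, applied with `str.translate`, plus a helper that lists characters the dictionary does not cover yet. The helper names and the single dictionary entry are placeholders, not a verified mapping for MonlamUniOuChan1. The real pairs would have to come from comparing the PDF glyphs against the extracted text.

```python
def apply_lookup(text, lookup):
    """Replace code points according to lookup ({wrong_cp: right_cp})."""
    return text.translate(lookup)


def unmapped(text, lookup):
    """Return sorted non-Tibetan, non-whitespace chars missing from lookup."""
    return sorted({
        ch for ch in text
        if not ch.isspace()
        and not (0x0F00 <= ord(ch) <= 0x0FFF)  # already in the Tibetan block
        and ord(ch) not in lookup
    })


# Placeholder entry only (hypothetical pair, NOT a confirmed mapping).
EXAMPLE_LOOKUP = {0x1A43: 0x0F42}
```

Running `unmapped` on the converted text after each dictionary update gives a simple list of what is still missing, which could also feed the "document the output result" step.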

reviewer

Status

Done
