Open
Description
As a developer, I want to prototype a solution using pdf2htmlEX to convert PDFs to HTML and process the output with Nokogiri, so that I can determine if this approach produces structured and accessible HTML consistently.
Details on decision narratives here (including mock letter, and examination of 961 letter content).
Hypothetical implementation here.
Acceptance Criteria
- The service accepts a PDF file as input.
- pdf2htmlEX successfully converts the PDF to HTML while preserving the layout structure.
- Nokogiri processes the HTML to:
  - Mark up headings (h1, h2) based on font size.
  - Mark up lists (ul, ol, li).
  - Structure tables with the appropriate table elements.
  - Ensure images have alt text.
  - Remove absolute positioning styles for accessibility.
- The service returns well-structured HTML output.
- The output is evaluated for consistency across multiple decision narrative PDFs.
- This prototype is compared to the others and a recommendation is made.