Hi team,
I'm working on improving the diagram processing capabilities of our threat modeling application. I'm currently facing these challenges:
- OCR limitations in accurately extracting information from architecture diagrams
- Inconsistent results when processing PDF/JPG/PNG formats
- Loss of structural information when converting diagrams to text
I've heard that a DSL (domain-specific language) might be a better approach. I'm looking for recommendations on:
- Alternative approaches to diagram processing beyond OCR
- Best practices for maintaining diagram structure during processing
- Tools/libraries for parsing architecture diagrams programmatically
- Experience with DSL-based solutions (pros/cons)
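To make the DSL option concrete, here is a minimal sketch of the kind of thing I have in mind. It assumes a hypothetical line-based syntax (`Source -> Target : label`, one data flow per line); the syntax, the `Diagram` class, and the `parse_diagram` function are all illustrative names I made up, not part of any existing tool. The point is that nodes and edges stay explicit instead of being guessed from OCR output.

```python
from dataclasses import dataclass, field


@dataclass
class Diagram:
    """Explicit graph structure recovered from the (hypothetical) DSL text."""
    nodes: set = field(default_factory=set)
    edges: list = field(default_factory=list)  # (source, target, label) tuples


def parse_diagram(text: str) -> Diagram:
    """Parse lines of the form 'Source -> Target : label' (label optional)."""
    diagram = Diagram()
    for line_no, line in enumerate(text.splitlines(), start=1):
        stripped = line.strip()
        # Skip blank lines and '#' comments.
        if not stripped or stripped.startswith("#"):
            continue
        # Split off the optional label at the first colon.
        flow, _, label = stripped.partition(":")
        src, arrow, dst = flow.partition("->")
        src, dst, label = src.strip(), dst.strip(), label.strip()
        if not arrow or not src or not dst:
            raise ValueError(f"line {line_no}: cannot parse {stripped!r}")
        diagram.nodes.update((src, dst))
        diagram.edges.append((src, dst, label))
    return diagram


if __name__ == "__main__":
    example = """
    # Illustrative architecture, not our real one
    Browser -> API Gateway : HTTPS
    API Gateway -> Lambda
    """
    parsed = parse_diagram(example)
    for source, target, label in parsed.edges:
        print(f"{source} -> {target} [{label or 'unlabeled'}]")
```

The resulting node and edge lists could then be passed to the model as structured text rather than as an image, but I don't know yet whether this holds up for real diagrams.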
Current stack:
- Python/AWS Bedrock/Claude
- Image processing: Pillow, pytesseract
- Input formats: PDF, JPG, PNG
I'd appreciate any insights from those who've tackled similar challenges.