-
Notifications
You must be signed in to change notification settings - Fork 47
Open
Description
Hi there, does magic-doc also support outputting a structured json file (like in magic-pdf/minerU) which e.g shows the content this way:
[
{
"type": "text",
"text": "EXAMPLE TITLE ",
"text_level": 1,
"page_idx": 0
},
{
"type": "text",
"text": "SOME TEXT ABOUT THE DOC ",
"page_idx": 0
},
{
"type": "text",
"text": "SOME DETECTED SUBHEADER: ",
"text_level": 1,
"page_idx": 0
},
Would be great if one could get a output also this way, as this would allign well if one has a mix of pdfs and word documents and wants to get structured jsons for all instead of markdowns.
Metadata
Metadata
Assignees
Labels
No labels