Open
Description
Describe the bug
When specifying output_format
as csv
, the response from the api is different when split_pdf_page
is True
or False
. When False
, the elements contain an extra metadata field: text_as_html
. This also means the element id does not match.
To Reproduce
_test_unstructured_client/integration/test_decorators.py::test_integration_split_csv_response
illustrates this, but is passing because it asserts on a shortened string.
Expected behavior
The response to be identical whether or not split_pdf_page
is True
or False
.