0.16.24
0.16.24
Enhancements
-
Support dynamic partitioner file type registration. Use
create_file_type
to create new file type that can be handled
in unstructured andregister_partitioner
to enable registering your own partitioner for any file type. -
extract_image_block_types
now also works for CamelCase elemenet type names. PreviouslyNarrativeText
and similar CamelCase element types can't be extracted using the mentioned parameter inpartition
. Now figures for those elements can be extracted likeImage
andTable
elements -
use block matrix to reduce peak memory usage for pdf/image partition.
Features
- Add JSON elements to HTML converter - Converts JSON elements file into an HTML file.