feat/Allow max-pages/max-total-characters that should be parsed #3137

Open
@abdofallah

Description

My service accepts only 20k characters, which is around 6 pages of a PDF file. If someone uploads a 200+ page PDF, it takes 6 minutes to process, and only after that can I check how many characters are in the file.

Is there a feature in unstructured that stops processing automatically once a given number of total characters has been reached (and indicates in the response that the file was cut off and not fully processed)?
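
To make the request concrete, here is a minimal sketch of the behavior being asked for. It is not an existing unstructured option: `take_until_char_limit` is a hypothetical helper, and it assumes the text arrives as a lazy iterable (for example, one string per page) so that stopping early actually skips the remaining work.

```python
from typing import Iterable, List, Tuple


def take_until_char_limit(
    texts: Iterable[str], max_chars: int
) -> Tuple[List[str], bool]:
    """Collect texts until the character budget is used up (hypothetical helper).

    Returns (kept_texts, truncated). `truncated` is True when some input
    was cut short or never consumed because the budget ran out.
    """
    kept: List[str] = []
    total = 0
    for text in texts:
        remaining = max_chars - total
        if len(text) > remaining:
            # Keep only the part that still fits, then stop pulling from
            # the source so later items are never produced.
            if remaining > 0:
                kept.append(text[:remaining])
            return kept, True
        kept.append(text)
        total += len(text)
    return kept, False


# Usage with a stand-in for a lazy per-page extractor (assumed, not real API).
def fake_pages():
    for i in range(200):
        yield f"page {i} text "


kept, truncated = take_until_char_limit(fake_pages(), max_chars=40)
print(sum(len(t) for t in kept), truncated)
```

The `truncated` flag corresponds to the "file was cut off" indicator asked for above. If the input is already a fully parsed list, this only trims the result and does not save any processing time.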

Labels

chunking (Related to element chunking.), enhancement (New feature or request)
