-
Notifications
You must be signed in to change notification settings - Fork 904
Issues: Unstructured-IO/unstructured
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Table detection from PDF document not accurate
bug
Something isn't working
pdf
#3804
opened Dec 2, 2024 by
kmrspace
broken inference source code for 'hi_res', AttributeError: 'list' object has no attribute 'element_coords', the same code worked with previous versions of unstructured
bug
Something isn't working
pdf
#3718
opened Oct 13, 2024 by
Arslan-Mehmood1
bug/'text_as_html' result contains few incorrect/invalid characters
bug
Something isn't working
pdf
#3523
opened Aug 14, 2024 by
MuruganDurai
feat/pdf -> words being split across lines due to hyphenation
enhancement
New feature or request
pdf
#3486
opened Aug 5, 2024 by
ajpanyteam
feat/Understand and correctly parse ligatures
enhancement
New feature or request
pdf
#3471
opened Aug 2, 2024 by
jocubeit
bug/text-as-html-missing-content
bug
Something isn't working
pdf
#3358
opened Jul 8, 2024 by
mpolomdeepsense
bug/Two Column PDF partition result in incorrect text.
bug
Something isn't working
pdf
#3325
opened Jun 28, 2024 by
pfcharles
feat/bbox_scaling_parameter
enhancement
New feature or request
pdf
#3235
opened Jun 18, 2024 by
LesykDev
feat/Add page range to partition functions
enhancement
New feature or request
pdf
#3231
opened Jun 18, 2024 by
ChiNoel-osu
feat/table element coordinates
enhancement
New feature or request
pdf
#3175
opened Jun 10, 2024 by
naunidh-tetrix
Add manual coordinate constraints to New feature or request
pdf
partition_pdf()
.
enhancement
#3072
opened May 22, 2024 by
ChiNoel-osu
Problems when I parsing Chineses PDF documents
bug
Something isn't working
pdf
#2999
opened May 10, 2024 by
WangJiaxin-x
feat/add New feature or request
pdf
extract_image_block_output_dir
to partition_via_api
enhancement
#2833
opened Apr 2, 2024 by
awalker4
feat/ extract style or font for Text elements.
enhancement
New feature or request
pdf
#2695
opened Mar 26, 2024 by
LunaticMaestro
bug: correctly combine words spanning multiple lines
bug
Something isn't working
pdf
#2234
opened Dec 7, 2023 by
Coniferish
feat/Guard against excessive memory usage when partitioning PDFs
enhancement
New feature or request
pdf
#2129
opened Nov 20, 2023 by
flash1293
ProTip!
Adding no:label will show everything without a label.