Description
Describe the bug
Thanks for this library; it is very helpful. I am seeing a strange issue: when parsing this PDF, every line is converted into a header element instead of sentences/paragraphs. The same issue occurs with the original repo as well (tested here: https://pdf2md.morethan.io/).
To Reproduce
Steps to reproduce the behavior:
- Call @opendocsg/pdf2md from the command line with the above file as input
- Check the output
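For the "check the output" step, a small script can quantify how much of the generated Markdown ended up as headers. This is only a minimal sketch and not part of pdf2md: the function name `heading_ratio` and the sample text are hypothetical, and it assumes the output uses ATX-style (`#`) headings.

```python
import re

# ATX heading: up to three leading spaces, 1-6 '#' characters,
# then whitespace or end of line.
HEADING_RE = re.compile(r"^ {0,3}#{1,6}(?:\s|$)")


def heading_ratio(markdown: str) -> float:
    """Return the fraction of non-blank lines that are ATX headings."""
    lines = [line for line in markdown.splitlines() if line.strip()]
    if not lines:
        return 0.0
    headings = sum(1 for line in lines if HEADING_RE.match(line))
    return headings / len(lines)


# Hypothetical sample resembling the reported symptom:
# body text emitted as headings.
sample = (
    "# Selecting the right amount of information\n"
    "## to include in a summary is a difficult task.\n"
    "Plain paragraph line.\n"
)
print(f"{heading_ratio(sample):.2f}")
```

A ratio close to 1.0 on a document that is mostly body text would match the behavior described in this report.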
Expected behavior
Some of the text in the PDF, for example "Selecting the “right” amount of information to include in a summary is a difficult task. A good...", should not be treated as a header.
Desktop (please complete the following information):
- OS: macOS
- Browser: NA
- Version: NA
Additional context
Since this issue also exists in the original repo, it would be great if you could point me to how to resolve it. Much appreciated!