several PDFs caused Qiqqa to run indefinitely after closing it

Continuation of #10, in a sense: different culprit, same pack of background tasks.

Now it turns out the old `pdfdraw -tt` (see also #34: this bugger has to *go*) locks up forever at max CPU on spurious / egregious PDFs. (🎅 isn't English language *fun* 🎅 ho ho ho! 🤡 )

That's the text extraction background process going b0rk b0rk b0rk on you. No way out but hard "kill process" for each of these.
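
For illustration only, here is a minimal sketch of a watchdog that would do that hard kill automatically once a child process blows its time budget. This is *not* Qiqqa's actual code (Qiqqa is C#); the helper name `run_with_timeout` and the timeout values are hypothetical, and the `pdfdraw` arguments in the usage comment are placeholders, not a verified command line.

```python
import subprocess


def run_with_timeout(cmd, timeout_s):
    """Run `cmd` (a list of argv strings) and hard-kill it if it overruns.

    Returns (finished, returncode): (True, rc) when the process exited
    on its own within `timeout_s` seconds, (False, None) when it had to
    be killed.
    """
    # Output is discarded here; a real extractor would capture or redirect it.
    proc = subprocess.Popen(
        cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    try:
        rc = proc.wait(timeout=timeout_s)
        return True, rc
    except subprocess.TimeoutExpired:
        # The child is stuck (e.g. spinning at max CPU): kill it, then reap it
        # so no zombie process is left behind.
        proc.kill()
        proc.wait()
        return False, None


# Hypothetical usage; the arguments are placeholders:
#   finished, rc = run_with_timeout(["pdfdraw", "-tt", "some.pdf"], 60.0)
#   if not finished:
#       ...  # flag the PDF as un-extractable instead of retrying forever
```

The point of the design is that one bad PDF costs a bounded amount of CPU time and then gets flagged, instead of leaving a process that outlives the application.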

## Targeted fix

Upgrade / migrate to [latest MuPDF `mudraw` hOCR or JSON STEXT output](https://mupdf.com/docs/manual-mutool-draw.html) -- the old `pdfdraw` that comes with current Qiqqa installs is an *antique* patched MuPDF tool (#34 + #35) and a lot has changed since then, including the relevant output format for extracted text.

As I *intend* to support more document types (via the hOCR/HTML fundamental format), Qiqqa should grok the new `pdfdraw -o *.ocr.html` or similar output. 

Also keep in mind the migration from the *antique* (obsoleted) LuceneNET version to SOLR / ElasticSearch: that's #23 + #298 + [Technology areas and their *function* in Qiqqa](https://github.com/jimmejardine/qiqqa-open-source/blob/master/docs-src/Progress%20in%20Development/Qiqqa%20Functionalities%20%26%20Technology%20Areas.md) + [Towards migrating the PDF viewer / renderer / text extractor](https://github.com/jimmejardine/qiqqa-open-source/blob/master/docs-src/Progress%20in%20Development/Towards%20Migrating%20the%20PDF%20Viewer%20%2B%20Renderer%20(%2B%20Text%20Extractor).md)

