@@ -6,10 +6,11 @@ There are already too many PDF libraries, unfortunately none of which
 does everything that everybody wants it to do, and we probably don't
 need another one. It is not recommended that you use this library for
 anything at all, but if you were going to use it for something, it
-would be specifically one of these things and nothing else:
+might be one of these things, which you may currently be doing with
+pdfminer.six, for instance:
 
 1. Accessing the document catalog, page tree, structure tree, outline,
-   content streams, cross-reference table, XObjects, fonts,
+   content streams, cross-reference table, XObjects, fonts, images,
    annotations, and other low-level PDF metadata.
 2. Obtaining the absolute position and attributes of every character,
    line, path, and image in every page of a PDF.
@@ -25,7 +26,7 @@ API from pdfminer.six but without the
 `NumerousLines.of(FrustratingBoilerplate())` that it takes just get
 the g\*sh-d\*rned layout out of a PDF. (it does other things, too)
 
-The purpose of PLAYA is to provide an efficent, parallel and
+The purpose of PLAYA is to provide a robust, efficent, parallel and
 parallelizable, pure-Python and Pythonic (for its author's definition
 of the term), lazy interface to the internals of PDF files.
 
@@ -47,7 +48,8 @@ metadata):
     playa --text fascinating-research-paper.pdf
     playa --text-objects colorful-presentation.pdf
 
-Or images, in JPEG and PNM format (may not work for all images):
+Or images, in JPEG and PNM (or sometimes TIFF) format (may not work
+for all images):
 
     playa --images imagedir splashy-resume.pdf
 