Skip to content

Commit 544441b

Browse files
committed
docs: mini update
1 parent 8efa005 commit 544441b

File tree

1 file changed

+4
-5
lines changed

1 file changed

+4
-5
lines changed

README.md

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -39,8 +39,8 @@ This is generally faster than `pdfminer.six`. You can often make it
3939
even faster on large documents by running in parallel with the
4040
`max_workers` argument, which is the same as the one you will find in
4141
`concurrent.futures.ProcessPoolExecutor`. If you pass `None` it will
42-
use all your CPUs, but due to some unfortunate overhead (which will be
43-
fixed soon) this isn't so great, so 2-4 workers is best:
42+
use all your CPUs, but due to some unavoidable overhead, it usually
43+
doesn't help to use more than 2-4:
4444

4545
```
4646
for page in extract(path, laparams, max_workers=2):
@@ -116,9 +116,8 @@ from paves.bears import SCHEMA
116116
df = polars.DataFrame(extract(path), schema=SCHEMA)
117117
```
118118

119-
As above, you can use multiple CPUs with `max_workers`, though this
120-
will scale considerably better since the objects are (mostly) easily
121-
serializable.
119+
As above, you can use multiple CPUs with `max_workers`, and this will
120+
scale considerably better than `paves.miner`.
122121

123122
## License
124123

0 commit comments

Comments
 (0)