1 file changed, +4 -5 lines changed

@@ -39,8 +39,8 @@ This is generally faster than `pdfminer.six`. You can often make it
 even faster on large documents by running in parallel with the
 `max_workers` argument, which is the same as the one you will find in
 `concurrent.futures.ProcessPoolExecutor`. If you pass `None` it will
-use all your CPUs, but due to some unfortunate overhead (which will be
-fixed soon) this isn't so great, so 2-4 workers is best:
+use all your CPUs, but due to some unavoidable overhead, it usually
+doesn't help to use more than 2-4:
 
 ```
 for page in extract(path, laparams, max_workers=2):
@@ -116,9 +116,8 @@ from paves.bears import SCHEMA
 df = polars.DataFrame(extract(path), schema=SCHEMA)
 ```
 
-As above, you can use multiple CPUs with `max_workers`, though this
-will scale considerably better since the objects are (mostly) easily
-serializable.
+As above, you can use multiple CPUs with `max_workers`, and this will
+scale considerably better than `paves.miner`.
 
 ## License
 
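
For readers of this diff, the revised `max_workers` advice in the first hunk can be sketched as a small helper. This is a hypothetical illustration and not part of the change: the name `pick_workers` and its default cap of 4 are assumptions drawn from the "2-4" guidance in the new text, and the commented call to `extract` simply mirrors the signature shown in the hunk.

```python
import os


def pick_workers(cap: int = 4) -> int:
    """Return a worker count between 1 and `cap`, never above the CPU count.

    Hypothetical helper: the default cap of 4 reflects the "2-4" guidance
    in the documentation text, not a value defined by the library.
    """
    if cap < 1:
        raise ValueError("cap must be at least 1")
    cpus = os.cpu_count() or 1  # os.cpu_count() may return None
    return max(1, min(cap, cpus))


# Usage, mirroring the call shown in the diff
# (path and laparams come from your own code):
# for page in extract(path, laparams, max_workers=pick_workers()):
#     ...
```

Passing an explicit small number instead of `None` keeps the pool from spawning one process per CPU, which is the case the new wording says usually does not help.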