Commit 080ccfa
authored
fix: Set OMP_THREAD_LIMIT for better tesseract performance (#185)
I've spent some time playing with this var, and I came up with [this
gist](https://gist.github.com/awalker4/8581d76d373c1bc51e0f2676a6ad816c).
I ran this on a 4 core EC2 instance. Processing 3 pages without the
limit takes 153s. With the limit is 5s 😍 . When the number of pages is
higher than number of cores, it just hangs without this var.1 parent 9b6aa8e commit 080ccfa
File tree
3 files changed
+11
-1
lines changed- unstructured_inference
- models
3 files changed
+11
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
1 | 5 | | |
2 | 6 | | |
3 | 7 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
1 | 2 | | |
2 | 3 | | |
3 | 4 | | |
| |||
9 | 10 | | |
10 | 11 | | |
11 | 12 | | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
12 | 18 | | |
13 | 19 | | |
14 | 20 | | |
| |||
0 commit comments