File tree
8 files changed
+42
-22
lines changed- .github/workflows
- docs/source
- test_unstructured/nlp
- unstructured
- nlp
8 files changed
+42
-22
lines changedDiff for: .github/workflows/ci.yml
-1
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
103 | 103 |
| |
104 | 104 |
| |
105 | 105 |
| |
106 |
| - | |
107 | 106 |
| |
108 | 107 |
| |
109 | 108 |
| |
|
Diff for: CHANGELOG.md
+4
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
1 | 5 |
| |
2 | 6 |
| |
3 | 7 |
| |
|
Diff for: Makefile
-1
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
36 | 36 |
| |
37 | 37 |
| |
38 | 38 |
| |
39 |
| - | |
40 | 39 |
| |
41 | 40 |
| |
42 | 41 |
| |
|
Diff for: README.md
-4
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
62 | 62 |
| |
63 | 63 |
| |
64 | 64 |
| |
65 |
| - | |
66 |
| - | |
67 |
| - | |
68 |
| - | |
69 | 65 |
| |
70 | 66 |
| |
71 | 67 |
| |
|
Diff for: docs/source/installing.rst
-15
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
16 | 16 |
| |
17 | 17 |
| |
18 | 18 |
| |
19 |
| - | |
20 |
| - | |
21 |
| - | |
22 |
| - | |
23 | 19 |
| |
24 | 20 |
| |
25 | 21 |
| |
| |||
141 | 137 |
| |
142 | 138 |
| |
143 | 139 |
| |
144 |
| - | |
145 |
| - | |
146 |
| - | |
147 |
| - | |
148 |
| - | |
149 |
| - | |
150 |
| - | |
151 |
| - | |
152 |
| - | |
153 |
| - | |
154 |
| - | |
155 | 140 |
| |
156 | 141 |
| |
157 | 142 |
| |
|
Diff for: test_unstructured/nlp/test_tokenize.py
+19
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
| 2 | + | |
| 3 | + | |
| 4 | + | |
2 | 5 |
| |
3 | 6 |
| |
4 | 7 |
| |
5 | 8 |
| |
6 | 9 |
| |
7 | 10 |
| |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
8 | 27 |
| |
9 | 28 |
| |
10 | 29 |
| |
|
Diff for: unstructured/__version__.py
+1-1
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 |
| - | |
| 1 | + |
Diff for: unstructured/nlp/tokenize.py
+18
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
7 | 7 |
| |
8 | 8 |
| |
9 | 9 |
| |
| 10 | + | |
10 | 11 |
| |
11 | 12 |
| |
12 | 13 |
| |
| |||
16 | 17 |
| |
17 | 18 |
| |
18 | 19 |
| |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
19 | 37 |
| |
20 | 38 |
| |
21 | 39 |
| |
|
0 commit comments