You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
### How this is different from other text extraction tools
6
14
7
15
- Parses file based on mime type, not file extension
8
16
- Does not spawn a child process to use a tool installed on the device
9
17
- Reads and returns text from file if it is a simple text file
10
18
19
+
## Libraries used
20
+
21
+
This module uses some amazing existing libraries that perform better than the ones that originally existed in this module, and are therefore used instead:
22
+
23
+
-[`pdf-parse`](https://www.npmjs.com/package/pdf-parse), for parsing PDF files
24
+
-[`xlsx`](https://www.npmjs.com/package/xlsx), for parsing MS Excel files
25
+
26
+
A big thank you to the contributors of these projects.
27
+
28
+
This module also uses:
29
+
30
+
-[`xml2js`](https://www.npmjs.com/package/xml2js) - to convert the MS Office XML files into JSON
31
+
-[`js-yaml`](https://www.npmjs.com/package/js-yaml) - to convert JSON into YAML
32
+
-[`file-type`](https://www.npmjs.com/package/file-type) - to detect the mime type of files
33
+
-[`decompress`](https://www.npmjs.com/package/decompress) - to unzip files
34
+
-[`read-chunk`](https://www.npmjs.com/package/read-chunk) - to read chunks of data from large files
35
+
11
36
## Installation
12
37
13
38
To use this in an npm project, simply type in:
@@ -16,11 +41,7 @@ To use this in an npm project, simply type in:
16
41
npm install office-text-extractor
17
42
```
18
43
19
-
**Notes:**
20
-
21
-
- No support for browser environments yet. If you want to add support, please feel free to [open a pull request](https://github.com/gamemaker1/office-text-extractor/pulls).
22
-
- To parse PDFs, this module uses the amazing `pdf-parse` npm package.
23
-
- To parse Excel files, this module uses the amazing `xlsx` npm package.
44
+
**There is no support for browser environments yet. If you want to add support, please feel free to [open a pull request](https://github.com/gamemaker1/office-text-extractor/pulls).**
0 commit comments