-
Notifications
You must be signed in to change notification settings - Fork 46
Open
Labels
Description
We could provide a new extract-tables subcommand, thats uses the camelot to extract tables from PDF files.
The PR implementing this should include:
- unit tests
- documentation: docstrings & a new page in
docs/user/ - the command output should display the
.parsing_reportfromcamelot - it should be possible to target specific PDF pages
- various export options should be possible, using the corresponding
camelotmethods: to_csv() , to_json(), to_excel(), to_html(), to_markdown() & to_sqlite(). - other options could be implemented immediately or in further PRs:
--passwordfor decryption,--flavor,--parallel,--split-text