Skip to content

error in loading other document #2

@simjak

Description

@simjak

Hey, thanks for awesome doc toolkit.

I tried to run pdf_path = "tests/test_files/direct_extract/single_column.pdf"

and got a following error:

2024-11-02 17:47:58,569 - rapid_layout - INFO: pp_layout_cdla contains ['text', 'title', 'figure', 'figure_caption', 'table', 'table_caption', 'header', 'footer', 'reference', 'equation']
  0%|                                                                                                                                                     | 0/1 [00:00<?, ?it/s]
Traceback (most recent call last):
  File "/Users/jakit/simonas/open-source/RapidDoc/demo.py", line 13, in <module>
    result = pdf_parser(pdf_path)
             ^^^^^^^^^^^^^^^^^^^^
  File "/Users/jakit/simonas/open-source/RapidDoc/rapid_doc/main.py", line 74, in __call__
    txt_boxes, txts = self.run_direct_extract(i, img_width)
                      ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/Users/jakit/simonas/open-source/RapidDoc/rapid_doc/main.py", line 105, in run_direct_extract
    txt_boxes, txts = self.pdf_extracter.extract_page_text(page_num, img_width)
    ^^^^^^^^^^^^^^^
ValueError: too many values to unpack (expected 2)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions