Skip to content

unitable表格解析错误 #2270

Closed as not planned
Closed as not planned
@jylong91

Description

@jylong91

Description of the bug | 错误描述

unitable表格解析错误

How to reproduce the bug | 如何复现

mineru_v132_test | Traceback (most recent call last):
mineru_v132_test | File "/home/volumes/MinerU/src/pdf2md.py", line 100, in pdf2md
mineru_v132_test | mineru_interface(pdf_path, pdf_dir, table_parsing=table_parsing)
mineru_v132_test | File "/home/volumes/MinerU/src/mineru.py", line 21, in mineru_interface
mineru_v132_test | do_parse(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/tools/common.py", line 118, in do_parse
mineru_v132_test | infer_result = ds.apply(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/data/dataset.py", line 212, in apply
mineru_v132_test | return proc(self, *args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/doc_analyze_by_custom_model.py", line 197, in doc_analyze
mineru_v132_test | analyze_result = batch_model(images)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/batch_analyze.py", line 167, in call
mineru_v132_test | self.model.table_model.predict(new_image)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/sub_modules/table/rapidtable/rapid_table.py", line 63, in predict
mineru_v132_test | table_results = self.table_model(np.asarray(image), ocr_result)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/main.py", line 106, in call
mineru_v132_test | pred_structures, cell_bboxes, _ = self.table_structure(copy.deepcopy(img))
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
mineru_v132_test | return func(*args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/table_structure/table_structure_unitable.py", line 151, in call
mineru_v132_test | bboxes[:, 0::2] *= scale_w # 缩放 x 坐标
mineru_v132_test | IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed
mineru_v132_test | [2025-04-17 14:58:57,726] [ ERROR] pdf2md.py:120 - pdf file parser error: too many indices for array: array is 1-dimensional, but 2 were indexed
mineru_v132_test | Traceback (most recent call last):
mineru_v132_test | File "/home/volumes/MinerU/src/pdf2md.py", line 100, in pdf2md
mineru_v132_test | mineru_interface(pdf_path, pdf_dir, table_parsing=table_parsing)
mineru_v132_test | File "/home/volumes/MinerU/src/mineru.py", line 21, in mineru_interface
mineru_v132_test | do_parse(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/tools/common.py", line 118, in do_parse
mineru_v132_test | infer_result = ds.apply(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/data/dataset.py", line 212, in apply
mineru_v132_test | return proc(self, *args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/doc_analyze_by_custom_model.py", line 197, in doc_analyze
mineru_v132_test | analyze_result = batch_model(images)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/batch_analyze.py", line 167, in call
mineru_v132_test | self.model.table_model.predict(new_image)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/sub_modules/table/rapidtable/rapid_table.py", line 63, in predict
mineru_v132_test | table_results = self.table_model(np.asarray(image), ocr_result)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/main.py", line 106, in call
mineru_v132_test | pred_structures, cell_bboxes, _ = self.table_structure(copy.deepcopy(img))
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
mineru_v132_test | return func(*args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/table_structure/table_structure_unitable.py", line 151, in call
mineru_v132_test | bboxes[:, 0::2] *= scale_w # 缩放 x 坐标
mineru_v132_test | IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed

Operating system | 操作系统

Linux

Python version | Python 版本

3.10

Software version | 软件版本 (magic-pdf --version)

1.3.x

Device mode | 设备模式

cuda

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions