Description
Description of the bug | 错误描述
unitable表格解析错误
How to reproduce the bug | 如何复现
mineru_v132_test | Traceback (most recent call last):
mineru_v132_test | File "/home/volumes/MinerU/src/pdf2md.py", line 100, in pdf2md
mineru_v132_test | mineru_interface(pdf_path, pdf_dir, table_parsing=table_parsing)
mineru_v132_test | File "/home/volumes/MinerU/src/mineru.py", line 21, in mineru_interface
mineru_v132_test | do_parse(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/tools/common.py", line 118, in do_parse
mineru_v132_test | infer_result = ds.apply(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/data/dataset.py", line 212, in apply
mineru_v132_test | return proc(self, *args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/doc_analyze_by_custom_model.py", line 197, in doc_analyze
mineru_v132_test | analyze_result = batch_model(images)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/batch_analyze.py", line 167, in call
mineru_v132_test | self.model.table_model.predict(new_image)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/sub_modules/table/rapidtable/rapid_table.py", line 63, in predict
mineru_v132_test | table_results = self.table_model(np.asarray(image), ocr_result)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/main.py", line 106, in call
mineru_v132_test | pred_structures, cell_bboxes, _ = self.table_structure(copy.deepcopy(img))
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
mineru_v132_test | return func(*args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/table_structure/table_structure_unitable.py", line 151, in call
mineru_v132_test | bboxes[:, 0::2] *= scale_w # 缩放 x 坐标
mineru_v132_test | IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed
mineru_v132_test | [2025-04-17 14:58:57,726] [ ERROR] pdf2md.py:120 - pdf file parser error: too many indices for array: array is 1-dimensional, but 2 were indexed
mineru_v132_test | Traceback (most recent call last):
mineru_v132_test | File "/home/volumes/MinerU/src/pdf2md.py", line 100, in pdf2md
mineru_v132_test | mineru_interface(pdf_path, pdf_dir, table_parsing=table_parsing)
mineru_v132_test | File "/home/volumes/MinerU/src/mineru.py", line 21, in mineru_interface
mineru_v132_test | do_parse(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/tools/common.py", line 118, in do_parse
mineru_v132_test | infer_result = ds.apply(
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/data/dataset.py", line 212, in apply
mineru_v132_test | return proc(self, *args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/doc_analyze_by_custom_model.py", line 197, in doc_analyze
mineru_v132_test | analyze_result = batch_model(images)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/batch_analyze.py", line 167, in call
mineru_v132_test | self.model.table_model.predict(new_image)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/magic_pdf/model/sub_modules/table/rapidtable/rapid_table.py", line 63, in predict
mineru_v132_test | table_results = self.table_model(np.asarray(image), ocr_result)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/main.py", line 106, in call
mineru_v132_test | pred_structures, cell_bboxes, _ = self.table_structure(copy.deepcopy(img))
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/torch/utils/_contextlib.py", line 115, in decorate_context
mineru_v132_test | return func(*args, **kwargs)
mineru_v132_test | File "/opt/mineru_venv/lib/python3.10/site-packages/rapid_table/table_structure/table_structure_unitable.py", line 151, in call
mineru_v132_test | bboxes[:, 0::2] *= scale_w # 缩放 x 坐标
mineru_v132_test | IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed
Operating system | 操作系统
Linux
Python version | Python 版本
3.10
Software version | 软件版本 (magic-pdf --version)
1.3.x
Device mode | 设备模式
cuda