Merged
31 commits
322b37e
Implementation of HTML backend that uses headless browser (via playwr…
Feb 9, 2026
72b0e69
Added support for input HTML tag (text and checkboxes) into Docling H…
Feb 9, 2026
b68f372
adding key-value extraction for modified HTML backend
Feb 17, 2026
af74209
Allowing out of DOM scope values for the keys that don't have any oth…
Feb 17, 2026
8b9d52f
improved key-value filtering, added padding to the pages as a parameter
Feb 17, 2026
73b9352
Tight bboxes around text when extracting key-values, correct page siz…
Feb 17, 2026
603b0bb
Updated logic for kvp extraction in html_backend, updated example
Feb 20, 2026
1b0657c
Moved playwright into optional dependency
Feb 20, 2026
e9e7b7d
1. Updated example on how to use HTML backend with page rendering; 2.…
Feb 23, 2026
9b96c19
fixes in mp HTML backend example
Feb 23, 2026
de0f352
First implementation of new KV standard from docling-core, WIP
Feb 27, 2026
2ea4a73
Fixed tables, and added value extraction for the HTML input elements …
Feb 27, 2026
b2a5140
HTML_backend: Added support of markers for field_items, respected rea…
Mar 2, 2026
a88b94f
multiple fixes of kvp handling in html_backend, corrected MP example
Mar 3, 2026
3af6184
Fixing reading order within field_region by including also other elem…
Mar 4, 2026
aa985d3
Added heuristic to identify checkbox labels
Mar 5, 2026
07cd511
Extracting checkbox labels and placing them into docling checkboxes, …
Mar 5, 2026
bb8d528
Many fixes for the proper bounding box extraction in case of elements…
Mar 5, 2026
cc12113
Multiple improvements for KVP extraction heuristics
Mar 6, 2026
aff5855
Fixes many issues with reading order / checkboxes / fillable fields i…
Mar 10, 2026
a467dfc
Fixes for linter
Mar 10, 2026
1c0ea7d
Updated multi-processed example of html backend with page rendering f…
Mar 16, 2026
510675a
Support for inline groups for the inline type of content tags: strong…
Mar 16, 2026
09b182d
Updated test cases
Mar 16, 2026
299124e
Updated to latest docling-core, updated tests
Mar 16, 2026
a60741d
fix for mypy
Mar 16, 2026
afb89b4
Added HTML example with KVP
Mar 16, 2026
57d2dfb
Updated all tests according to latest docling-core changes
Mar 23, 2026
23ba52f
Fixes to address PR reviews
Mar 24, 2026
285e8a8
Addressed issue with missing inline groups for formatting elements, l…
Mar 24, 2026
001f95c
re-generated test case of mlac-20251231.xml
Mar 24, 2026
2 changes: 1 addition & 1 deletion .github/workflows/checks.yml
@@ -20,7 +20,7 @@ env:
tests/test_asr_pipeline.py
tests/test_threaded_pipeline.py
PYTEST_TO_SKIP: |-
- EXAMPLES_TO_SKIP: '^(batch_convert|compare_vlm_models|minimal|minimal_vlm_pipeline|minimal_asr_pipeline|export_multimodal|custom_convert|develop_picture_enrichment|rapidocr_with_custom_models|suryaocr_with_custom_models|offline_convert|pictures_description|pictures_description_api|vlm_pipeline_api_model|granitedocling_repetition_stopping|mlx_whisper_example|gpu_standard_pipeline|gpu_vlm_pipeline|demo_layout_vlm|post_process_ocr_with_vlm)\.py$|xbrl_conversion\.ipynb$'
+ EXAMPLES_TO_SKIP: '^(batch_convert|compare_vlm_models|minimal|minimal_vlm_pipeline|minimal_asr_pipeline|export_multimodal|custom_convert|develop_picture_enrichment|rapidocr_with_custom_models|suryaocr_with_custom_models|offline_convert|pictures_description|pictures_description_api|vlm_pipeline_api_model|granitedocling_repetition_stopping|mlx_whisper_example|gpu_standard_pipeline|gpu_vlm_pipeline|demo_layout_vlm|post_process_ocr_with_vlm|run_with_formats_html_rendered|run_with_formats_html_rendered_mp)\.py$|xbrl_conversion\.ipynb$'

jobs:
lint:
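
To sanity-check the updated `EXAMPLES_TO_SKIP` value, the pattern can be exercised locally. The sketch below is only an illustration: it assumes the workflow matches the pattern against each example's bare file name with search semantics (the matching step is not part of this hunk), and the helper `is_skipped` is a hypothetical name, not something defined in the repository.

```python
import re

# Example names taken from the new EXAMPLES_TO_SKIP value in checks.yml.
SKIPPED_PY_EXAMPLES = [
    "batch_convert", "compare_vlm_models", "minimal", "minimal_vlm_pipeline",
    "minimal_asr_pipeline", "export_multimodal", "custom_convert",
    "develop_picture_enrichment", "rapidocr_with_custom_models",
    "suryaocr_with_custom_models", "offline_convert", "pictures_description",
    "pictures_description_api", "vlm_pipeline_api_model",
    "granitedocling_repetition_stopping", "mlx_whisper_example",
    "gpu_standard_pipeline", "gpu_vlm_pipeline", "demo_layout_vlm",
    "post_process_ocr_with_vlm",
    # The two entries added by this PR:
    "run_with_formats_html_rendered", "run_with_formats_html_rendered_mp",
]

# Rebuild the same regular expression that the workflow stores as one string.
EXAMPLES_TO_SKIP = (
    r"^(" + "|".join(SKIPPED_PY_EXAMPLES) + r")\.py$|xbrl_conversion\.ipynb$"
)


def is_skipped(filename: str) -> bool:
    """Hypothetical helper: True if the bare file name matches the skip pattern.

    Assumes search semantics on the file name only (no directory prefix).
    """
    return re.search(EXAMPLES_TO_SKIP, filename) is not None


if __name__ == "__main__":
    for name in (
        "run_with_formats_html_rendered.py",
        "run_with_formats_html_rendered_mp.py",
        "run_with_formats.py",
    ):
        print(name, "skipped" if is_skipped(name) else "runs in CI")
```

Because the alternation is wrapped in `^(...)\.py$`, only exact file names are skipped; an example whose name merely starts with a listed entry, such as `run_with_formats.py`, does not match under this assumption.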