Skip to content

Commit 40922f0

Browse files
committed
Merge branch 'main' of github.com:DS4SD/docling-core into robustify-page-filtering
2 parents 211b82a + 7bd274b commit 40922f0

File tree

92 files changed

+12187
-703
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

92 files changed

+12187
-703
lines changed

CHANGELOG.md

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,21 @@
1+
## [v2.51.1](https://github.com/docling-project/docling-core/releases/tag/v2.51.1) - 2025-11-14
2+
3+
### Fix
4+
5+
* Improve meta migration ([#422](https://github.com/docling-project/docling-core/issues/422)) ([`bc0e96b`](https://github.com/docling-project/docling-core/commit/bc0e96b9dc298d2e96ab2b4ce9faa4165d661b94))
6+
* DoclingDocument model validator should deal with any raw input ([#419](https://github.com/docling-project/docling-core/issues/419)) ([`56b3c42`](https://github.com/docling-project/docling-core/commit/56b3c42c61dbca7e9aa4a44fae18ecaadb482f81))
7+
8+
## [v2.51.0](https://github.com/docling-project/docling-core/releases/tag/v2.51.0) - 2025-11-12
9+
10+
### Feature
11+
12+
* Add code chunking functionality ([#398](https://github.com/docling-project/docling-core/issues/398)) ([`3097645`](https://github.com/docling-project/docling-core/commit/3097645198915a1258cfe6e1d5df3b5f1c79395a))
13+
14+
### Fix
15+
16+
* Improve meta migration and warning handling ([#417](https://github.com/docling-project/docling-core/issues/417)) ([`3d13b02`](https://github.com/docling-project/docling-core/commit/3d13b02756f1c0d1f1ccab5cfbd76f1f888a0dd9))
17+
* Fix import handling of extra dependencies for chunking ([#418](https://github.com/docling-project/docling-core/issues/418)) ([`567d3ad`](https://github.com/docling-project/docling-core/commit/567d3ada57e19b2a738991ae6e49d55dd3301b17))
18+
119
## [v2.50.1](https://github.com/docling-project/docling-core/releases/tag/v2.50.1) - 2025-11-04
220

321
### Fix

README.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
# Docling Core
22

33
[![PyPI version](https://img.shields.io/pypi/v/docling-core)](https://pypi.org/project/docling-core/)
4-
![Python](https://img.shields.io/badge/python-3.9%20%7C%203.10%20%7C%20%203.11%20%7C%203.12%20%7C%203.13-blue)
4+
![Python](https://img.shields.io/badge/python-3.9%20%7C%203.10%20%7C%20%203.11%20%7C%203.12%20%7C%203.13%20%7C%203.14-blue)
55
[![uv](https://img.shields.io/endpoint?url=https://raw.githubusercontent.com/astral-sh/uv/main/assets/badge/v0.json)](https://github.com/astral-sh/uv)
66
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
77
[![Imports: isort](https://img.shields.io/badge/%20imports-isort-%231674b1?style=flat&labelColor=ef8336)](https://pycqa.github.io/isort/)
@@ -21,7 +21,7 @@ pip install docling-core
2121

2222
### Development setup
2323

24-
To develop for Docling Core, you need Python 3.9 / 3.10 / 3.11 / 3.12 / 3.13 and uv. You can then install from your local clone's root dir:
24+
To develop for Docling Core, you need Python3.9 through 3.14 and the `uv` package. You can then install it from your local clone's root directory:
2525
```bash
2626
uv sync --all-extras
2727
```

docling_core/transforms/chunker/__init__.py

Lines changed: 14 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,9 +6,19 @@
66
"""Define the chunker types."""
77

88
from docling_core.transforms.chunker.base import BaseChunk, BaseChunker, BaseMeta
9-
from docling_core.transforms.chunker.hierarchical_chunker import (
10-
DocChunk,
11-
DocMeta,
12-
HierarchicalChunker,
9+
from docling_core.transforms.chunker.code_chunking.base_code_chunking_strategy import (
10+
BaseCodeChunkingStrategy,
1311
)
12+
from docling_core.transforms.chunker.code_chunking.code_chunk import (
13+
CodeChunk,
14+
CodeChunkType,
15+
CodeDocMeta,
16+
)
17+
from docling_core.transforms.chunker.code_chunking.standard_code_chunking_strategy import (
18+
StandardCodeChunkingStrategy,
19+
)
20+
from docling_core.transforms.chunker.doc_chunk import DocChunk, DocMeta
21+
from docling_core.transforms.chunker.hierarchical_chunker import HierarchicalChunker
22+
from docling_core.transforms.chunker.hybrid_chunker import HybridChunker
1423
from docling_core.transforms.chunker.page_chunker import PageChunker
24+
from docling_core.types.doc.labels import CodeLanguageLabel
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
"""Code chunking package."""

0 commit comments

Comments
 (0)