bug/None text attribute when normalizing Picture to Image element

**Describe the bug**
When using `yolox` as the Hi-res model for loading outputs / annotations, it annotates with bbox dimensions but missing text for complex images, resulting in `text` being set to `None`. But later on printing or accessing the Image element (`__str__` method), it should be returning `string` instead of `None`.

**To Reproduce**
```python
from langchain_community.document_loaders.image import UnstructuredImageLoader
from unstructured_inference.models.base import DEFAULT_MODEL

import os

img_loader = UnstructuredImageLoader(
            "5.jpg", # can rename the attached images
            hi_res_model_name=DEFAULT_MODEL,
        )

data = img_loader.load()

for i in data:
    print(i)
```

OR

```python
from unstructured.partition.image import partition_image

elements = partition_image("5.jpg", hi_res_model_name=DEFAULT_MODEL)

print(elements)

for el in elements:
    print(el)
```


**Expected behavior**
Even if an image is detected with bbox but missing text, we should set the text to empty string instrad of "None" which ends up with exception when we try to print Image element (`__str__` method).

**Screenshots**

<img width="829" height="243" alt="Image" src="https://github.com/user-attachments/assets/4a40c8c5-4ee0-4395-830b-d62724184b44" />

Test Images

<img width="802" height="462" alt="Image" src="https://github.com/user-attachments/assets/caac9fc2-95da-4882-b447-d3631e8d8336" />

<img width="455" height="502" alt="Image" src="https://github.com/user-attachments/assets/15956a85-524c-419c-acf5-6eadadaeb3ca" />

---

**Environment Info**
```
unstructured             0.18.13
unstructured-client      0.42.3
unstructured-inference   1.0.5
unstructured-pytesseract 0.3.15
detectron2               0.6
torch                    2.8.0
torchvision              0.23.0
```

**Additional context**
This issue is also related to this [issue](https://github.com/langchain-ai/langchain-community/issues/310).


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bug/None text attribute when normalizing Picture to Image element #4084

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

bug/None text attribute when normalizing Picture to Image element #4084

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions