feat/parse_html_embed_objects

I am trying to parse HTML documents containing embedded images and youtube videos inside iframe. I am able to use partition_html function get textual elements, as well metdata object containing ahref tags. However the image element as well iframe elements are being missed out.


I would like to have these data points made available either as separete elements like HTMLImage, HTMLIframe or attach these link urls as well made available as part of the metadata object's link_urls.




Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat/parse_html_embed_objects #2233

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

feat/parse_html_embed_objects #2233

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions