[Feature Request]: Multimodal LLMReranker

### Feature Description

A node reranker that can handle multimodal data. Today, AFAIK, this would only affect images and text since I do not know of a case (except for perhaps custom retriever implementations) when any retrievers return audio/video data. This allows information that is in say a powerpoint image to be ranked according to a search query.

### Reason

In order to achieve [the goal of Multimodal Pipelines/Engines](https://github.com/run-llama/llama_index/issues/15667), we need node postprocessors that can support multimodal data. 

### Value of Feature

In certain document types (especially pptx, but also some pdfs), considerable amounts of information may be stored in images. Today, node postprocessors cannot handle ImageNodes. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request]: Multimodal LLMReranker #20742

Feature Description

Reason

Value of Feature

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request]: Multimodal LLMReranker #20742

Description

Feature Description

Reason

Value of Feature

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions