Misc. bug: Unable to use the OpenVINO NPU backend

### Name and Version

version: 9444 (b9444)
built with GNU 16.1.1 for Linux x86_64

### Operating systems

_No response_

### Which llama.cpp modules do you know to be affected?

_No response_

### Command line

```shell
GGML_OPENVINO_DEVICE=NPU llama-cli --predict 256 --ctx-size 4096 --device OPENVINO0 --model Qwen2.5-3B-Instruct-Q4_0.gguf
```

### Problem description & steps to reproduce

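Run the command above. Decoding fails: the OpenVINO backend throws an `ov::Exception` reporting mismatched tensor element types (`src: f32 != dst: f16`), and `llama_decode` returns -3. The full output is in the log section below.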

### First Bad Commit

_No response_

### Relevant log output

<details>
<summary>Logs</summary>


```console
0.39.958.062 E GGML OpenVINO backend ov::Exception: Exception from src/inference/src/cpp/infer_request.cpp:224:
Check 'dst->get_element_type() == get_element_type()' failed at src/core/src/runtime/itensor.cpp:75:
Tensor element types are not equal. (src: f32 != dst: f16)


0.39.958.070 E graph_compute: ggml_backend_sched_graph_compute_async failed with error -1
0.39.958.071 E process_ubatch: failed to compute graph, compute status: -1
0.39.958.083 E llama_decode: failed to decode, ret = -3
0.39.965.952 E GGML OpenVINO backend ov::Exception: Exception from src/inference/src/cpp/infer_request.cpp:224:
Check 'dst->get_element_type() == get_element_type()' failed at src/core/src/runtime/itensor.cpp:75:
Tensor element types are not equal. (src: f32 != dst: f16)


0.39.965.954 E graph_compute: ggml_backend_sched_graph_compute_async failed with error -1
0.39.965.954 E process_ubatch: failed to compute graph, compute status: -1
0.39.965.959 E llama_decode: failed to decode, ret = -3
0.39.965.959 E common_context_can_seq_rm: llama_decode() failed: -3
```
</details>
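
When comparing runs, it can help to condense a saved log to just the element-type mismatches it reports. The snippet below is a minimal sketch, assuming the log was saved to a file; `summarize_mismatch` is a hypothetical helper name and is not part of llama.cpp or OpenVINO.

```shell
#!/bin/sh
# Hypothetical helper (not part of llama.cpp or OpenVINO):
# count each distinct "src: X != dst: Y" element-type mismatch in a saved log.
summarize_mismatch() {
    # $1: path to a log file containing the backend's error output
    grep -o 'src: [A-Za-z0-9_]* != dst: [A-Za-z0-9_]*' "$1" \
        | sort \
        | uniq -c \
        | sed 's/^ *//'   # strip the padding that uniq -c adds before the count
}
```

Calling `summarize_mismatch llama.log` (the file name is an assumption) prints one line per distinct source/destination type pair, prefixed by how many times it occurred.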

Misc. bug: Unable to use the OpenVINO NPU backend #23984
