Ollama openvino integration #953
Conversation
Will it be beneficial in the future to reuse the OpenVINO GenAI Continuous Batching pipeline, which is faster when handling multiple requests?
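For reference, a minimal sketch of what serving several prompts through ov::genai::ContinuousBatchingPipeline looks like. This only illustrates the batched API being discussed, not code from this PR; the model path, device, and cache size are placeholder values.

```cpp
#include <iostream>
#include <string>
#include <vector>

#include "openvino/genai/continuous_batching_pipeline.hpp"

int main(int argc, char* argv[]) {
    // Directory with an OpenVINO IR LLM exported by optimum-cli (placeholder path).
    const std::string models_path = argv[1];

    // Scheduler settings control how concurrent requests are batched together.
    ov::genai::SchedulerConfig scheduler_config;
    scheduler_config.cache_size = 2;  // KV-cache budget in GB (example value)

    ov::genai::ContinuousBatchingPipeline pipe(models_path, scheduler_config, "CPU");

    ov::genai::GenerationConfig config = ov::genai::greedy();
    config.max_new_tokens = 64;

    // Several client prompts handled in one batched generate call.
    std::vector<std::string> prompts = {"What is OpenVINO?", "What is Ollama?"};
    std::vector<ov::genai::GenerationConfig> configs(prompts.size(), config);

    auto results = pipe.generate(prompts, configs);
    for (const auto& res : results) {
        std::cout << res.m_generation_ids.front() << "\n";
    }
    return 0;
}
```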
Could you please also add a single GitHub Actions workflow to check that the integration is working?
Our current integration only adopts the basic ov::genai::LLMPipeline API; streaming generation with a callback function is still WIP. Both workloads run in single-client, single-server mode. I am not sure under what conditions Continuous Batching would be invoked in this mode, nor whether that optimization is necessary; the behavior should be fully aligned with the current GenAI benchmark. For multiple requests, yes, there are potential benefits. It would require a wrapper interface around ov::genai::ContinuousBatchingPipeline, likely in a similar way to OVMS. I would not expect this feature in the upcoming 25.1 FRC release unless there is a customer request directly related to it.
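A minimal sketch of the single-request ov::genai::LLMPipeline path with a token-streaming callback, roughly the shape of what is described above (model path, device, and prompt are placeholders; the exact streamer signature may vary between GenAI releases):

```cpp
#include <iostream>
#include <string>

#include "openvino/genai/llm_pipeline.hpp"

int main(int argc, char* argv[]) {
    // Directory with an OpenVINO IR LLM exported by optimum-cli (placeholder path).
    const std::string models_path = argv[1];
    ov::genai::LLMPipeline pipe(models_path, "CPU");

    ov::genai::GenerationConfig config;
    config.max_new_tokens = 128;

    // Streamer callback: invoked for each decoded chunk; returning false lets
    // generation continue, returning true stops it early.
    auto streamer = [](std::string subword) {
        std::cout << subword << std::flush;
        return false;
    };

    pipe.generate("Why is the sky blue?", config, streamer);
    return 0;
}
```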
I don't think it makes sense to add a .github action inside this folder, because GitHub would treat it as a sub-repo that needs to be hyperlinked to an official repo, which means we would have to maintain another official repo. If you really want an action check, do you think adding the submodule's source compilation and tests here is enough? I don't think we need to download all the models for testing; that would be too heavy.
Disable cgocheck for runtime as well.
Hi @ilya-lavrenov, we have added an Actions workflow and verified that the integration is working; please review it.
Is it possible to merge it now?
@alvoron I learned that you are one of the owners of openvino_contrib. Could you please help review this PR, and approve the merge if no further changes are needed?
Add a new Ollama-OV module that integrates OpenVINO GenAI as the backend engine of Ollama to accelerate LLM inference on Intel platforms (CPU/iGPU/dGPU/NPU).