LLM: release plugin once pipeline is removed and WA for GPU #2102

Open

sbalandi wants to merge 4 commits into master

Conversation

sbalandi (Contributor)

No description provided.

@sbalandi sbalandi marked this pull request as ready for review April 23, 2025 10:30
@github-actions bot added the following labels on Apr 23, 2025: category: continuous batching (Continuous batching), category: LLM (LLM pipeline: stateful, static), category: tokenizers (Tokenizer class or submodule update), category: GenAI C++ API (Changes in GenAI C++ public headers), and no-match-files
sbalandi (Contributor, Author)

Checked the memory on Linux and Windows and got the same results as in the task: most of the memory is released after removing the pipeline, but 40-60 MB remain (that tail is not the goal of this PR).
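
For reference, a minimal sketch of the check described above, assuming the OpenVINO GenAI C++ API (`ov::genai::LLMPipeline`); the `rss_kb()` helper is an illustrative, Linux-only invention and the model path is a placeholder:

```cpp
#include <openvino/genai/llm_pipeline.hpp>

#include <fstream>
#include <iostream>
#include <string>

// Hypothetical Linux-only helper: parse resident set size (VmRSS, in kB)
// from /proc/self/status.
static long rss_kb() {
    std::ifstream status("/proc/self/status");
    for (std::string line; std::getline(status, line);) {
        if (line.rfind("VmRSS:", 0) == 0)
            return std::stol(line.substr(6));  // value after "VmRSS:" is in kB
    }
    return -1;
}

int main() {
    std::cout << "RSS before pipeline: " << rss_kb() << " kB\n";
    {
        // Scope the pipeline so its destructor runs at the closing brace.
        ov::genai::LLMPipeline pipe("/path/to/model", "GPU");  // placeholder path
        std::cout << pipe.generate("Hello", ov::genai::max_new_tokens(16)) << "\n";
        std::cout << "RSS with pipeline: " << rss_kb() << " kB\n";
    }
    // With this change the GPU plugin should be released together with the
    // pipeline, so most of the memory above is returned here; a 40-60 MB
    // tail remains, as noted in the comment.
    std::cout << "RSS after pipeline destruction: " << rss_kb() << " kB\n";
}
```

The relevant runtime call for releasing a device plugin is `ov::Core::unload_plugin(device_name)`; per the PR title, the idea is that destroying the pipeline also releases the plugin for the device it compiled on.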

@Wovchena, please take a look.
