LLM: release plugin once pipeline is removed and WA for GPU #1846

p-durandin · 2025-03-05T08:43:54Z

Based on #1627 and WA for GPU oneDNN cache clean added

Wovchena · 2025-04-21T07:11:56Z

src/cpp/src/icontinuous_batching.cpp

-template<class... Ts> overloaded(Ts...) -> overloaded<Ts...>;
+ContinuousBatchingPipeline::IContinuousBatchingPipeline::~IContinuousBatchingPipeline() {
+    m_tokenizer = {};
+}


IContinuousBatchingPipeline is ContinuousBatchingImpl's parent. The order of destructor calls is reverse:

~ContinuousBatchingImpl()

~IContinuousBatchingPipeline()

By the time ~IContinuousBatchingPipeline() is called, utils::release_core_plugin(m_device) from ContinuousBatchingImpl() had already been executed. Given that the solution is satisfactory, you could have defined ~IContinuousBatchingPipeline() = default; and get the same result. But maybe a better solution would be to move utils::release_core_plugin(m_device) to IContinuousBatchingPipeline() (m_device is already there). This would fix the call order, remove the need to manually clear ContinuousBatchingImpl's members and enable clearing for other children: PromptLookupImpl and SpeculativeDecodingImpl.

@Wovchena I don't have right to modify that PR, so I have applied comment here #2102

ilya-lavrenov · 2025-04-25T19:56:30Z

replaced by #2102

ilya-lavrenov and others added 5 commits January 24, 2025 10:31

LLM: release plugin once pipeline is removed

ad81b21

Merge remote-tracking branch 'upstream/master' into release-plugin

b5b2906

Merge remote-tracking branch 'upstream/master' into release-plugin

76faf64

Merge remote-tracking branch 'upstream/master' into release-plugin

9ec1d82

LLM: release plugin once pipeline is removed and WA for GPU

c136de9

github-actions bot added category: continuous batching Continuous batching category: LLM LLM pipeline (stateful, static) category: tokenizers Tokenizer class or submodule update category: CPP API Changes in GenAI C++ public headers no-match-files labels Mar 5, 2025

LLM: release plugin once pipeline is removed and WA for GPU

7efad75

p-durandin added the do_not_merge label Mar 5, 2025

p-durandin requested a review from andrei-kochin March 5, 2025 09:50

p-durandin added 2 commits March 6, 2025 16:06

Merge branch 'master' into plugin_clean

492a89a

Merge branch 'master' into plugin_clean

ecaa35d

p-durandin assigned ilya-lavrenov Apr 18, 2025

Wovchena requested changes Apr 21, 2025

View reviewed changes

ilya-lavrenov closed this Apr 25, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

LLM: release plugin once pipeline is removed and WA for GPU #1846

LLM: release plugin once pipeline is removed and WA for GPU #1846

Uh oh!

p-durandin commented Mar 5, 2025

Uh oh!

Wovchena Apr 21, 2025

Uh oh!

sbalandi Apr 23, 2025

Uh oh!

ilya-lavrenov commented Apr 25, 2025

Uh oh!

Uh oh!

LLM: release plugin once pipeline is removed and WA for GPU #1846

LLM: release plugin once pipeline is removed and WA for GPU #1846

Uh oh!

Conversation

p-durandin commented Mar 5, 2025

Uh oh!

Wovchena Apr 21, 2025

Choose a reason for hiding this comment

Uh oh!

sbalandi Apr 23, 2025

Choose a reason for hiding this comment

Uh oh!

ilya-lavrenov commented Apr 25, 2025

Uh oh!

Uh oh!