Commit fb0c870
Optimize get_inputs_embeds() for Qwen2VL. (openvinotoolkit#2037)
Image embeddings merger is moved to a separate method and in chat mode
it is used only when new images are passed.
Time measures are below for 6 chat iterations for Qwen2-VL-2B-Instruct,
where image is passed on the first and third iteration.
This branch:
Chat iteration 1 (new image):
encode time: 2012 ms
get_inputs_embeds time: 7683 ms
Chat iteration 2:
encode time: 0 ms
get_inputs_embeds time: 7 ms
Chat iteration 3 (new image):
encode time: 2359 ms
get_inputs_embeds time: 29179 ms
Chat iteration 4:
encode time: 0 ms
get_inputs_embeds time: 10 ms
Chat iteration 5:
encode time: 0 ms
get_inputs_embeds time: 11 ms
Chat iteration 6:
encode time: 0 ms
get_inputs_embeds time: 8 ms
Master:
Chat iteration 1 (new image):
encode time: 1893ms
get_inputs_embeds time: 8394ms
Chat iteration 2:
encode time: 0ms
get_inputs_embeds time: 7664ms
Chat iteration 3 (new image):
encode time: 2126ms
get_inputs_embeds time: 27954ms
Chat iteration 4:
encode time: 0ms
get_inputs_embeds time: 27944ms
Chat iteration 5:
encode time: 0ms
get_inputs_embeds time: 27974ms
Chat iteration 6:
encode time: 0ms
get_inputs_embeds time: 27970ms
---------
Co-authored-by: Ilya Lavrenov <ilya.lavrenov@intel.com>
Co-authored-by: Vladimir Zlobin <vladimir.zlobin@intel.com>1 parent b4ed057 commit fb0c870
File tree
15 files changed
+84
-51
lines changed- src/cpp/src
- visual_language
- internvl_chat
- llava_next
- llava
- minicpm
- phi3_vision
- qwen2vl
15 files changed
+84
-51
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
174 | 174 | | |
175 | 175 | | |
176 | 176 | | |
| 177 | + | |
177 | 178 | | |
178 | 179 | | |
179 | | - | |
180 | 180 | | |
| 181 | + | |
181 | 182 | | |
182 | 183 | | |
183 | | - | |
| 184 | + | |
| 185 | + | |
| 186 | + | |
| 187 | + | |
184 | 188 | | |
185 | 189 | | |
186 | 190 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
244 | 244 | | |
245 | 245 | | |
246 | 246 | | |
247 | | - | |
248 | | - | |
| 247 | + | |
| 248 | + | |
249 | 249 | | |
250 | 250 | | |
251 | 251 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
37 | 37 | | |
38 | 38 | | |
39 | 39 | | |
40 | | - | |
| 40 | + | |
41 | 41 | | |
42 | 42 | | |
43 | 43 | | |
| |||
98 | 98 | | |
99 | 99 | | |
100 | 100 | | |
101 | | - | |
| 101 | + | |
102 | 102 | | |
103 | 103 | | |
104 | 104 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
226 | 226 | | |
227 | 227 | | |
228 | 228 | | |
229 | | - | |
| 229 | + | |
230 | 230 | | |
231 | 231 | | |
232 | 232 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
38 | | - | |
| 38 | + | |
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
114 | 114 | | |
115 | 115 | | |
116 | 116 | | |
117 | | - | |
| 117 | + | |
118 | 118 | | |
119 | 119 | | |
120 | 120 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
38 | | - | |
| 38 | + | |
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
343 | 343 | | |
344 | 344 | | |
345 | 345 | | |
346 | | - | |
| 346 | + | |
347 | 347 | | |
348 | 348 | | |
349 | 349 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
22 | 22 | | |
23 | 23 | | |
24 | 24 | | |
25 | | - | |
| 25 | + | |
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
566 | 566 | | |
567 | 567 | | |
568 | 568 | | |
569 | | - | |
| 569 | + | |
570 | 570 | | |
571 | 571 | | |
572 | 572 | | |
| |||
0 commit comments