Commit a8dfb18
authored
[GPU] Reuse kv cache mem if it is not changed from previous infer (#28361)
### Details:
- When kv cache variable is reset, it is allocating a new memory.
- However, if the variable mem is not changed from previous iteration,
we can reuse previsouly allocated memory
### Tickets:
- *ticket-id*1 parent f616896 commit a8dfb18
File tree
4 files changed
+30
-7
lines changed- src/plugins/intel_gpu
- include/intel_gpu/graph
- src
- graph
- plugin
4 files changed
+30
-7
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
193 | 193 | | |
194 | 194 | | |
195 | 195 | | |
| 196 | + | |
| 197 | + | |
196 | 198 | | |
197 | 199 | | |
198 | 200 | | |
| |||
216 | 218 | | |
217 | 219 | | |
218 | 220 | | |
| 221 | + | |
219 | 222 | | |
220 | 223 | | |
221 | 224 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1028 | 1028 | | |
1029 | 1029 | | |
1030 | 1030 | | |
| 1031 | + | |
| 1032 | + | |
| 1033 | + | |
| 1034 | + | |
1031 | 1035 | | |
1032 | 1036 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
624 | 624 | | |
625 | 625 | | |
626 | 626 | | |
627 | | - | |
| 627 | + | |
| 628 | + | |
| 629 | + | |
| 630 | + | |
| 631 | + | |
| 632 | + | |
| 633 | + | |
628 | 634 | | |
629 | | - | |
630 | | - | |
| 635 | + | |
| 636 | + | |
631 | 637 | | |
632 | | - | |
633 | | - | |
| 638 | + | |
| 639 | + | |
| 640 | + | |
634 | 641 | | |
| 642 | + | |
| 643 | + | |
635 | 644 | | |
636 | | - | |
637 | 645 | | |
638 | 646 | | |
639 | 647 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
295 | 295 | | |
296 | 296 | | |
297 | 297 | | |
| 298 | + | |
298 | 299 | | |
299 | 300 | | |
300 | 301 | | |
| 302 | + | |
| 303 | + | |
| 304 | + | |
| 305 | + | |
| 306 | + | |
| 307 | + | |
| 308 | + | |
| 309 | + | |
301 | 310 | | |
302 | 311 | | |
303 | 312 | | |
304 | | - | |
305 | 313 | | |
306 | 314 | | |
307 | 315 | | |
| |||
0 commit comments