You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[OpenVINO] Connect beam_idx input to Linear Attention Layers (CausalConv, SSM, GDN) (huggingface#1619)
* [OpenVINO] Support beam search for Linear Attention Layers (CausalConv, SSM, GDN)
* Fix formatting issue
* Remove unneeded import
* Apply correct patching for attention mask
* Deprecate arguments in fuse in a correct way
* Apply suggestions from code review
* Correct elements in not_ov_cache_inputs
* Check that cache input names provided
* Fix documentation for fuse_cache_reorder
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
* Apply suggestions from code review
* Apply suggestion from @rkazants
* Apply suggestions from code review
* Apply suggestions from code review
* Update optimum/exporters/openvino/stateful.py
* Apply suggestion from @rkazants
* Apply suggestion from @rkazants
* Apply suggestion from @rkazants
* Fix internal function get_kv_ssm_tensor_names
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
---------
Signed-off-by: Kazantsev, Roman <roman.kazantsev@intel.com>
0 commit comments