Description
The application raises an exception on an Arm CPU when using the OpenVINO GenAI 2024.6 package:
```
root@lemans:/data/tts# ./meloTTS_ov --model_dir ov_models --input_file ./inputs_en.txt
[INFO] Set ONEDNN_PRIMITIVE_CACHE_CAPACITY: 100
[INFO] Set ONEDNN_MAX_CPU_ISA: AVX2_VNNI
[INFO] OpenVinoTokenizer Initialized!
Set CPU_RUNTIME_CACHE_CAPACITY 0
OpenVINO:OpenVINO Runtime
Version : 2024.6.0
Build : 2024.6.0-17404-4c0f47d2335-releases/2024/6
Model Device info:openvino_arm_cpu_plugin
Version : 2024.6.0
Build : 2024.6.0-17404-4c0f47d2335-releases/2024/6
CMUDict::CMUDict: Construct CMUDict
[INFO] English::Init English language Module Succeed!
Set CPU_RUNTIME_CACHE_CAPACITY 0
[INFO] MiniBartG2P: use_past is false.
[INFO] Construct MiniBartG2P succeeded.
OpenVINO:OpenVINO Runtime
Version : 2024.6.0
Build : 2024.6.0-17404-4c0f47d2335-releases/2024/6
Model Device info:openvino_arm_cpu_plugin
Version : 2024.6.0
Build : 2024.6.0-17404-4c0f47d2335-releases/2024/6
[INFO] Engilish:: Init MiniBartG2P Succeed!
Set CPU_RUNTIME_CACHE_CAPACITY 0
OpenVINO:OpenVINO Runtime
Version : 2024.6.0
Build : 2024.6.0-17404-4c0f47d2335-releases/2024/6
Model Device info:openvino_arm_cpu_plugin
Version : 2024.6.0
Build : 2024.6.0-17404-4c0f47d2335-releases/2024/6
TTS::TTS : init bert_model
TTS::TTS : open puncuation dict.
model init time is3104 ms
[INFO] bert infer time: 28ms
std::runtime_error: Exception from src/inference/src/cpp/infer_request.cpp:223:
Exception from src/plugins/intel_cpu/src/node.cpp:725:
[CPU] Pad node with name '__module.model.flow.flows.6.enc.attn_layers.0/aten::pad' Exception from src/plugins/intel_cpu/src/memory_desc/cpu_memory_desc.h:89:
ParameterMismatch: Can not clone with new dims. Descriptor's shape: {1, 9 - ?, 96} is incompatible with provided dimensions: {1, 1, 96}.
model infer time:5243 ms
root@lemans:/data/tts#
```
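
The final `ParameterMismatch` message can be read as a failed bounds check on a dynamic shape. Below is a minimal sketch of that check, assuming `9 - ?` denotes a dimension with a lower bound of 9 and no upper bound. That reading is an interpretation of the log, not confirmed plugin behaviour. The `is_compatible` helper and the `(min, max)` encoding are hypothetical names used for illustration only and are not part of the OpenVINO API.

```python
from typing import Optional, Sequence, Tuple

# Hypothetical encoding: each dimension is (min, max); max=None means
# "no upper bound", which is how "?" in the log is interpreted here.
Dim = Tuple[int, Optional[int]]


def is_compatible(descriptor: Sequence[Dim], dims: Sequence[int]) -> bool:
    """Return True if every concrete dim lies within the descriptor's bounds."""
    if len(descriptor) != len(dims):
        return False
    for (lo, hi), d in zip(descriptor, dims):
        if d < lo:
            return False
        if hi is not None and d > hi:
            return False
    return True


# Shape from the log, under the stated assumption: {1, 9 - ?, 96}
descriptor = [(1, 1), (9, None), (96, 96)]

print(is_compatible(descriptor, [1, 1, 96]))  # False: second dim 1 is below 9
print(is_compatible(descriptor, [1, 9, 96]))  # True: second dim meets the bound
```

Under this reading, the Pad node is handed a tensor whose second dimension is 1 while its descriptor expects at least 9, which would explain why `{1, 1, 96}` is rejected.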