[QNN-EP] Update gather op input tensor cast logic. (#26835)

quic-muchhsu · edgchen1 · web-flow · commit 925b90b87a57 · 2025-12-20T00:53:16.000Z
### Description
&lt;!-- Describe your changes. --&gt;
Gather op was referring to onnx graph when deciding whether to insert
`Cast-&gt;int32` on indices. But input tensor is created by QNN and it
could already casted into int32. Which cause mismatch and resulting
adding redundant Cast. This PR changes Gather Op builder to refer to QNN
tenser before adding int64-&gt;int32 cast.


### Motivation and Context
&lt;!-- - Why is this change required? What problem does it solve?
- If it fixes an open issue, please link to the issue here. --&gt;
It solve QNN-Gather op not to insert redundant Cast-&gt;int32.

---------

Signed-off-by: Mu-Chein Hsu &lt;quic_muchhsu@quicinc.com&gt;
Co-authored-by: Edward Chen &lt;18449977+edgchen1@users.noreply.github.com&gt;
diff --git a/onnxruntime/core/providers/qnn/builder/opbuilder/gather_op_builder.cc b/onnxruntime/core/providers/qnn/builder/opbuilder/gather_op_builder.cc
@@ -160,8 +160,11 @@ static Status ProcessIndicesInput(QnnModelWrapper& qnn_model_wrapper,
   }
 
   // Insert QNN Cast op to convert dynamic indices from int64 to int32.
+  const auto& input_tensorwrapper = qnn_model_wrapper.GetQnnTensorWrapper(indices_tensor_name);
+
   std::string indices_casted_name{indices_tensor_name};
-  if (indices_info.qnn_data_type == QNN_DATATYPE_INT_64) {
+  // Check QNN Tensor data type.
+  if (input_tensorwrapper.GetTensorDataType() == QNN_DATATYPE_INT_64) {
     assert(!indices_info.is_initializer);
     indices_casted_name += "_int32";
     if (qnn_model_wrapper.IsQnnTensorWrapperExist(indices_casted_name)) {