|
TVM_FFI_ICHECK( |
|
fc1_weight_block.size(0) == num_experts_on_rank && |
|
fc1_weight_block.size(1) == |
|
TmaWarpSpecializedGroupedGemmInput::alignToSfDim( |
|
inter_size, TmaWarpSpecializedGroupedGemmInput::MinNDimAlignmentMXFPX) * |
|
2 && |
|
fc1_weight_block.size(2) * FP8_PER_INT32 * |
|
TmaWarpSpecializedGroupedGemmInput::MXFPXBlockScaleVectorSize == |
|
TmaWarpSpecializedGroupedGemmInput::alignToSfDim( |
|
hidden_size, TmaWarpSpecializedGroupedGemmInput::MinKDimAlignmentMXFPX)) |
|
<< "fc1 weight block size must be (num_experts_on_rank, inter_size * 2, hidden_size // 4 " |
|
"// block_scale_vector_size)"; |
This check is triggered for both non-gated and gated-activations. This should only be triggered if we are using gated activations.
flashinfer/csrc/fused_moe/cutlass_backend/flashinfer_cutlass_fused_moe_binding.cu
Lines 893 to 904 in 2bb3e9e
This check is triggered for both non-gated and gated-activations. This should only be triggered if we are using gated activations.