Open
Description
debug:
File "/home/zhanghuanhome/miniconda3/lib/python3.12/site-packages/transformers/modeling_attn_mask_utils.py", line 331, in _prepare_4d_causal_attention_mask
attention_mask = attn_mask_converter.to_4d(
^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/home/zhanghuanhome/miniconda3/lib/python3.12/site-packages/transformers/modeling_attn_mask_utils.py", line 139, in to_4d
expanded_attn_mask = causal_4d_mask.masked_fill(expanded_attn_mask.bool(), torch.finfo(dtype).min)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
RuntimeError: The size of tensor a (2880) must match the size of tensor b (30) at non-singleton dimension 3
Metadata
Metadata
Assignees
Labels
No labels