normed_output = model.generate(normed_seqs, max_new_tokens=prediction_length)

debug:
  File "/home/zhanghuanhome/miniconda3/lib/python3.12/site-packages/transformers/modeling_attn_mask_utils.py", line 331, in _prepare_4d_causal_attention_mask
    attention_mask = attn_mask_converter.to_4d(
                     ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/zhanghuanhome/miniconda3/lib/python3.12/site-packages/transformers/modeling_attn_mask_utils.py", line 139, in to_4d
    expanded_attn_mask = causal_4d_mask.masked_fill(expanded_attn_mask.bool(), torch.finfo(dtype).min)
                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
RuntimeError: The size of tensor a (2880) must match the size of tensor b (30) at non-singleton dimension 3


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

normed_output = model.generate(normed_seqs, max_new_tokens=prediction_length) #58

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

normed_output = model.generate(normed_seqs, max_new_tokens=prediction_length) #58

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions