Skip to content

Commit b56ba3f

Browse files
print unquantized
1 parent 7b0cfab commit b56ba3f

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

vllm/model_executor/layers/linear.py

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -194,7 +194,8 @@ def process_weights_after_loading(self, layer: torch.nn.Module) -> None:
194194
# row parallel linear
195195
assert layer.weight.shape[1] % sp_size == 0
196196
chunk_size = layer.weight.shape[1] // sp_size
197-
weight = layer.weight.split(chunk_size, dim=1)[sp_rank]
197+
weight = layer.weight.split(chunk_size,
198+
dim=1)[sp_rank].contiguous()
198199
else:
199200
# column parallel linear
200201
assert layer.weight.shape[0] % sp_size == 0

0 commit comments

Comments
 (0)