Skip to content

Commit 9aa8d24

Browse files
authored
Super tiny add Qwen3-4B-Instruct-2507 model config (#661)
1 parent ad98b01 commit 9aa8d24

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed
Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
MODEL_ARGS_ROTARY_BASE=5000000 source "$(dirname "$0")/qwen3-4B.sh"

scripts/models/qwen3-4B.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ MODEL_ARGS=(
1010
--disable-bias-linear
1111
--normalization "RMSNorm"
1212
--norm-epsilon 1e-6
13-
--rotary-base 1000000
13+
--rotary-base "${MODEL_ARGS_ROTARY_BASE:-1000000}"
1414
--vocab-size 151936
1515
--kv-channels 128
1616
--qk-layernorm

0 commit comments

Comments
 (0)