Skip to content

Commit e0dbb54

Browse files
committed
Update context and output limits for Vultr models
Adjusted the 'context' and 'output' token limits in TOML configs for deepseek-r1-distill-llama-70b, deepseek-r1-distill-qwen-32b, gpt-oss-120b, kimi-k2-instruct, and qwen2.5-coder-32b-instruct models to reflect new capacity constraints.
1 parent 13d6d49 commit e0dbb54

File tree

5 files changed

+13
-13
lines changed

5 files changed

+13
-13
lines changed

providers/vultr/models/deepseek-r1-distill-llama-70b.toml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -13,8 +13,8 @@ input = 0.20
1313
output = 0.20
1414

1515
[limit]
16-
context = 128_000
17-
output = 32_768
16+
context = 121_808
17+
output = 8_192
1818

1919
[modalities]
2020
input = ["text"]

providers/vultr/models/deepseek-r1-distill-qwen-32b.toml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -13,8 +13,8 @@ input = 0.20
1313
output = 0.20
1414

1515
[limit]
16-
context = 128_000
17-
output = 32_768
16+
context = 121_808
17+
output = 8_192
1818

1919
[modalities]
2020
input = ["text"]

providers/vultr/models/gpt-oss-120b.toml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,9 @@ input = 0.20
1313
output = 0.20
1414

1515
[limit]
16-
context = 128_000
17-
output = 131_072
16+
context = 121_808
17+
output = 8_192
1818

1919
[modalities]
2020
input = ["text"]
21-
output = ["text"]
21+
output = ["text"]

providers/vultr/models/kimi-k2-instruct.toml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,9 @@ input = 0.20
1313
output = 0.20
1414

1515
[limit]
16-
context = 128_000
17-
output = 16_384
16+
context = 58_904
17+
output = 4_096
1818

1919
[modalities]
2020
input = ["text"]
21-
output = ["text"]
21+
output = ["text"]

providers/vultr/models/qwen2.5-coder-32b-instruct.toml

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,9 @@ input = 0.20
1313
output = 0.20
1414

1515
[limit]
16-
context = 128_000
17-
output = 32_768
16+
context = 12_952
17+
output = 2_048
1818

1919
[modalities]
2020
input = ["text"]
21-
output = ["text"]
21+
output = ["text"]

0 commit comments

Comments
 (0)