Commit cbaf338

Merge pull request #326 from cyberofficial/vultr
Fix: Adjust Vultr Model Context and Output Limits Based on Empirical Testing
2 parents c4f5962 + e0dbb54 commit cbaf338

5 files changed: 13 additions and 13 deletions

providers/vultr/models/deepseek-r1-distill-llama-70b.toml

Lines changed: 2 additions & 2 deletions
@@ -13,8 +13,8 @@ input = 0.20
 output = 0.20

 [limit]
-context = 128_000
-output = 32_768
+context = 121_808
+output = 8_192

 [modalities]
 input = ["text"]

providers/vultr/models/deepseek-r1-distill-qwen-32b.toml

Lines changed: 2 additions & 2 deletions
@@ -13,8 +13,8 @@ input = 0.20
 output = 0.20

 [limit]
-context = 128_000
-output = 32_768
+context = 121_808
+output = 8_192

 [modalities]
 input = ["text"]

providers/vultr/models/gpt-oss-120b.toml

Lines changed: 3 additions & 3 deletions
@@ -13,9 +13,9 @@ input = 0.20
 output = 0.20

 [limit]
-context = 128_000
-output = 131_072
+context = 121_808
+output = 8_192

 [modalities]
 input = ["text"]
-output = ["text"]
+output = ["text"]

providers/vultr/models/kimi-k2-instruct.toml

Lines changed: 3 additions & 3 deletions
@@ -13,9 +13,9 @@ input = 0.20
 output = 0.20

 [limit]
-context = 128_000
-output = 16_384
+context = 58_904
+output = 4_096

 [modalities]
 input = ["text"]
-output = ["text"]
+output = ["text"]

providers/vultr/models/qwen2.5-coder-32b-instruct.toml

Lines changed: 3 additions & 3 deletions
@@ -13,9 +13,9 @@ input = 0.20
 output = 0.20

 [limit]
-context = 128_000
-output = 32_768
+context = 12_952
+output = 2_048

 [modalities]
 input = ["text"]
-output = ["text"]
+output = ["text"]

0 commit comments
