Commit e806db5

missing models (#61)
* missing models
* meta llama models folder
1 parent c8aa265 commit e806db5

22 files changed

Lines changed: 134 additions & 5 deletions
Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+model: deepseek-ai/deepseek-ocr-maas
+costs:
+  - region: '*'
+    input_cost_per_token: 3.e-7
+    output_cost_per_token: 0.0000012
+mode: chat
Lines changed: 8 additions & 0 deletions
@@ -0,0 +1,8 @@
+model: deepseek-ai/deepseek-r1-0528-maas
+costs:
+  - region: '*'
+    input_cost_per_token: 0.00000135
+    output_cost_per_token: 0.0000054
+    input_cost_per_token_batches: 6.75e-7
+    output_cost_per_token_batches: 0.0000027
+mode: chat
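
The per-token fields above can be turned into a request cost with simple arithmetic. The following is a minimal sketch, not the way any particular consumer of these files computes cost: `estimate_cost` is a hypothetical helper, and it assumes the `_batches` fields replace the standard rates for batch requests. The values are copied from the deepseek-r1-0528-maas entry.

```python
from decimal import Decimal

# Rates copied from the deepseek-r1-0528-maas entry in this commit.
COSTS = {
    "input_cost_per_token": Decimal("0.00000135"),
    "output_cost_per_token": Decimal("0.0000054"),
    "input_cost_per_token_batches": Decimal("6.75e-7"),
    "output_cost_per_token_batches": Decimal("0.0000027"),
}


def estimate_cost(costs, input_tokens, output_tokens, batch=False):
    """Hypothetical helper: price a request from per-token rates.

    Assumption: batch requests use the `_batches` rates in place of
    the standard ones.
    """
    suffix = "_batches" if batch else ""
    return (
        costs["input_cost_per_token" + suffix] * input_tokens
        + costs["output_cost_per_token" + suffix] * output_tokens
    )


if __name__ == "__main__":
    # One million input tokens plus one million output tokens.
    print(estimate_cost(COSTS, 1_000_000, 1_000_000))
    print(estimate_cost(COSTS, 1_000_000, 1_000_000, batch=True))
```

`Decimal` is used here so that small per-token rates do not pick up binary floating-point rounding when multiplied by large token counts.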
Lines changed: 9 additions & 0 deletions
@@ -0,0 +1,9 @@
+model: deepseek-ai/deepseek-v3.1-maas
+costs:
+  - region: '*'
+    input_cost_per_token: 6.e-7
+    output_cost_per_token: 0.0000017
+    cache_read_input_token_cost: 6.e-8
+    input_cost_per_token_batches: 3.e-7
+    output_cost_per_token_batches: 8.5e-7
+mode: chat
Lines changed: 9 additions & 0 deletions
@@ -0,0 +1,9 @@
+model: deepseek-ai/deepseek-v3.2-maas
+costs:
+  - region: '*'
+    input_cost_per_token: 5.6e-7
+    output_cost_per_token: 0.00000168
+    cache_read_input_token_cost: 5.6e-8
+    input_cost_per_token_batches: 2.8e-7
+    output_cost_per_token_batches: 8.4e-7
+mode: chat

providers/google-vertex/llama-3.1-405b-instruct-maas.yaml renamed to providers/google-vertex/meta/llama-3.1-405b-instruct-maas.yaml

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-model: llama-3.1-405b-instruct-maas
+model: meta/llama-3.1-405b-instruct-maas
 costs:
   - region: '*'
     input_cost_per_token: 0.000005

providers/google-vertex/llama-3.1-70b-instruct-maas.yaml renamed to providers/google-vertex/meta/llama-3.1-70b-instruct-maas.yaml

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-model: llama-3.1-70b-instruct-maas
+model: meta/llama-3.1-70b-instruct-maas
 costs:
   - region: '*'
     input_cost_per_token: 0

providers/google-vertex/llama-3.1-8b-instruct-maas.yaml renamed to providers/google-vertex/meta/llama-3.1-8b-instruct-maas.yaml

Lines changed: 1 addition & 1 deletion
@@ -1,4 +1,4 @@
-model: llama-3.1-8b-instruct-maas
+model: meta/llama-3.1-8b-instruct-maas
 costs:
   - region: '*'
     input_cost_per_token: 0
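
In each rename shown in this commit, the `model` value matches the file path relative to providers/google-vertex with the .yaml extension removed, both before and after the move. A small check along those lines could look like the sketch below. It assumes that this path-to-id correspondence is an intended convention, which the commit does not state; `expected_model_id` is a hypothetical helper.

```python
from pathlib import PurePosixPath


def expected_model_id(path, provider_root="providers/google-vertex"):
    """Hypothetical helper: derive a model id from a file path.

    Assumption: the id is the path relative to the provider folder,
    without the .yaml extension, as the renames in this commit suggest.
    """
    relative = PurePosixPath(path).relative_to(provider_root)
    return str(relative.with_suffix(""))


if __name__ == "__main__":
    new_path = "providers/google-vertex/meta/llama-3.1-8b-instruct-maas.yaml"
    print(expected_model_id(new_path))
```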
Lines changed: 8 additions & 0 deletions
@@ -0,0 +1,8 @@
+model: meta/llama-3.3-70b-instruct-maas
+costs:
+  - region: '*'
+    input_cost_per_token: 7.2e-7
+    output_cost_per_token: 7.2e-7
+    input_cost_per_token_batches: 3.6e-7
+    output_cost_per_token_batches: 3.6e-7
+mode: chat

providers/google-vertex/llama-4-maverick-17b-128e-instruct-maas.yaml renamed to providers/google-vertex/meta/llama-4-maverick-17b-128e-instruct-maas.yaml

Lines changed: 3 additions & 1 deletion
@@ -1,8 +1,10 @@
-model: llama-4-maverick-17b-128e-instruct-maas
+model: meta/llama-4-maverick-17b-128e-instruct-maas
 costs:
   - region: '*'
     input_cost_per_token: 3.5e-7
     output_cost_per_token: 0.00000115
+    input_cost_per_token_batches: 1.75e-7
+    output_cost_per_token_batches: 5.75e-7
 limits:
   max_tokens: 1000000
   max_input_tokens: 1000000

providers/google-vertex/llama-4-scout-17b-16e-instruct-maas.yaml renamed to providers/google-vertex/meta/llama-4-scout-17b-16e-instruct-maas.yaml

Lines changed: 3 additions & 1 deletion
@@ -1,8 +1,10 @@
-model: llama-4-scout-17b-16e-instruct-maas
+model: meta/llama-4-scout-17b-16e-instruct-maas
 costs:
   - region: '*'
     input_cost_per_token: 2.5e-7
     output_cost_per_token: 7.e-7
+    input_cost_per_token_batches: 1.25e-7
+    output_cost_per_token_batches: 3.5e-7
 limits:
   max_tokens: 10000000
   max_input_tokens: 10000000
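
Every batch price added in this commit is half the matching standard price, and each `cache_read_input_token_cost` is one tenth of the input price. That is an observation about the values shown here, not a documented pricing rule. A sanity check built on it might look like the sketch below: `check_ratios` is a hypothetical helper, and the two sample entries are copied from the hunks above.

```python
from decimal import Decimal

# Sample rates copied from two entries in this commit.
ENTRIES = {
    "deepseek-ai/deepseek-v3.1-maas": {
        "input_cost_per_token": Decimal("6e-7"),
        "output_cost_per_token": Decimal("0.0000017"),
        "cache_read_input_token_cost": Decimal("6e-8"),
        "input_cost_per_token_batches": Decimal("3e-7"),
        "output_cost_per_token_batches": Decimal("8.5e-7"),
    },
    "meta/llama-4-scout-17b-16e-instruct-maas": {
        "input_cost_per_token": Decimal("2.5e-7"),
        "output_cost_per_token": Decimal("7e-7"),
        "input_cost_per_token_batches": Decimal("1.25e-7"),
        "output_cost_per_token_batches": Decimal("3.5e-7"),
    },
}


def check_ratios(costs):
    """Hypothetical check: return descriptions of rates that break the
    observed pattern (batch = 1/2 standard, cache read = 1/10 input)."""
    problems = []
    for kind in ("input", "output"):
        base = costs.get(f"{kind}_cost_per_token")
        batch = costs.get(f"{kind}_cost_per_token_batches")
        if base is not None and batch is not None and batch * 2 != base:
            problems.append(f"{kind} batch rate is not half the standard rate")
    cache = costs.get("cache_read_input_token_cost")
    base_input = costs.get("input_cost_per_token")
    if cache is not None and base_input is not None and cache * 10 != base_input:
        problems.append("cache read rate is not one tenth of the input rate")
    return problems


if __name__ == "__main__":
    for model, costs in ENTRIES.items():
        print(model, check_ratios(costs) or "ok")
```

A failed check would only flag an entry for review; a provider is free to price batch or cached tokens differently.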
