Skip to content

Commit eb04aa8

Browse files
authored
set pricing mode for tiered pricing (#1020)
* set pricing mode: cumulative * set pricing mode: marginal * pricing mode: cumulative for gemini models
1 parent 177dde9 commit eb04aa8

72 files changed

Lines changed: 146 additions & 0 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

providers/aws-bedrock/au.anthropic.claude-sonnet-4-5-20250929-v1:0.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ costs:
1919
output:
2020
- cost_per_token: 0.00002475
2121
from: 200000
22+
pricing_mode: marginal
2223
- cache_creation_input_token_cost: 0.000004125
2324
cache_read_input_token_cost: 3.3e-7
2425
input_cost_per_token: 0.0000033
@@ -39,6 +40,7 @@ costs:
3940
output:
4041
- cost_per_token: 0.00002475
4142
from: 200000
43+
pricing_mode: marginal
4244
features:
4345
- function_calling
4446
- prompt_caching

providers/aws-bedrock/global.anthropic.claude-sonnet-4-5-20250929-v1:0.yaml

Lines changed: 32 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ costs:
1919
output:
2020
- cost_per_token: 0.0000225
2121
from: 200000
22+
pricing_mode: marginal
2223
- cache_creation_input_token_cost: 0.00000375
2324
cache_read_input_token_cost: 3e-7
2425
input_cost_per_token: 0.000003
@@ -39,6 +40,7 @@ costs:
3940
output:
4041
- cost_per_token: 0.0000225
4142
from: 200000
43+
pricing_mode: marginal
4244
- cache_creation_input_token_cost: 0.00000375
4345
cache_read_input_token_cost: 3e-7
4446
input_cost_per_token: 0.000003
@@ -59,6 +61,7 @@ costs:
5961
output:
6062
- cost_per_token: 0.0000225
6163
from: 200000
64+
pricing_mode: marginal
6265
- cache_creation_input_token_cost: 0.00000375
6366
cache_read_input_token_cost: 3e-7
6467
input_cost_per_token: 0.000003
@@ -79,6 +82,7 @@ costs:
7982
output:
8083
- cost_per_token: 0.0000225
8184
from: 200000
85+
pricing_mode: marginal
8286
- cache_creation_input_token_cost: 0.00000375
8387
cache_read_input_token_cost: 3e-7
8488
input_cost_per_token: 0.000003
@@ -99,6 +103,7 @@ costs:
99103
output:
100104
- cost_per_token: 0.0000225
101105
from: 200000
106+
pricing_mode: marginal
102107
- cache_creation_input_token_cost: 0.00000375
103108
cache_read_input_token_cost: 3e-7
104109
input_cost_per_token: 0.000003
@@ -119,6 +124,7 @@ costs:
119124
output:
120125
- cost_per_token: 0.0000225
121126
from: 200000
127+
pricing_mode: marginal
122128
- cache_creation_input_token_cost: 0.00000375
123129
cache_read_input_token_cost: 3e-7
124130
input_cost_per_token: 0.000003
@@ -139,6 +145,7 @@ costs:
139145
output:
140146
- cost_per_token: 0.0000225
141147
from: 200000
148+
pricing_mode: marginal
142149
- cache_creation_input_token_cost: 0.00000375
143150
cache_read_input_token_cost: 3e-7
144151
input_cost_per_token: 0.000003
@@ -159,6 +166,7 @@ costs:
159166
output:
160167
- cost_per_token: 0.0000225
161168
from: 200000
169+
pricing_mode: marginal
162170
- cache_creation_input_token_cost: 0.00000375
163171
cache_read_input_token_cost: 3e-7
164172
input_cost_per_token: 0.000003
@@ -179,6 +187,7 @@ costs:
179187
output:
180188
- cost_per_token: 0.0000225
181189
from: 200000
190+
pricing_mode: marginal
182191
- cache_creation_input_token_cost: 0.00000375
183192
cache_read_input_token_cost: 3e-7
184193
input_cost_per_token: 0.000003
@@ -199,6 +208,7 @@ costs:
199208
output:
200209
- cost_per_token: 0.0000225
201210
from: 200000
211+
pricing_mode: marginal
202212
- cache_creation_input_token_cost: 0.00000375
203213
cache_read_input_token_cost: 3e-7
204214
input_cost_per_token: 0.000003
@@ -219,6 +229,7 @@ costs:
219229
output:
220230
- cost_per_token: 0.0000225
221231
from: 200000
232+
pricing_mode: marginal
222233
- cache_creation_input_token_cost: 0.00000375
223234
cache_read_input_token_cost: 3e-7
224235
input_cost_per_token: 0.000003
@@ -239,6 +250,7 @@ costs:
239250
output:
240251
- cost_per_token: 0.0000225
241252
from: 200000
253+
pricing_mode: marginal
242254
- cache_creation_input_token_cost: 0.00000375
243255
cache_read_input_token_cost: 3e-7
244256
input_cost_per_token: 0.000003
@@ -259,6 +271,7 @@ costs:
259271
output:
260272
- cost_per_token: 0.0000225
261273
from: 200000
274+
pricing_mode: marginal
262275
- cache_creation_input_token_cost: 0.00000375
263276
cache_read_input_token_cost: 3e-7
264277
input_cost_per_token: 0.000003
@@ -279,6 +292,7 @@ costs:
279292
output:
280293
- cost_per_token: 0.0000225
281294
from: 200000
295+
pricing_mode: marginal
282296
- cache_creation_input_token_cost: 0.00000375
283297
cache_read_input_token_cost: 3e-7
284298
input_cost_per_token: 0.000003
@@ -299,6 +313,7 @@ costs:
299313
output:
300314
- cost_per_token: 0.0000225
301315
from: 200000
316+
pricing_mode: marginal
302317
- cache_creation_input_token_cost: 0.00000375
303318
cache_read_input_token_cost: 3e-7
304319
input_cost_per_token: 0.000003
@@ -319,6 +334,7 @@ costs:
319334
output:
320335
- cost_per_token: 0.0000225
321336
from: 200000
337+
pricing_mode: marginal
322338
- cache_creation_input_token_cost: 0.00000375
323339
cache_read_input_token_cost: 3e-7
324340
input_cost_per_token: 0.000003
@@ -339,6 +355,7 @@ costs:
339355
output:
340356
- cost_per_token: 0.0000225
341357
from: 200000
358+
pricing_mode: marginal
342359
- cache_creation_input_token_cost: 0.00000375
343360
cache_read_input_token_cost: 3e-7
344361
input_cost_per_token: 0.000003
@@ -359,6 +376,7 @@ costs:
359376
output:
360377
- cost_per_token: 0.0000225
361378
from: 200000
379+
pricing_mode: marginal
362380
- cache_creation_input_token_cost: 0.00000375
363381
cache_read_input_token_cost: 3e-7
364382
input_cost_per_token: 0.000003
@@ -379,6 +397,7 @@ costs:
379397
output:
380398
- cost_per_token: 0.0000225
381399
from: 200000
400+
pricing_mode: marginal
382401
- cache_creation_input_token_cost: 0.00000375
383402
cache_read_input_token_cost: 3e-7
384403
input_cost_per_token: 0.000003
@@ -399,6 +418,7 @@ costs:
399418
output:
400419
- cost_per_token: 0.0000225
401420
from: 200000
421+
pricing_mode: marginal
402422
- cache_creation_input_token_cost: 0.00000375
403423
cache_read_input_token_cost: 3e-7
404424
input_cost_per_token: 0.000003
@@ -419,6 +439,7 @@ costs:
419439
output:
420440
- cost_per_token: 0.0000225
421441
from: 200000
442+
pricing_mode: marginal
422443
- cache_creation_input_token_cost: 0.00000375
423444
cache_read_input_token_cost: 3e-7
424445
input_cost_per_token: 0.000003
@@ -439,6 +460,7 @@ costs:
439460
output:
440461
- cost_per_token: 0.0000225
441462
from: 200000
463+
pricing_mode: marginal
442464
- cache_creation_input_token_cost: 0.00000375
443465
cache_read_input_token_cost: 3e-7
444466
input_cost_per_token: 0.000003
@@ -459,6 +481,7 @@ costs:
459481
output:
460482
- cost_per_token: 0.0000225
461483
from: 200000
484+
pricing_mode: marginal
462485
- cache_creation_input_token_cost: 0.00000375
463486
cache_read_input_token_cost: 3e-7
464487
input_cost_per_token: 0.000003
@@ -479,6 +502,7 @@ costs:
479502
output:
480503
- cost_per_token: 0.0000225
481504
from: 200000
505+
pricing_mode: marginal
482506
- cache_creation_input_token_cost: 0.00000375
483507
cache_read_input_token_cost: 3e-7
484508
input_cost_per_token: 0.000003
@@ -499,6 +523,7 @@ costs:
499523
output:
500524
- cost_per_token: 0.0000225
501525
from: 200000
526+
pricing_mode: marginal
502527
- cache_creation_input_token_cost: 0.00000375
503528
cache_read_input_token_cost: 3e-7
504529
input_cost_per_token: 0.000003
@@ -519,6 +544,7 @@ costs:
519544
output:
520545
- cost_per_token: 0.0000225
521546
from: 200000
547+
pricing_mode: marginal
522548
- cache_creation_input_token_cost: 0.00000375
523549
cache_read_input_token_cost: 3e-7
524550
input_cost_per_token: 0.000003
@@ -539,6 +565,7 @@ costs:
539565
output:
540566
- cost_per_token: 0.0000225
541567
from: 200000
568+
pricing_mode: marginal
542569
- cache_creation_input_token_cost: 0.00000375
543570
cache_read_input_token_cost: 3e-7
544571
input_cost_per_token: 0.000003
@@ -559,6 +586,7 @@ costs:
559586
output:
560587
- cost_per_token: 0.0000225
561588
from: 200000
589+
pricing_mode: marginal
562590
- cache_creation_input_token_cost: 0.00000375
563591
cache_read_input_token_cost: 3e-7
564592
input_cost_per_token: 0.000003
@@ -579,6 +607,7 @@ costs:
579607
output:
580608
- cost_per_token: 0.0000225
581609
from: 200000
610+
pricing_mode: marginal
582611
- cache_creation_input_token_cost: 0.00000375
583612
cache_read_input_token_cost: 3e-7
584613
input_cost_per_token: 0.000003
@@ -599,6 +628,7 @@ costs:
599628
output:
600629
- cost_per_token: 0.0000225
601630
from: 200000
631+
pricing_mode: marginal
602632
- cache_creation_input_token_cost: 0.00000375
603633
cache_read_input_token_cost: 3e-7
604634
input_cost_per_token: 0.000003
@@ -619,6 +649,7 @@ costs:
619649
output:
620650
- cost_per_token: 0.0000225
621651
from: 200000
652+
pricing_mode: marginal
622653
- cache_creation_input_token_cost: 0.00000375
623654
cache_read_input_token_cost: 3e-7
624655
input_cost_per_token: 0.000003
@@ -639,6 +670,7 @@ costs:
639670
output:
640671
- cost_per_token: 0.0000225
641672
from: 200000
673+
pricing_mode: marginal
642674
features:
643675
- function_calling
644676
- prompt_caching

providers/aws-bedrock/jp.anthropic.claude-sonnet-4-5-20250929-v1:0.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ costs:
1919
output:
2020
- cost_per_token: 0.00002475
2121
from: 200000
22+
pricing_mode: marginal
2223
- cache_creation_input_token_cost: 0.000004125
2324
cache_read_input_token_cost: 3.3e-7
2425
input_cost_per_token: 0.0000033
@@ -39,6 +40,7 @@ costs:
3940
output:
4041
- cost_per_token: 0.00002475
4142
from: 200000
43+
pricing_mode: marginal
4244
features:
4345
- function_calling
4446
- prompt_caching

providers/aws-bedrock/us-gov.anthropic.claude-sonnet-4-5-20250929-v1:0.yaml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ costs:
1919
output:
2020
- cost_per_token: 0.000027
2121
from: 200000
22+
pricing_mode: marginal
2223
- cache_creation_input_token_cost: 0.0000045
2324
cache_read_input_token_cost: 3.6e-7
2425
input_cost_per_token: 0.0000036
@@ -39,6 +40,7 @@ costs:
3940
output:
4041
- cost_per_token: 0.000027
4142
from: 200000
43+
pricing_mode: marginal
4244
features:
4345
- function_calling
4446
- prompt_caching

providers/azure-open-ai/gpt-5.4-2026-03-05.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ costs:
1313
output:
1414
- cost_per_token: 0.0000225
1515
from: 272000
16+
pricing_mode: cumulative
1617
deprecationDate: "2027-09-05"
1718
features:
1819
- function_calling

providers/azure-open-ai/gpt-5.4-pro-2026-03-05.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ costs:
1313
output:
1414
- cost_per_token: 0.00027
1515
from: 272000
16+
pricing_mode: cumulative
1617
deprecationDate: "2027-09-05"
1718
features:
1819
- function_calling

providers/deepinfra/ByteDance/Seed-1.8.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ costs:
1313
output:
1414
- cost_per_token: 0.000004
1515
from: 128000
16+
pricing_mode: marginal
1617
features:
1718
- function_calling
1819
- json_output

providers/deepinfra/ByteDance/Seed-2.0-mini.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ costs:
1313
output:
1414
- cost_per_token: 8e-7
1515
from: 128000
16+
pricing_mode: marginal
1617
features:
1718
- function_calling
1819
- json_output

providers/deepinfra/ByteDance/Seed-2.0-pro.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,6 +13,7 @@ costs:
1313
output:
1414
- cost_per_token: 0.000006
1515
from: 128000
16+
pricing_mode: marginal
1617
features:
1718
- function_calling
1819
- prompt_caching

providers/deepinfra/Qwen/Qwen3-Max-Thinking.yaml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,6 +19,7 @@ costs:
1919
from: 32000
2020
- cost_per_token: 0.000015
2121
from: 128000
22+
pricing_mode: marginal
2223
features:
2324
- function_calling
2425
- json_output

0 commit comments

Comments
 (0)