Summary
When using budget models (Minimax M2.7, DeepSeek 4 Pro), billing is calculated at GPT-5.5 rates instead of the selected model's rates.
Evidence
| Chat |
Model |
Billed |
Expected |
Overcharge |
| pip dependency fix |
Minimax M2.7 |
$9.68 |
$0.53 |
$9.15 (18x) |
| Open Interpreter |
DeepSeek 4 Pro |
$4.47 |
$0.49 |
$3.98 (9x) |
| Khoj setup |
Mimo 2.5 Pro |
$10.85 |
~$10.50 |
✅ Correct |
Total overcharge: ~$13.13
Example calculation (Step 119, Minimax M2.7)
- Input: 144,602 tokens, Output: 168 tokens
- Billed: $0.96
- Expected (Minimax M2.7): $0.052
- GPT 5.5 rate: $0.96 ✅ exact match
Models API pricing (from /api/models)
| Model |
Input ($/M) |
Output ($/M) |
Tier |
| Minimax M2.7 |
$0.36 |
$1.44 |
$ (budget) |
| DeepSeek 4 Pro |
$2.52 |
$5.28 |
$$ |
| GPT 5.5 |
$6.00 |
$36.00 |
$$$ |
Impact
Budget models are being billed at flagship prices. Users selecting cheap models get unexpectedly high charges.
Environment
- OS: Windows 11
- Date: 2026-05-18/19
- Chat API response IDs:
chatcmpl-* format (OpenAI-compatible)
Suggested fix
- Use selected model's pricing for cost calculation
- Add transparency: show which model's rates are being used
- Warn when cost deviates significantly from expected
Summary
When using budget models (Minimax M2.7, DeepSeek 4 Pro), billing is calculated at GPT-5.5 rates instead of the selected model's rates.
Evidence
Total overcharge: ~$13.13
Example calculation (Step 119, Minimax M2.7)
Models API pricing (from
/api/models)Impact
Budget models are being billed at flagship prices. Users selecting cheap models get unexpectedly high charges.
Environment
chatcmpl-*format (OpenAI-compatible)Suggested fix