Description
Hello,
I'm planning to use ecologits to compare utilization between Claude 3 and Claude 3.5, but I've noticed an inconsistency in the model configurations: https://github.com/genai-impact/ecologits/blob/main/ecologits/data/models.json
The file lists:
- "claude-3-5-sonnet-latest" with 440B parameters (55 - 220B active)
- "claude-3-sonnet-20240229" with 800B parameters (100 - 400B active)
While I understand these are estimates due to the unknown architecture, it's surprising that Claude 3 Sonnet, which is older, is listed with more parameters than Claude 3.5 Sonnet.
Could you please clarify or verify these parameter counts? It seems counterintuitive that the newer model would have fewer parameters.
Example Code