@kevint-cerebras
Contributor

Adding zai-glm-4.6 as a model for Cerebras.

This will go live on Nov. 5th, and replace Qwen 3 Coder, which will be deprecated.

Adds Z.AI GLM-4.6 model hosted by Cerebras with:
- Context window: 128k tokens (131,072)
- Max completion: 40k tokens (40,960)
- Free pricing (0 input/output)
- Text-only modalities
- No prompt caching
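
For illustration only, the limits above can be written down as a small Python record. This is a sketch, not the format used by this repository: the `ModelSpec` class, its field names, and the `fits_in_context` helper are all hypothetical, and the helper assumes prompt and completion tokens share the same context window.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical record mirroring the fields listed in this PR."""
    model_id: str
    context_tokens: int
    max_output_tokens: int
    input_cost: float
    output_cost: float
    modalities: tuple
    prompt_caching: bool


GLM_4_6 = ModelSpec(
    model_id="zai-glm-4.6",
    context_tokens=128 * 1024,    # 128k tokens = 131,072
    max_output_tokens=40 * 1024,  # 40k tokens = 40,960
    input_cost=0.0,               # free pricing
    output_cost=0.0,
    modalities=("text",),         # text-only
    prompt_caching=False,
)


def fits_in_context(spec: ModelSpec, prompt_tokens: int, completion_tokens: int) -> bool:
    """Check a request against the limits.

    Assumption: the prompt and the completion both count toward the
    context window. This is not confirmed by the PR.
    """
    if completion_tokens > spec.max_output_tokens:
        return False
    return prompt_tokens + completion_tokens <= spec.context_tokens
```

Under that assumption, a 90,000-token prompt with a full 40,960-token completion fits in the window, while a 100,000-token prompt with the same completion does not.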

@rekram1-node
Contributor

To avoid people running into issues if they try to use it early, I will hold off on merging this until it is released.

@danielkim-cerebras

We are rolling out GLM access to existing customers starting today. Could we get this merged?

@rekram1-node
Contributor

rekram1-node commented Oct 30, 2025

I'm dumb, I didn't realize y'all were Cerebras. Merging.

Didn't read your name 🤦‍♂️

@rekram1-node merged commit b97ee92 into sst:dev on Oct 30, 2025
1 check passed

3 participants