-
Notifications
You must be signed in to change notification settings - Fork 60
Add Qwen3 and Qwen3-MoE #41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
@Goekdeniz-Guelmez can you combine efforts with @Blaizzy here and merge in #42. The two PRs were back to back so let's make you both co-authors. But I think since this one came in first unless there is a good reason we can use it as the working draft. |
Awesome, lets collaborate once again @Goekdeniz-Guelmez! |
If you are interested you can check the @Goekdeniz-Guelmez moe. still needs some work |
Wow, that was actually amazing timing :D, yes your PR should merge singer you was faster then me, I'll just look at your code here and give you some comments, and delete my PR. |
#44 Here is the draft PR, im now trying inference with it, I'll tell you how it goes. |
Awesome! |
Hey @awni Fixed the bug on the MoE, unlike previous v2, the v3 now normalizes the experts topk probs. We are ready to review, merge and wait for the official release! |
Thanks! |
My pleasure! |
No description provided.