Conversation

suryabdev
Contributor

@suryabdev suryabdev commented Oct 13, 2025

Follow-up of #1801 and a fix for #1808.

The default model for InferenceClientModel is Qwen/Qwen2.5-Coder-32B-Instruct. It does not work because the current default provider doesn't support tool calling (more details in the issue).
This PR changes the default model to Qwen/Qwen3-Next-80B-A3B-Thinking

@suryabdev
Contributor Author

cc: @albertvillanova / @aymeric-roucher please review when you are free

@aymeric-roucher
Collaborator

@suryabdev Qwen3-Coder isn't that good in my experience: Qwen/Qwen3-Next-80B-A3B-Thinking worked better (but I only tried a few runs).

@suryabdev
Contributor Author

@aymeric-roucher I've only run some basic tests myself to check functionality; I haven't run any benchmarks like GAIA.
I've changed the model to Qwen/Qwen3-Next-80B-A3B-Thinking

@suryabdev
Contributor Author

suryabdev commented Oct 13, 2025

Just something to note: the price of the default model will increase, but all providers support tool calling (Doc). From

[image: provider pricing for the old default model]

to

[image: provider pricing for the new default model]

Member

@albertvillanova albertvillanova left a comment


To choose a new default model, I would recommend benchmarking the candidates and selecting based on their performance.

Another possibility would be to keep the current default model and just add the associated provider that was working until now.

@suryabdev
Contributor Author

suryabdev commented Oct 14, 2025

I would recommend benchmarking the candidates and selecting based on their performance

@albertvillanova That is fair, I haven't run the benchmarks before. Let me try to run them now
https://github.com/huggingface/smolagents/blob/main/examples/smolagents_benchmark/run.py

Another possibility would be to keep the current default model and just add the associated provider that was working until now.

I don't think we should change the default provider for the InferenceClientModel; that might impact situations where a user tries a different model and doesn't set the provider.

provider: str | None = None,

The InferenceClient has good auto-picking behavior to choose the cheapest provider.
Adding conditional logic that sets the provider only if the model is Qwen/Qwen2.5-Coder-32B-Instruct or None could work, but doesn't feel very clean to me.

@aymeric-roucher
Collaborator

aymeric-roucher commented Oct 14, 2025

I would be strongly in favor of updating the default model in InferenceClient, as everywhere else: the seed is random anyway, so there shouldn't be any conditional logic based on using Qwen/Qwen2.5-Coder-32B-Instruct rather than another model.
Plus, the Qwen3 series of models is just a net improvement over 2.5 in all aspects: latency, performance, price. So let's not lock ourselves into another model. And anyway, providers will probably end up discontinuing 2.5 before 2026, as new models keep rolling in.

@suryabdev
Contributor Author

suryabdev commented Oct 15, 2025

Ran the benchmark only for the CodeAgent. Qwen/Qwen3-Next-80B-A3B-Thinking is an improvement over Qwen/Qwen2.5-Coder-32B-Instruct.

[image: benchmark results]

Found some bugs while running the benchmark script so I raised a PR #1822.
I had some questions on running the benchmark for the ToolCallingAgent, which I mentioned in that PR.

@suryabdev
Contributor Author

suryabdev commented Oct 15, 2025

the seed is random anyway so there shouldn't be any conditional logic based on using Qwen/Qwen2.5-Coder-32B-Instruct rather than another model.

@aymeric-roucher sorry, could you elaborate? I didn't fully understand. Do you mean the InferenceClient randomly picks a provider?

@aymeric-roucher
Collaborator

@suryabdev I meant that changing the model should not break working pipelines for users (except if their pipeline has an assert check on the model_id), because there's no expectation of reproducibility anyway when using generation.
So tests asserting that my pipeline outputs exactly "Hi, I'm an assistant and the answer is A" won't exist (they would be broken by randomness), so we don't have to fear that updating the model could break working pipelines.

Collaborator

@aymeric-roucher aymeric-roucher left a comment


Thank you @suryabdev ! 😃 Only need to fix conflicts before going ahead!

@suryabdev
Contributor Author

@aymeric-roucher Thanks for the review! I resolved the merge conflicts.
You can trigger the PR checks when you are free.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@albertvillanova albertvillanova left a comment


Thanks! We should update the PR title and description accordingly.

  def __init__(
      self,
-     model_id: str = "Qwen/Qwen3-Next-80B-A3B-Instruct",
+     model_id: str = "Qwen/Qwen3-Next-80B-A3B-Thinking",
Member


Oh, it seems the default model was already changed before this PR! 😲

Contributor Author


It was changed in PR #1801 yesterday, but since there has been no release yet, no users will be impacted.

@suryabdev suryabdev changed the title Change default InferenceClient model to Qwen3-Coder-30B-A3B-Instruct Change default InferenceClient model to Qwen/Qwen3-Next-80B-A3B-Thinking Oct 16, 2025
@suryabdev
Contributor Author

We should update the PR title and description accordingly.

@albertvillanova thanks for the review, I've updated the PR title and description
Please merge when you are free

@albertvillanova albertvillanova merged commit 2de6550 into huggingface:main Oct 16, 2025
4 checks passed