Nemotron 3 Super DGX Spark Deployment Guide broken

**Describe the bug**

Nemotron 3 Super setup steps for the DGX Spark Setup  are broken [1] 

**Steps/Code to reproduce bug**

Follow the steps from the guide (https://docs.nvidia.com/nemotron/nightly/usage-cookbook/Nemotron-3-Super/SparkDeploymentGuide/README.html#vllm) to get vLLM running from docker image and available on port 8000

**Expected behavior**

Resulting openai compatible endpoint isnt working (tested from latest version of Pi and Opencode).


**Additional context**

Might be related with a vllm reasoning config issue upstream but despite that the steps are not functioning.

Sometimes you can see how the agent leaks <thinking> or  </output> messages in the conversation.

[1]  https://docs.nvidia.com/nemotron/nightly/usage-cookbook/Nemotron-3-Super/SparkDeploymentGuide/README.html#

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nemotron 3 Super DGX Spark Deployment Guide broken #132

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Nemotron 3 Super DGX Spark Deployment Guide broken #132

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions