NVIDIA-NeMo · chrisalexiuk-nvidia · Apr 10, 2026 · Apr 9, 2026
diff --git a/usage-cookbook/Nemotron-3-Super/SparkDeploymentGuide/README.md b/usage-cookbook/Nemotron-3-Super/SparkDeploymentGuide/README.md
@@ -1,6 +1,6 @@
 # Nemotron 3 Super — DGX Spark Deployment Guide
 
-DGX Spark ships a single Grace-Blackwell GPU with 128 GB of unified memory. This guide covers serving Nemotron 3 Super on a single DGX Spark using vLLM (nightly) and TensorRT-LLM.
+DGX Spark ships a single Grace-Blackwell GPU with 128 GB of unified memory. This guide covers serving Nemotron 3 Super on a single DGX Spark using vLLM and TensorRT-LLM.
 
 ## Architecture Refresher
 
@@ -32,8 +32,6 @@ wget https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-NVFP4/raw/m
 vllm/vllm-openai:cu130-nightly
 ```
 
-MTP + NVFP4 on DGX Spark requires a vLLM nightly build (cu130). The pinned release `0.17.1` does not support this combination on a single-GPU Spark configuration.
-
 ### Serve Command
 
 ```bash