A collection of playbooks and recipes for deploying AI models and workloads on NVIDIA DGX Spark systems.
This repository contains ready-to-use playbooks for running various AI models and frameworks optimized for NVIDIA DGX Spark GPUs. Each playbook includes Docker configurations, build scripts, and comprehensive documentation.
Run the Nemotron Nano VL 12B V2 vision-language model using vLLM with FP4 quantization.
- Model: nvidia/Nemotron-Nano-VL-12B-V2-FP4-QAD
- Framework: vLLM (optimized for inference)
- Quantization: FP4 for reduced memory usage
- API: OpenAI-compatible REST API
- Use Cases: Vision-language tasks, multimodal AI applications
spark-playbooks/
├── README.md # This file
├── run-nemotron-v2-VL/ # Nemotron VL playbook
│ ├── README.md # Playbook-specific documentation
│ ├── build.sh # Build script
│ ├── run.sh # Run script
│ ├── Dockerfile # Docker configuration
│ └── patch1.patch # Custom patches
└── [other-playbooks]/ # Additional playbooks
└── README.md # Playbook-specific documentation
- Choose a playbook from the list above
- Navigate to the playbook directory:
cd [playbook-name] - Follow the playbook's README.md for detailed instructions
All playbooks in this repository require:
- NVIDIA DGX Spark system with GPU support
- Docker installed with NVIDIA Container Toolkit
- NVIDIA GPU drivers (recommended: latest stable version)
- Sufficient disk space (varies by playbook)
Check that your system is ready:
# Check NVIDIA drivers
nvidia-smi
# Check Docker NVIDIA runtime
docker run --rm --gpus all nvidia/cuda:12.0.0-base-ubuntu22.04 nvidia-smi
# Check available disk space
df -hWe welcome contributions! To add a new playbook:
- Create a new directory with a descriptive name
- Include all necessary files (Dockerfile, scripts, patches)
- Add a comprehensive README.md in the playbook directory
- Update this main README.md to list your playbook
Each playbook should include:
- Overview: What the playbook does
- Prerequisites: Specific requirements
- Build Instructions: Step-by-step build process
- Run Instructions: How to launch and configure
- Usage Examples: API calls, code samples
- Configuration: Tuning parameters
- Troubleshooting: Common issues and solutions
For playbook-specific issues, refer to the individual playbook's README.md documentation.
For general DGX Spark system issues, contact NVIDIA support.
See LICENSE file for details.