Qwen 3 VL 4B Instruct FP8 Model Setup

Step-by-Step Instructions

First, create a model folder in Backend.AI with the name that matches the --local-dir parameter (without the /home/work prefix).

Model Folder Name: qwen3-vl-4b-instruct-fp8

Create a batch-type session and mount the model folder:

Session Type: Batch
Mount the qwen3-vl-4b-instruct-fp8 folder to /home/work/qwen3-vl-4b-instruct-fp8

Once you have your batch session running with the mounted folder, execute the following command:

hf download --local-dir /home/work/qwen3-vl-4b-instruct-fp8 Qwen/Qwen3-VL-4B-Instruct-FP8

Important: The --local-dir path must match where you mounted your model folder.

The model will be downloaded from Hugging Face Hub
Ensure you have sufficient storage space for the 4B FP8 model
The download path /home/work/qwen3-vl-4b-instruct-fp8 should match your mounted folder path
This model is quantized in FP8 format for efficient inference