[Prototype] Add interactive deployment config generator by mgoin · Pull Request #318 · vllm-project/recipes

mgoin · 2026-04-07T19:35:12Z

Summary

Adds a reusable interactive configuration selector widget for vLLM deployment recipes, inspired by SGLang Cookbook's ConfigGenerator
Built with vanilla HTML/CSS/JS that works natively with MkDocs Material (no React or build step needed)
Includes a proof-of-concept in the Llama 3.3 70B recipe with options for hardware platform, quantization, tensor parallelism, and prefix caching
Supports light/dark mode via MkDocs Material CSS variables, dynamic option dependencies (e.g. NVFP4 auto-disabled on Hopper), and copy-to-clipboard

How to add to other recipes

Recipe authors just add a <div> and <script> block (~40 lines) defining model-specific options and a generateCommand function. The base component (assets/config-generator.js + assets/config-generator.css) handles all the rendering and state management.
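As a rough illustration of what such a block might define, the sketch below assumes the option shape implied by `_getInitialState` in assets/config-generator.js: an `options` map whose entries carry either static `items` (with `id` and `default`) or a `getDynamicItems(values)` callback. The `label` and `disabled` fields, the TP choices, the placeholder model ID, and the object name `recipeConfig` are hypothetical and not taken from the PR; how the object is handed to the base component is also not shown here.

```javascript
// Hypothetical recipe config. Only items/id/default/getDynamicItems and
// generateCommand are implied by the PR; everything else is illustrative.
const recipeConfig = {
  options: {
    hardware: {
      items: [
        { id: 'blackwell', label: 'Blackwell', default: true },
        { id: 'hopper', label: 'Hopper' },
      ],
    },
    quantization: {
      // Dynamic dependency: NVFP4 is marked disabled when Hopper is selected.
      getDynamicItems(values) {
        const onHopper = values.hardware === 'hopper';
        return [
          { id: 'nvfp4', label: 'NVFP4', disabled: onHopper, default: !onHopper },
          { id: 'fp8', label: 'FP8', default: onHopper },
        ];
      },
    },
    tp: {
      items: [
        { id: '1', label: 'TP=1' },
        { id: '2', label: 'TP=2', default: true },
      ],
    },
  },
  // Builds the command string from the currently selected option IDs.
  generateCommand(values) {
    // Placeholder model ID, not a real checkpoint name.
    const model = `example-org/Llama-3.3-70B-${values.quantization.toUpperCase()}`;
    return `vllm serve ${model} \\\n  --tensor-parallel-size ${values.tp}`;
  },
};

console.log(recipeConfig.generateCommand({ hardware: 'blackwell', quantization: 'fp8', tp: '2' }));
```

Keeping `generateCommand` a pure function of the selected values makes it easy to check a recipe's output without rendering the widget.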

Test plan

Run mkdocs serve and verify the Llama 3.3 70B recipe shows the interactive selector
Click through hardware/quantization/TP options and verify the generated command updates correctly
Verify NVFP4 is disabled when Hopper is selected
Verify dark mode styling works
Verify copy button works

🤖 Generated with Claude Code

Add a reusable interactive configuration selector widget (similar to SGLang Cookbook's ConfigGenerator) that lets users pick hardware platform, quantization, tensor parallelism, and other options to auto-generate the correct vllm serve command.

Includes a proof-of-concept integration in the Llama 3.3 70B recipe with options for Blackwell/Hopper hardware, NVFP4/FP8 quantization, TP size, and prefix caching.

Other recipes can adopt the widget by adding a small HTML/JS block that defines model-specific options and a generateCommand function.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

gemini-code-assist

Code Review

This pull request introduces an interactive command generator for Llama 3.3-70B deployment, allowing users to select hardware, quantization, and parallelism settings to generate a vLLM serve command. The implementation includes a new vanilla JavaScript utility, custom CSS, and updates to the documentation and mkdocs configuration. Review feedback focuses on ensuring the generated command correctly overrides YAML defaults for prefix caching, optimizing the initialization logic in the JavaScript class, and adding safety checks for the clipboard API.

gemini-code-assist · 2026-04-07T19:38:21Z

Llama/Llama3.3-70B.md

+      if (values.prefixCaching === 'disabled') {
+        cmd += ' \\\n  --no-enable-prefix-caching';
+      }


The current logic only adds the --no-enable-prefix-caching flag when "Disabled" is selected. However, the provided YAML configuration files (Llama3.3_Blackwell.yaml and Llama3.3_Hopper.yaml) already have no-enable-prefix-caching: true set. This means that selecting "Enabled" in the interactive tool will not actually enable prefix caching, as the YAML setting will persist. To ensure the interactive tool works as expected, you should explicitly add the --enable-prefix-caching flag when "Enabled" is selected, as CLI arguments typically override configuration file settings in vLLM.

Suggested change

-      if (values.prefixCaching === 'disabled') {
-        cmd += ' \\\n  --no-enable-prefix-caching';
-      }
+      if (values.prefixCaching === 'enabled') {
+        cmd += ' \\\n  --enable-prefix-caching';
+      } else {
+        cmd += ' \\\n  --no-enable-prefix-caching';
+      }
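The point of this suggestion is that the generated command states the prefix-caching choice in both branches, so it does not silently inherit whatever the YAML file sets. A minimal sketch of that branch in isolation, assuming a hypothetical helper named `appendPrefixCachingFlag` and the `'enabled'` / `'disabled'` IDs used by the recipe:

```javascript
// Hypothetical helper: always append an explicit prefix-caching flag so the
// selection is visible on the command line in both cases.
function appendPrefixCachingFlag(cmd, values) {
  const flag = values.prefixCaching === 'enabled'
    ? '--enable-prefix-caching'
    : '--no-enable-prefix-caching';
  // Same line-continuation format as the rest of the generated command.
  return cmd + ' \\\n  ' + flag;
}

console.log(appendPrefixCachingFlag('vllm serve <model>', { prefixCaching: 'enabled' }));
console.log(appendPrefixCachingFlag('vllm serve <model>', { prefixCaching: 'disabled' }));
```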

gemini-code-assist · 2026-04-07T19:38:22Z

assets/config-generator.js

+  _getInitialState() {
+    const state = {};
+    for (const [key, option] of Object.entries(this.config.options)) {
+      let items = option.items;
+      if (option.getDynamicItems) {
+        // Bootstrap: build default values from static items first
+        const defaults = {};
+        for (const [k, opt] of Object.entries(this.config.options)) {
+          if (opt.items && opt.items.length > 0) {
+            const d = opt.items.find(i => i.default);
+            defaults[k] = d ? d.id : opt.items[0].id;
+          }
+        }
+        items = option.getDynamicItems(defaults);
+      }
+      const defaultItem = items && items.find(i => i.default);
+      state[key] = defaultItem ? defaultItem.id : (items && items[0] ? items[0].id : '');
+    }
+    return state;
+  }


The defaults object is currently recalculated inside the loop for every option that uses getDynamicItems. It is more efficient to pre-calculate these defaults once at the beginning of the method.

Suggested change

-  _getInitialState() {
-    const state = {};
-    for (const [key, option] of Object.entries(this.config.options)) {
-      let items = option.items;
-      if (option.getDynamicItems) {
-        // Bootstrap: build default values from static items first
-        const defaults = {};
-        for (const [k, opt] of Object.entries(this.config.options)) {
-          if (opt.items && opt.items.length > 0) {
-            const d = opt.items.find(i => i.default);
-            defaults[k] = d ? d.id : opt.items[0].id;
-          }
-        }
-        items = option.getDynamicItems(defaults);
-      }
-      const defaultItem = items && items.find(i => i.default);
-      state[key] = defaultItem ? defaultItem.id : (items && items[0] ? items[0].id : '');
-    }
-    return state;
-  }
+  _getInitialState() {
+    const state = {};
+    const defaults = {};
+    for (const [k, opt] of Object.entries(this.config.options)) {
+      if (opt.items && opt.items.length > 0) {
+        const d = opt.items.find(i => i.default);
+        defaults[k] = d ? d.id : opt.items[0].id;
+      }
+    }
+    for (const [key, option] of Object.entries(this.config.options)) {
+      let items = option.items;
+      if (option.getDynamicItems) {
+        items = option.getDynamicItems(defaults);
+      }
+      const defaultItem = items && items.find(i => i.default);
+      state[key] = defaultItem ? defaultItem.id : (items && items[0] ? items[0].id : '');
+    }
+    return state;
+  }
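To see the two-pass behaviour outside the class, here is a standalone sketch of the same logic. The function name `getInitialState`, the `pickDefault` helper, and the sample options are hypothetical; only the `items` / `default` / `getDynamicItems` shape comes from the code under review.

```javascript
// Standalone sketch: compute static defaults once, then resolve every
// option (static or dynamic) against them.
function getInitialState(options) {
  // Default item's id, else the first item's id, else an empty string.
  const pickDefault = (items) => {
    if (!items || items.length === 0) return '';
    const d = items.find((i) => i.default);
    return d ? d.id : items[0].id;
  };

  // Pass 1: defaults for options that declare static items.
  const defaults = {};
  for (const [key, opt] of Object.entries(options)) {
    if (opt.items && opt.items.length > 0) defaults[key] = pickDefault(opt.items);
  }

  // Pass 2: dynamic options see only the static defaults from pass 1.
  const state = {};
  for (const [key, opt] of Object.entries(options)) {
    const items = opt.getDynamicItems ? opt.getDynamicItems(defaults) : opt.items;
    state[key] = pickDefault(items);
  }
  return state;
}

// Hypothetical options, for illustration only.
const initialState = getInitialState({
  hardware: { items: [{ id: 'blackwell' }, { id: 'hopper', default: true }] },
  quantization: {
    getDynamicItems: (v) =>
      v.hardware === 'hopper' ? [{ id: 'fp8' }] : [{ id: 'nvfp4' }, { id: 'fp8' }],
  },
});
console.log(initialState);
```

One property holds in both the original and the suggested version: `defaults` is built from static `items` only, so a dynamic option never sees another dynamic option's default during initialization.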

gemini-code-assist · 2026-04-07T19:38:22Z

assets/config-generator.js

+    copyBtn.addEventListener('click', () => {
+      navigator.clipboard.writeText(command).then(() => {
+        copyBtn.textContent = 'Copied!';
+        setTimeout(() => { copyBtn.textContent = 'Copy'; }, 1500);
+      });
+    });


Accessing navigator.clipboard without checking for its existence can lead to runtime errors in non-secure contexts (HTTP) or older browsers where the API is not available.

Suggested change

-    copyBtn.addEventListener('click', () => {
-      navigator.clipboard.writeText(command).then(() => {
-        copyBtn.textContent = 'Copied!';
-        setTimeout(() => { copyBtn.textContent = 'Copy'; }, 1500);
-      });
-    });
+    copyBtn.addEventListener('click', () => {
+      if (!navigator.clipboard) return;
+      navigator.clipboard.writeText(command).then(() => {
+        copyBtn.textContent = 'Copied!';
+        setTimeout(() => { copyBtn.textContent = 'Copy'; }, 1500);
+      });
+    });
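The suggested guard returns silently when the API is missing, and a rejected `writeText` promise (for example, when permission is denied) would still go unhandled. A sketch that covers both cases, using a hypothetical `copyCommand` helper that takes the `navigator`-like object as a parameter so it can be exercised outside a browser:

```javascript
// Hypothetical helper: resolves to true on success, and to false when the
// Clipboard API is unavailable or the write is rejected.
function copyCommand(text, nav) {
  if (!nav || !nav.clipboard || typeof nav.clipboard.writeText !== 'function') {
    return Promise.resolve(false);
  }
  return nav.clipboard.writeText(text).then(
    () => true,
    () => false
  );
}

// Assumed wiring inside the widget (the 'Copy failed' label is illustrative):
// copyBtn.addEventListener('click', () => {
//   copyCommand(command, navigator).then((ok) => {
//     copyBtn.textContent = ok ? 'Copied!' : 'Copy failed';
//     setTimeout(() => { copyBtn.textContent = 'Copy'; }, 1500);
//   });
// });
```

Reporting the outcome as a boolean lets the caller decide how to surface a failure instead of leaving the button unchanged.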

ProExpertProg

Looks nice!

gemini-code-assist bot reviewed Apr 7, 2026

ProExpertProg approved these changes Apr 7, 2026

mgoin marked this pull request as draft April 7, 2026 19:44

mgoin wants to merge 1 commit into main from add-interactive-config-generator
