Skip to content

Conversation

@Yu-Jack
Copy link
Collaborator

@Yu-Jack Yu-Jack commented Feb 6, 2026

@Yu-Jack Yu-Jack self-assigned this Feb 6, 2026
@github-actions
Copy link

github-actions bot commented Feb 6, 2026

Name Link
🔨 Latest commit 83822b5
😎 Deploy Preview https://698592054d4685a6871e5cfe--harvester-preview.netlify.app

- Add warning to restart VMs immediately after modifying PCI/vGPU devices
- Document annotation to skip addon disable check when VMs are stopped with devices attached
- Explain synchronization importance between VM spec and running state

Signed-off-by: Jack Yu <[email protected]>
@Yu-Jack Yu-Jack marked this pull request as ready for review February 9, 2026 01:43
Copilot AI review requested due to automatic review settings February 9, 2026 01:43
Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds user-facing warnings to Harvester advanced add-on docs to reduce misconfiguration risk when managing PCI/vGPU devices on VMs, aligning with the problem described in harvester/harvester#9531.

Changes:

  • Add a warning to restart VMs immediately after adding/removing PCI devices.
  • Document an annotation-based escape hatch to bypass the add-on disable check for the PCI devices controller.
  • Add equivalent warning + annotation guidance for vGPU management via the NVIDIA Driver Toolkit add-on.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File Description
docs/advanced/addons/pcidevices.md Adds a warning about VM restart after PCI device changes and documents a skip-check annotation for add-on disable operations.
docs/advanced/addons/nvidiadrivertoolkit.md Adds a warning about VM restart after vGPU changes and documents a skip-check annotation for add-on disable operations.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copy link
Member

@w13915984028 w13915984028 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, some suggestions.

BTW, in the create vm main page https://docs.harvesterhci.io/v1.7/vm/index, there is no link to gpu/pci/usb ..., @jillian-maroket should we add links to addons? thanks.


This is accomplished by using the `pcidevices-controller` addon.

To use the PCI devices feature, users need to enable the `pcidevices-controller` addon first.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

could we add one line on L22 about the new disable check on v180? new L62 could be moved to here or kept.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can see how @jillian-maroket would like to arrange this.

@@ -32,6 +32,16 @@ On pod startup, the ENTRYPOINT script will download the NVIDIA driver from the s

The `PCIDevices` addon can now leverage this addon to manage the lifecycle of the vGPU devices on nodes containing supported GPU [devices](../vgpusupport.md).

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similar, L34 about the new check when disabling addon


**Always restart the VM immediately after attaching or detaching vGPU devices.**

Although rebooting the VM after editing its spec is not mandatory, we strongly recommend doing so to ensure proper synchronization. Without an immediate reboot, the addon disable check might not accurately detect devices in use.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

instant rebooting helps check on addon rebooting, also avoids potential resources conflicts even when addon is still enabled

Copy link
Member

@WebberHuang1118 WebberHuang1118 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thanks for the enhancement.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants