Skip to content

v1.2.0 - Major Compatibility Update & VRAM Management

Choose a tag to compare

@wildminder wildminder released this 03 Sep 06:54
· 29 commits to main since this release
2aa03a8

This is a significant update that resolves critical compatibility issues with the latest transformers library, introduces a powerful new VRAM management feature, and fixes several stability bugs. Users are strongly encouraged to update

🚀 Key Features & Improvements

  • Transformers 4.56+ Compatibility: This release fixes the critical _prepare_cache_for_generation() takes 6 positional arguments but 7 were given error, making the node fully compatible with the latest versions of the transformers library. The fix is backwards compatible, so it will continue to work seamlessly with older versions as well.

  • 🔧 New force_offload Parameter: A new toggle has been added to the node to force the model to offload from VRAM after each generation. This is incredibly useful for complex workflows or systems with limited VRAM, helping to prevent out-of-memory errors.

    • Keep it enabled for maximum memory savings between runs.
    • Keep it disabled for faster subsequent generations if you have sufficient VRAM.
image
  • 🗣️ Enhanced Multi-Speaker Stability: Fixed a critical bug related to DynamicCache in newer transformers versions that caused errors during multi-speaker audio generation. You can now reliably use multiple speakers without issues.

🐛 Bug Fixes

This release also addresses several underlying bugs to improve stability and compatibility with ComfyUI:

  • Fixed ComfyUI API Incompatibility: Resolved an error by replacing a call to the non-existent unload_model_clones() with the correct unload_all_models() function from ComfyUI's model management API.
  • Fixed AttributeError on Offload: Corrected an AttributeError: 'VibeVoicePatcher' object has no attribute 'is_loaded' that occurred when using the new force_offload feature.
  • Fixed DynamicCache Error: The code no longer incorrectly attempts to access .key_cache on DynamicCache objects, which resolves errors and ensures multi-speaker functionality works correctly with recent library updates.

✅ What This Means For You

  • Upgrade Safely: You can now update your transformers library without worrying about breaking the VibeVoice node.
  • Better VRAM Management: Use the force_offload option to free up precious GPU memory for other tasks in your workflow.
  • More Reliability: Multi-speaker generation is now stable on the latest libraries, and interactions with ComfyUI's core are more robust.

💾 How to Install

via the ComfyUI Manager or do it manually:

git clone https://github.com/wildminder/ComfyUI-VibeVoice

💾 How to Upgrade

Update via the ComfyUI Manager, or navigate to your ComfyUI/custom_nodes/ComfyUI-VibeVoice directory and run:

git pull

Then, restart ComfyUI.


A huge thank you to the community for reporting these issues. This update makes the VibeVoice node more stable, flexible, and future-proof.