A few weeks ago I had updated my GPU driver from 590.48.1 to 595.xx.yy, when I tried to run a standard LTx 2.3 workflow that I had ran minutes before updating the driver, my generation times went from 90-200 seconds to minutes before the ksampler even showed progress. Watching my systems resource monitor I noticed that comfyui was streaming data from cache instead of reading the model, vae, and clip then loading them to vram.
Thought it might be torch related and needed to be upgraded, so I upgraded torch from 2.10 + cu130 to 2.12 + cu132, rebuild sageattention wheels, and fixed any other issues with my uv environment, yet vram was still not running correctly. Even updating comfyui and restarting pc had no effect. At that point I rolled my driver back to 590 and everything worked just fine.
Fast forward to june 6/7, I was using musubi trainer to train an ltx 2.3 lora in it own uv environment, the training finished up and I went to test the lora, suddenly with nothing loaded in Vram, 52GB of ram available, and 92GB of swap file free, comfyui is ooming. I removed the lora, comfyui still ooms, I restart my pc and comfyui, still oom.
I try to run LTx 2.3 wf again, but I monitor everything with kijais memory visualizer, looks like torch loads stuff to vram but then pins WAY more than it needs and ooms after it attempts to offload everything. I go through and try other workflows that I know take a lot memory, Klein no issues, Wan 2.2 no issues unless I set the resolution higher.
Currently I'm playing around with --fast launch options when this happened, both times I had --fast cublas_ops
I don't have a good steps to reproduce, each time its usually been while training loras.
the first time when rolling back drivers fixed the issue, I was using Rs-nodes lora training nodes.
This time I'm using musubi tuner in a seperate uv enviroment from comfyui, and driver 590.48.1
Here is the last oom log with --fast cublab_ops:
[2026-06-07 20:37:42.153] !!! Exception during processing !!! Allocation on device 0 would exceed allowed memory. (out of memory)
Currently allocated : 11.33 GiB
Requested : 5.10 GiB
Device limit : 23.51 GiB
Free (according to CUDA): 103.06 MiB
PyTorch limit (set by user-supplied memory fraction)
: 17179869184.00 GiB
[2026-06-07 20:37:42.156] Traceback (most recent call last):
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 536, in execute
output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 336, in get_output_data
return_values = await _async_map_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/custom_nodes/comfyui-lora-manager/py/metadata_collector/metadata_hook.py", line 171, in async_map_node_over_list_with_metadata
results = await original_map_node_over_list(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
...<2 lines>...
)
^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 310, in _async_map_node_over_list
await process_inputs(input_dict, i)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 298, in process_inputs
result = f(**inputs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_api/internal/__init__.py", line 149, in wrapped_func
return method(locked_class, **inputs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_api/latest/_io.py", line 1880, in EXECUTE_NORMALIZED
to_return = cls.execute(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_extras/nodes_custom_sampler.py", line 1047, in execute
samples = guider.sample(noise.generate_noise(latent), latent_image, sampler, sigmas, denoise_mask=noise_mask, callback=callback, disable_pbar=disable_pbar, seed=noise.seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1316, in sample
output = executor.execute(noise, latent_image, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1254, in outer_sample
output = self.inner_sample(noise, latent_image, device, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1229, in inner_sample
samples = executor.execute(self, sigmas, extra_args, callback, noise, latent_image, denoise_mask, disable_pbar)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 999, in sample
samples = self.sampler_function(model_k, noise, sigmas, extra_args=extra_args, callback=k_callback, disable=disable_pbar, **self.extra_options)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
return func(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/k_diffusion/sampling.py", line 205, in sample_euler
denoised = model(x, sigma_hat * s_in, **extra_args)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 639, in __call__
out = self.inner_model(x, sigma, model_options=model_options, seed=seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1202, in __call__
return self.outer_predict_noise(*args, **kwargs)
~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1209, in outer_predict_noise
).execute(x, timestep, model_options, seed)
~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1212, in predict_noise
return sampling_function(self.inner_model, x, timestep, self.conds.get("negative", None), self.conds.get("positive", None), self.cfg, model_options=model_options, seed=seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 619, in sampling_function
out = calc_cond_batch(model, conds, x, timestep, model_options)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 210, in calc_cond_batch
return _calc_cond_batch_outer(model, conds, x_in, timestep, model_options)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 218, in _calc_cond_batch_outer
return executor.execute(model, conds, x_in, timestep, model_options)
~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 334, in _calc_cond_batch
output = model.apply_model(input_x, timestep_, **c).chunk(batch_chunks)
~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/model_base.py", line 187, in apply_model
return comfy.patcher_extension.WrapperExecutor.new_class_executor(
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...<2 lines>...
comfy.patcher_extension.get_all_wrappers(comfy.patcher_extension.WrappersMP.APPLY_MODEL, transformer_options)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
).execute(x, t, c_concat, c_crossattn, control, transformer_options, **kwargs)
~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/model_base.py", line 231, in _apply_model
model_output = self.diffusion_model(xc, t, context=context, control=control, transformer_options=transformer_options, **extra_conds)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 1069, in forward
return super().forward(
~~~~~~~~~~~~~~~^
x,
^^
...<6 lines>...
**kwargs,
^^^^^^^^^
)
^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 936, in forward
return comfy.patcher_extension.WrapperExecutor.new_class_executor(
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...<4 lines>...
),
~~
).execute(x, timestep, context, attention_mask, frame_rate, transformer_options, keyframe_idxs, denoise_mask=denoise_mask, **kwargs)
~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 989, in _forward
x = self._process_transformer_blocks(
x, context, attention_mask, timestep, pe,
...<2 lines>...
**merged_args,
)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 969, in _process_transformer_blocks
vx, ax = block(
~~~~~^
(vx, ax),
^^^^^^^^^
...<16 lines>...
a_prompt_timestep=a_prompt_timestep,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
)
^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 355, in forward
ff_out = self.ff(vx_scaled)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 322, in forward
return self.net(x)
~~~~~~~~^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/container.py", line 253, in forward
input = module(input)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 308, in forward
return torch.nn.functional.gelu(self.proj(x), approximate="tanh")
~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
torch.OutOfMemoryError: Allocation on device 0 would exceed allowed memory. (out of memory)
Currently allocated : 11.33 GiB
Requested : 5.10 GiB
Device limit : 23.51 GiB
Free (according to CUDA): 103.06 MiB
PyTorch limit (set by user-supplied memory fraction)
: 17179869184.00 GiB
[2026-06-07 20:37:42.157] Memory summary:
|===========================================================================|
| PyTorch CUDA memory summary, device ID 0 |
|---------------------------------------------------------------------------|
| CUDA OOMs: 0 | cudaMalloc retries: 0 |
|===========================================================================|
| Metric | Cur Usage | Peak Usage | Tot Alloc | Tot Freed |
|---------------------------------------------------------------------------|
| Allocated memory | 6380 MiB | 16820 MiB | 0 B | 0 B |
| from large pool | 0 MiB | 0 MiB | 0 B | 0 B |
| from small pool | 0 MiB | 0 MiB | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Active memory | 6380 MiB | 16820 MiB | 0 B | 0 B |
| from large pool | 0 MiB | 0 MiB | 0 B | 0 B |
| from small pool | 0 MiB | 0 MiB | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Requested memory | 0 B | 0 B | 0 B | 0 B |
| from large pool | 0 B | 0 B | 0 B | 0 B |
| from small pool | 0 B | 0 B | 0 B | 0 B |
|---------------------------------------------------------------------------|
| GPU reserved memory | 6624 MiB | 21152 MiB | 0 B | 0 B |
| from large pool | 0 MiB | 0 MiB | 0 B | 0 B |
| from small pool | 0 MiB | 0 MiB | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Non-releasable memory | 0 B | 0 B | 0 B | 0 B |
| from large pool | 0 B | 0 B | 0 B | 0 B |
| from small pool | 0 B | 0 B | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Allocations | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Active allocs | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| GPU reserved segments | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Non-releasable allocs | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Oversize allocations | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Oversize GPU segments | 0 | 0 | 0 | 0 |
|===========================================================================|
Here is the current Log with --fast autotune:
aimdo: /project/src/vrambuf.c:75:INFO:VRAM Allocation failed (OOM)
0%| | 0/8 [00:07<?, ?it/s, Model Initializing ... ]
[ERROR] !!! Exception during processing !!! VRAM grow failed: 723919872 bytes
[ERROR] Traceback (most recent call last):
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 536, in execute
output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 336, in get_output_data
return_values = await _async_map_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/custom_nodes/comfyui-lora-manager/py/metadata_collector/metadata_hook.py", line 171, in async_map_node_over_list_with_metadata
results = await original_map_node_over_list(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
...<2 lines>...
)
^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 310, in _async_map_node_over_list
await process_inputs(input_dict, i)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 298, in process_inputs
result = f(**inputs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_api/internal/__init__.py", line 149, in wrapped_func
return method(locked_class, **inputs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_api/latest/_io.py", line 1880, in EXECUTE_NORMALIZED
to_return = cls.execute(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_extras/nodes_custom_sampler.py", line 1047, in execute
samples = guider.sample(noise.generate_noise(latent), latent_image, sampler, sigmas, denoise_mask=noise_mask, callback=callback, disable_pbar=disable_pbar, seed=noise.seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1316, in sample
output = executor.execute(noise, latent_image, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1254, in outer_sample
output = self.inner_sample(noise, latent_image, device, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1229, in inner_sample
samples = executor.execute(self, sigmas, extra_args, callback, noise, latent_image, denoise_mask, disable_pbar)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 999, in sample
samples = self.sampler_function(model_k, noise, sigmas, extra_args=extra_args, callback=k_callback, disable=disable_pbar, **self.extra_options)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
return func(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/k_diffusion/sampling.py", line 205, in sample_euler
denoised = model(x, sigma_hat * s_in, **extra_args)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 639, in __call__
out = self.inner_model(x, sigma, model_options=model_options, seed=seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1202, in __call__
return self.outer_predict_noise(*args, **kwargs)
~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1209, in outer_predict_noise
).execute(x, timestep, model_options, seed)
~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1212, in predict_noise
return sampling_function(self.inner_model, x, timestep, self.conds.get("negative", None), self.conds.get("positive", None), self.cfg, model_options=model_options, seed=seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 619, in sampling_function
out = calc_cond_batch(model, conds, x, timestep, model_options)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 210, in calc_cond_batch
return _calc_cond_batch_outer(model, conds, x_in, timestep, model_options)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 218, in _calc_cond_batch_outer
return executor.execute(model, conds, x_in, timestep, model_options)
~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 334, in _calc_cond_batch
output = model.apply_model(input_x, timestep_, **c).chunk(batch_chunks)
~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/model_base.py", line 187, in apply_model
return comfy.patcher_extension.WrapperExecutor.new_class_executor(
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...<2 lines>...
comfy.patcher_extension.get_all_wrappers(comfy.patcher_extension.WrappersMP.APPLY_MODEL, transformer_options)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
).execute(x, t, c_concat, c_crossattn, control, transformer_options, **kwargs)
~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/model_base.py", line 231, in _apply_model
model_output = self.diffusion_model(xc, t, context=context, control=control, transformer_options=transformer_options, **extra_conds)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 1069, in forward
return super().forward(
~~~~~~~~~~~~~~~^
x,
^^
...<6 lines>...
**kwargs,
^^^^^^^^^
)
^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 936, in forward
return comfy.patcher_extension.WrapperExecutor.new_class_executor(
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...<4 lines>...
),
~~
).execute(x, timestep, context, attention_mask, frame_rate, transformer_options, keyframe_idxs, denoise_mask=denoise_mask, **kwargs)
~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 989, in _forward
x = self._process_transformer_blocks(
x, context, attention_mask, timestep, pe,
...<2 lines>...
**merged_args,
)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 917, in _process_transformer_blocks
comfy.model_prefetch.prefetch_queue_pop(prefetch_queue, vx.device, block)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/model_prefetch.py", line 62, in prefetch_queue_pop
offload_stream = comfy.ops.cast_modules_with_vbar(comfy_modules, None, device, None, True)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ops.py", line 164, in cast_modules_with_vbar
xfer_dest = get_cast_buffer(dest_size)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ops.py", line 124, in get_cast_buffer
buffer = comfy_aimdo.torch.aimdo_to_tensor(cast_buffer.get(buffer_size, cast_buffer_offset), device)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/comfy_aimdo/vram_buffer.py", line 49, in get
raise RuntimeError(f"VRAM grow failed: {required_size} bytes")
RuntimeError: VRAM grow failed: 723919872 bytes
Here is the oom log with --fast Fp16_accumulation:
[ERROR] !!! Exception during processing !!! Allocation on device 0 would exceed allowed memory. (out of memory)
Currently allocated : 11.33 GiB
Requested : 5.10 GiB
Device limit : 23.51 GiB
Free (according to CUDA): 61.00 MiB
PyTorch limit (set by user-supplied memory fraction)
: 17179869184.00 GiB
[ERROR] Traceback (most recent call last):
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 536, in execute
output_data, output_ui, has_subgraph, has_pending_tasks = await get_output_data(prompt_id, unique_id, obj, input_data_all, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 336, in get_output_data
return_values = await _async_map_node_over_list(prompt_id, unique_id, obj, input_data_all, obj.FUNCTION, allow_interrupt=True, execution_block_cb=execution_block_cb, pre_execute_cb=pre_execute_cb, v3_data=v3_data)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/custom_nodes/comfyui-lora-manager/py/metadata_collector/metadata_hook.py", line 171, in async_map_node_over_list_with_metadata
results = await original_map_node_over_list(
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
...<2 lines>...
)
^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 310, in _async_map_node_over_list
await process_inputs(input_dict, i)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/execution.py", line 298, in process_inputs
result = f(**inputs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_api/internal/__init__.py", line 149, in wrapped_func
return method(locked_class, **inputs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_api/latest/_io.py", line 1880, in EXECUTE_NORMALIZED
to_return = cls.execute(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy_extras/nodes_custom_sampler.py", line 1047, in execute
samples = guider.sample(noise.generate_noise(latent), latent_image, sampler, sigmas, denoise_mask=noise_mask, callback=callback, disable_pbar=disable_pbar, seed=noise.seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1316, in sample
output = executor.execute(noise, latent_image, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1254, in outer_sample
output = self.inner_sample(noise, latent_image, device, sampler, sigmas, denoise_mask, callback, disable_pbar, seed, latent_shapes=latent_shapes)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1229, in inner_sample
samples = executor.execute(self, sigmas, extra_args, callback, noise, latent_image, denoise_mask, disable_pbar)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 999, in sample
samples = self.sampler_function(model_k, noise, sigmas, extra_args=extra_args, callback=k_callback, disable=disable_pbar, **self.extra_options)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/utils/_contextlib.py", line 124, in decorate_context
return func(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/k_diffusion/sampling.py", line 205, in sample_euler
denoised = model(x, sigma_hat * s_in, **extra_args)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 639, in __call__
out = self.inner_model(x, sigma, model_options=model_options, seed=seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1202, in __call__
return self.outer_predict_noise(*args, **kwargs)
~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1209, in outer_predict_noise
).execute(x, timestep, model_options, seed)
~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 1212, in predict_noise
return sampling_function(self.inner_model, x, timestep, self.conds.get("negative", None), self.conds.get("positive", None), self.cfg, model_options=model_options, seed=seed)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 619, in sampling_function
out = calc_cond_batch(model, conds, x, timestep, model_options)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 210, in calc_cond_batch
return _calc_cond_batch_outer(model, conds, x_in, timestep, model_options)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 218, in _calc_cond_batch_outer
return executor.execute(model, conds, x_in, timestep, model_options)
~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/samplers.py", line 334, in _calc_cond_batch
output = model.apply_model(input_x, timestep_, **c).chunk(batch_chunks)
~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/model_base.py", line 187, in apply_model
return comfy.patcher_extension.WrapperExecutor.new_class_executor(
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...<2 lines>...
comfy.patcher_extension.get_all_wrappers(comfy.patcher_extension.WrappersMP.APPLY_MODEL, transformer_options)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
).execute(x, t, c_concat, c_crossattn, control, transformer_options, **kwargs)
~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/model_base.py", line 231, in _apply_model
model_output = self.diffusion_model(xc, t, context=context, control=control, transformer_options=transformer_options, **extra_conds)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 1069, in forward
return super().forward(
~~~~~~~~~~~~~~~^
x,
^^
...<6 lines>...
**kwargs,
^^^^^^^^^
)
^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 936, in forward
return comfy.patcher_extension.WrapperExecutor.new_class_executor(
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
...<4 lines>...
),
~~
).execute(x, timestep, context, attention_mask, frame_rate, transformer_options, keyframe_idxs, denoise_mask=denoise_mask, **kwargs)
~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/patcher_extension.py", line 113, in execute
return self.original(*args, **kwargs)
~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 989, in _forward
x = self._process_transformer_blocks(
x, context, attention_mask, timestep, pe,
...<2 lines>...
**merged_args,
)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 969, in _process_transformer_blocks
vx, ax = block(
~~~~~^
(vx, ax),
^^^^^^^^^
...<16 lines>...
a_prompt_timestep=a_prompt_timestep,
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
)
^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/av_model.py", line 355, in forward
ff_out = self.ff(vx_scaled)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 322, in forward
return self.net(x)
~~~~~~~~^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/container.py", line 253, in forward
input = module(input)
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1778, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^
File "/mnt/Ai-storage/app-comfyui/.venv/lib/python3.13/site-packages/torch/nn/modules/module.py", line 1789, in _call_impl
return forward_call(*args, **kwargs)
File "/mnt/Ai-storage/app-comfyui/ComfyUI/comfy/ldm/lightricks/model.py", line 308, in forward
return torch.nn.functional.gelu(self.proj(x), approximate="tanh")
~~~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
torch.OutOfMemoryError: Allocation on device 0 would exceed allowed memory. (out of memory)
Currently allocated : 11.33 GiB
Requested : 5.10 GiB
Device limit : 23.51 GiB
Free (according to CUDA): 61.00 MiB
PyTorch limit (set by user-supplied memory fraction)
: 17179869184.00 GiB
[INFO] Memory summary:
|===========================================================================|
| PyTorch CUDA memory summary, device ID 0 |
|---------------------------------------------------------------------------|
| CUDA OOMs: 0 | cudaMalloc retries: 0 |
|===========================================================================|
| Metric | Cur Usage | Peak Usage | Tot Alloc | Tot Freed |
|---------------------------------------------------------------------------|
| Allocated memory | 6380 MiB | 16820 MiB | 0 B | 0 B |
| from large pool | 0 MiB | 0 MiB | 0 B | 0 B |
| from small pool | 0 MiB | 0 MiB | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Active memory | 6380 MiB | 16820 MiB | 0 B | 0 B |
| from large pool | 0 MiB | 0 MiB | 0 B | 0 B |
| from small pool | 0 MiB | 0 MiB | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Requested memory | 0 B | 0 B | 0 B | 0 B |
| from large pool | 0 B | 0 B | 0 B | 0 B |
| from small pool | 0 B | 0 B | 0 B | 0 B |
|---------------------------------------------------------------------------|
| GPU reserved memory | 6592 MiB | 21504 MiB | 0 B | 0 B |
| from large pool | 0 MiB | 0 MiB | 0 B | 0 B |
| from small pool | 0 MiB | 0 MiB | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Non-releasable memory | 0 B | 0 B | 0 B | 0 B |
| from large pool | 0 B | 0 B | 0 B | 0 B |
| from small pool | 0 B | 0 B | 0 B | 0 B |
|---------------------------------------------------------------------------|
| Allocations | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Active allocs | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| GPU reserved segments | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Non-releasable allocs | 0 | 0 | 0 | 0 |
| from large pool | 0 | 0 | 0 | 0 |
| from small pool | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Oversize allocations | 0 | 0 | 0 | 0 |
|---------------------------------------------------------------------------|
| Oversize GPU segments | 0 | 0 | 0 | 0 |
|===========================================================================|
[ERROR] Got an OOM, unloading all loaded models.
Custom Node Testing
Expected Behavior
Vram to be utilized correctly
Actual Behavior
A few weeks ago I had updated my GPU driver from 590.48.1 to 595.xx.yy, when I tried to run a standard LTx 2.3 workflow that I had ran minutes before updating the driver, my generation times went from 90-200 seconds to minutes before the ksampler even showed progress. Watching my systems resource monitor I noticed that comfyui was streaming data from cache instead of reading the model, vae, and clip then loading them to vram.
Thought it might be torch related and needed to be upgraded, so I upgraded torch from 2.10 + cu130 to 2.12 + cu132, rebuild sageattention wheels, and fixed any other issues with my uv environment, yet vram was still not running correctly. Even updating comfyui and restarting pc had no effect. At that point I rolled my driver back to 590 and everything worked just fine.
Fast forward to june 6/7, I was using musubi trainer to train an ltx 2.3 lora in it own uv environment, the training finished up and I went to test the lora, suddenly with nothing loaded in Vram, 52GB of ram available, and 92GB of swap file free, comfyui is ooming. I removed the lora, comfyui still ooms, I restart my pc and comfyui, still oom.
I try to run LTx 2.3 wf again, but I monitor everything with kijais memory visualizer, looks like torch loads stuff to vram but then pins WAY more than it needs and ooms after it attempts to offload everything. I go through and try other workflows that I know take a lot memory, Klein no issues, Wan 2.2 no issues unless I set the resolution higher.
Currently I'm playing around with --fast launch options when this happened, both times I had --fast cublas_ops
Steps to Reproduce
I don't have a good steps to reproduce, each time its usually been while training loras.
the first time when rolling back drivers fixed the issue, I was using Rs-nodes lora training nodes.
This time I'm using musubi tuner in a seperate uv enviroment from comfyui, and driver 590.48.1
Debug Logs
Other
Just in case the questions come up:
OS Linux Mint 22.3 (ubuntu 24.04)
Nvidia 4090 driver 490.48.1
system ram 64GB
swap file 99GB
comfyUI version 24
aimdo version 4.0