Description
Hi, when running the following command:
tune run lora_finetune_single_device --config llama3/8B_lora_single_device model.lora_rank=16 optimizer=bitsandbytes.optim.AdamW8bit gradient_accumulation_steps=4 tokenizer.max_seq_len=2048 max_steps_per_epoch=100 model.lora_attn_modules="['q_proj','k_proj','v_proj','output_proj']" model.apply_lora_to_mlp=True log_peak_memory_stats=True compile=True checkpointer.checkpoint_dir=checkpoints/original tokenizer.path=checkpoints/original/tokenizer.model checkpointer.output_dir=checkpoints/original
I get the stack trace below.
It looks like we unconditionally pass `fused` as a kwarg to the optimizer, even though the bitsandbytes optimizers don't accept it.
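For context, here is a minimal sketch of the kind of guard I'd expect; `build_optimizer` is a hypothetical helper (not torchtune's actual recipe code) that only forwards `fused` when the optimizer's constructor actually declares that parameter:

```python
import inspect

import bitsandbytes as bnb  # assumes bitsandbytes is installed
import torch


def build_optimizer(optimizer_cls, params, lr, fused=True):
    """Hypothetical helper: only forward `fused` if the optimizer supports it.

    bitsandbytes optimizers (e.g. AdamW8bit) have no `fused` kwarg, so
    passing it unconditionally raises a TypeError.
    """
    kwargs = {"lr": lr}
    if "fused" in inspect.signature(optimizer_cls).parameters:
        kwargs["fused"] = fused
    return optimizer_cls(params, **kwargs)


model = torch.nn.Linear(8, 8)
# torch.optim.AdamW declares `fused`, so it gets the kwarg.
opt_fused = build_optimizer(torch.optim.AdamW, model.parameters(), lr=1e-4)
# bnb.optim.AdamW8bit does not, so the kwarg is dropped instead of erroring.
opt_8bit = build_optimizer(bnb.optim.AdamW8bit, model.parameters(), lr=1e-4)
```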
Related issue: #1998
Version info:
PyTorch: 1b3f8b75896720e88362cbec7db32abc52afa83e
Torchtune: f2bd4bc
Torchao: 039cef4ad546716aa04cd54c461feb173f7fe403