I tried reproducing the results in your paper by training the unlearning methods (GA, GD, ICD, IDK, and KL) using the same experimental settings described in the paper. I used the released pretrained weights(https://huggingface.co/gray311/vlm_unlearning_ft_llava_phi_3_mini) and followed the configurations as closely as possible. However, I observed that the results differ from those reported in the paper.
I also checked the configuration files on Huggingface and noticed that both tune_mm_projector and tune_language_model are set to true during LoRA fine-tuning. Could you please clarify whether, during unlearning, you updated not only the LoRA adapters but also the MM projector and the LLM layers?
I tried reproducing the results in your paper by training the unlearning methods (GA, GD, ICD, IDK, and KL) using the same experimental settings described in the paper. I used the released pretrained weights(https://huggingface.co/gray311/vlm_unlearning_ft_llava_phi_3_mini) and followed the configurations as closely as possible. However, I observed that the results differ from those reported in the paper.
I also checked the configuration files on Huggingface and noticed that both tune_mm_projector and tune_language_model are set to true during LoRA fine-tuning. Could you please clarify whether, during unlearning, you updated not only the LoRA adapters but also the MM projector and the LLM layers?