title: Understanding Multimodal LLMs: the Mechanistic Interpretability of Llava in Visual Question Answering
link: https://arxiv.org/pdf/2411.10950