Thank you for creating and sharing this great tool!
I'm planning to use it for my research project. Is there any reference for computing and plotting attention head activation frequency figures like the one referenced below? Alternatively, are there any pointers on how to compute the importance subgraph over a batch of sentences in Python code (without the web interface)?
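To make the question concrete, here is a rough sketch of the aggregation step I have in mind. It assumes I can already get one importance score per attention head for each sentence, as a nested list indexed by `[layer][head]`; that part is exactly what I don't know how to extract from the tool. The function name `head_activation_frequency` and the `threshold` value are my own placeholders, not anything from the tool or the paper.

```python
def head_activation_frequency(per_sentence_scores, threshold):
    """Fraction of sentences in which each head's score exceeds `threshold`.

    per_sentence_scores: list with one entry per sentence, each a nested
    list of shape [n_layers][n_heads] (hypothetical input format).
    Returns a nested list of shape [n_layers][n_heads] with values in [0, 1].
    """
    n_sentences = len(per_sentence_scores)
    if n_sentences == 0:
        raise ValueError("need at least one sentence")

    n_layers = len(per_sentence_scores[0])
    n_heads = len(per_sentence_scores[0][0])
    counts = [[0] * n_heads for _ in range(n_layers)]

    # Count, for every head, the sentences in which it is "active".
    for scores in per_sentence_scores:
        for layer in range(n_layers):
            for head in range(n_heads):
                if scores[layer][head] > threshold:
                    counts[layer][head] += 1

    return [[c / n_sentences for c in row] for row in counts]


# Toy scores for 2 sentences, 2 layers, 2 heads (made-up numbers).
toy_scores = [
    [[0.9, 0.0], [0.2, 0.6]],
    [[0.8, 0.7], [0.0, 0.1]],
]
frequency = head_activation_frequency(toy_scores, threshold=0.5)
```

The resulting `[layer][head]` grid is what I would then plot as a heatmap. What I'm missing is the recommended way to produce the per-sentence scores in batch.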
Many thanks!
(Figure 7 in https://arxiv.org/abs/2403.00824)