I do not understand _normalize_attentions method in  API/API_CLIP/hook.py

I'm having some trouble understanding a part of the _normalize_attentions function. Specifically, I'm unsure about the following line of code:

mean_centered = (self.attentions - self.post_ln_mean[:, :, np.newaxis, np.newaxis] / (len_intermediates * normalization_term))

In this context, len_intermediates is set to 47 when _normalize_attentions is called. Could someone explain in detail what this code is doing? In particular, I'm unclear on why we divide by len_intermediates.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

I do not understand _normalize_attentions method in API/API_CLIP/hook.py #8

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

I do not understand _normalize_attentions method in API/API_CLIP/hook.py #8

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions