The attention layer works directly on the GRU embeddings (denoted h_it in the HAN paper) in the call function of the AttentionLayer. In the paper, h_it should first be fed through a one-layer MLP with a tanh activation to obtain u_it = tanh(W·h_it + b), and the attention weights are then computed on u_it. Is this happening in the code and have I missed it, or has it been (intentionally) left out? Please clarify.
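For reference, here is a minimal sketch (not this repository's actual code) of a Keras-style attention layer that includes the one-layer MLP from the paper. The layer/weight names (W, b, u) and the use of the TF Keras backend are assumptions for illustration only.

```python
# Sketch only: HAN-style attention with the u_it = tanh(W . h_it + b) step,
# followed by a softmax over the context vector u. Names are assumptions.
import tensorflow as tf
from tensorflow.keras import backend as K
from tensorflow.keras.layers import Layer


class AttentionWithContext(Layer):
    def build(self, input_shape):
        dim = int(input_shape[-1])
        # W and b implement the one-layer MLP: u_it = tanh(W . h_it + b)
        self.W = self.add_weight(name="W", shape=(dim, dim),
                                 initializer="glorot_uniform")
        self.b = self.add_weight(name="b", shape=(dim,),
                                 initializer="zeros")
        # u is the learned context vector used to score each u_it
        self.u = self.add_weight(name="u", shape=(dim,),
                                 initializer="glorot_uniform")
        super().build(input_shape)

    def call(self, h):
        # h: (batch, timesteps, dim) -- the GRU outputs h_it
        u_it = K.tanh(K.dot(h, self.W) + self.b)        # one-layer MLP with tanh
        scores = K.dot(u_it, K.expand_dims(self.u))      # (batch, timesteps, 1)
        alpha = K.softmax(scores, axis=1)                # attention weights a_it
        return K.sum(alpha * h, axis=1)                  # weighted sum over timesteps
```

If the repository's call function computes the scores directly on h_it, that would correspond to skipping the tanh projection above, which is the difference I am asking about.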