The eo-flow implementation of the PseTae MultiHeadAttention layer appears to have a mistake.
The original PyTorch implementation applies two fully connected layers to the query tensor (see here: https://github.com/VSainteuf/pytorch-psetae/blob/master/models/tae.py#L133).
In the eo-flow implementation, however, both fully connected layers are defined, but the second one (defined here: https://github.com/sentinel-hub/eo-flow/blob/master/eoflow/models/pse_tae_layers.py#L50) is never applied where it would be expected: https://github.com/sentinel-hub/eo-flow/blob/master/eoflow/models/pse_tae_layers.py#L66 (in fact, it is not used anywhere in the code).
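
For reference, here is a minimal sketch of what the query branch could look like if the second fully connected block were applied to the pooled query, mirroring the linked PyTorch code. This is not the actual eo-flow code; the layer and variable names (`fc1_q`, `fc2`, `n_head`, `d_k`) are assumptions based on the linked files, and the key/value/attention parts are omitted:

```python
import tensorflow as tf

class MultiHeadAttentionQuerySketch(tf.keras.layers.Layer):
    """Sketch of the query branch only; attention over keys/values is omitted."""

    def __init__(self, n_head, d_k, d_in):
        super().__init__()
        self.n_head, self.d_k, self.d_in = n_head, d_k, d_in
        # First fully connected layer: per-date query projection.
        self.fc1_q = tf.keras.layers.Dense(n_head * d_k)
        # Second fully connected block: refines the temporally pooled query.
        # This is the layer that seems to be defined but never called in eo-flow.
        self.fc2 = tf.keras.Sequential([
            tf.keras.layers.BatchNormalization(),
            tf.keras.layers.Dense(n_head * d_k),
        ])

    def call(self, v, training=None):
        # v: (batch, seq_len, d_in)
        q = self.fc1_q(v)                   # (batch, seq_len, n_head * d_k)
        q = tf.reduce_mean(q, axis=1)       # temporal mean -> "master" query
        q = self.fc2(q, training=training)  # <-- the second FC, apparently skipped in eo-flow
        return tf.reshape(q, (-1, self.n_head, self.d_k))  # (batch, n_head, d_k)
```

If this reading of the two implementations is correct, the fix would presumably just be adding a call to the already-defined second fully connected layer in the `call` method.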