Hi,
I noticed there seems to be an inconsistency between the implementation and what is described in the paper.
In the paper, the self-attention module self-attends among the keypoint features (which are extracted from the support features), and the cross-attention module then cross-attends the resulting keypoint features to the query features.
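To make sure I am reading the paper correctly, here is a minimal sketch of that ordering (a paraphrase with hypothetical names and a plain `nn.MultiheadAttention` layer, not code from this repository):

```python
import torch.nn as nn

class PaperStyleDecoderLayer(nn.Module):
    """Sketch of the ordering described in the paper (hypothetical names)."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads)
        self.cross_attn = nn.MultiheadAttention(dim, heads)

    def forward(self, keypoint_feats, query_feats):
        # keypoint_feats: (num_keypoints, batch, dim), from the support features
        # query_feats:    (H*W, batch, dim), from the query image
        # 1) self-attention among the keypoint features
        kp = self.self_attn(keypoint_feats, keypoint_feats, keypoint_feats)[0]
        # 2) cross-attention: keypoint features attend to the query features
        kp = self.cross_attn(kp, query_feats, query_feats)[0]
        return kp
```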
However, in the implementation, the self-attention module is used to self-attend among the query features:
And the cross-attention module is used to cross-attend the resulting query features to the keypoint features:
In this file, x is the query features, and query_embed is in fact the keypoint features.
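For comparison, this is how I read the ordering in the code, again only a paraphrased sketch (same hypothetical layer as above, using the x / query_embed naming from that file), not the exact implementation:

```python
import torch.nn as nn

class ObservedDecoderLayer(nn.Module):
    """Sketch of the ordering I see in the code (paraphrased, not the repo's code)."""
    def __init__(self, dim=256, heads=8):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads)
        self.cross_attn = nn.MultiheadAttention(dim, heads)

    def forward(self, x, query_embed):
        # x:           (H*W, batch, dim), the query features
        # query_embed: (num_keypoints, batch, dim), in fact the keypoint features
        # 1) self-attention among the query features
        x = self.self_attn(x, x, x)[0]
        # 2) cross-attention: query features attend to the keypoint features
        x = self.cross_attn(x, query_embed, query_embed)[0]
        return x
```

If I have misread either the paper or the code, could you point out where? Otherwise, could you clarify which ordering is the intended one?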