Skip to content

Question about the input of ROIAlign #10

@hangzhiyiwei

Description

@hangzhiyiwei

Hi

I have a question about the input image feature size here
https://github.com/NVlabs/Bongard-HOI/blob/master/models/rn_bbox_encoder.py#L116

The input feature size for one image is 25611 (depthheightwidth). If so, how can ROIAlign crop proper ROI region feature? I have seen the output of this self.roi_pooler (with size 25677), and I found that many roi_feats are all zero outputs.
For others, for each channel of the whole 256 channels, only the upper left element has a number. An example shows below.
[ 0.4648, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000]

I am really confused about it. Sorry to bother. Any reply will be appreciated. Thanks very much.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions