Question about the input of ROIAlign #10

hangzhiyiwei · 2022-12-05T03:28:40Z

Hi

I have a question about the input image feature size here
https://github.com/NVlabs/Bongard-HOI/blob/master/models/rn_bbox_encoder.py#L116

The input feature size for one image is 25611 (depthheightwidth). If so, how can ROIAlign crop proper ROI region feature? I have seen the output of this self.roi_pooler (with size 25677), and I found that many roi_feats are all zero outputs.
For others, for each channel of the whole 256 channels, only the upper left element has a number. An example shows below.
[ 0.4648, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000],
[ 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000, 0.0000]

I am really confused about it. Sorry to bother. Any reply will be appreciated. Thanks very much.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about the input of ROIAlign #10

Question about the input of ROIAlign #10

hangzhiyiwei commented Dec 5, 2022

Question about the input of ROIAlign #10

Question about the input of ROIAlign #10

Comments

hangzhiyiwei commented Dec 5, 2022