Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Image Feature Map #7

Open
yixiao1 opened this issue Sep 16, 2022 · 1 comment
Open

Image Feature Map #7

yixiao1 opened this issue Sep 16, 2022 · 1 comment

Comments

@yixiao1
Copy link

yixiao1 commented Sep 16, 2022

Hi,

Congrat to this impressive work.

I have a question relating to the image feature map. In the paper, you mentioned that you used ResNet34 pretrained on ImageNet as the image encoder. Could you please provide more details about the layers that you have used? Did you remove the last global average pooling and FC layers of the ResNet backbone?

I assume that you finally encoded each input image to only one feature maps F, since later you calculated an attention map and mapped it back to this feature map F for each step. If so, you should have added some decoder layers after the last Conv. layers block of the ResNet-34, right? Please correct me if I understand it wrongly. Thanks!

Best wishes

@jiaxiaosong1002
Copy link
Collaborator

Hi yixiao1,

Since the code have been released, you could find out that we use both the 2D feature map and flattened one of ResNet34. No additional layers are used.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants