Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hi! have you tried implement MVCNN based on VGG 19 not the Alexnet? #12

Open
yifanfeng97 opened this issue Aug 20, 2017 · 6 comments
Open

Comments

@yifanfeng97
Copy link

yifanfeng97 commented Aug 20, 2017

I used Tensorflow, and i was besed on VGG19. But I just got the OOM error! I had reused the variable in the loop! I used TITAN X.

@youkaichao
Copy link
Contributor

I was also wandering.
In the paper, the author use VGG-M and VGG-VD. So why many people implement it using AlexNet?(the Caffe implementation uses AlexNet as well)

@WeiTang114
Copy link
Owner

@youkaichao
My reason is I had difficulty finding usable pretrained weight for VGG-M (and can be converted for Tensorflow) when I began this project 😅.
(VGG-M was implemented in matconvnet)

@WeiTang114
Copy link
Owner

@XDUfyf
I haven't tried VGG19. Did you try reducing the batch size?

@yifanfeng97
Copy link
Author

yifanfeng97 commented Aug 21, 2017

@WeiTang114 I have tried reducing the batch size, and I set batch size with 1. It did not work, too.

At the same time, I met a strange problem.
when I tried train with single view not multi-view(based on VGG19, my multi-view didn't work. so I
temporarily use single view as input ) in ModelNet40. If I set batch size as 64 ,training would get nan in step 404. however I set batch size as 16, training would get nan in step 1619( 64 = 164 and 1619 = 4044). No matter how many times I run. It would get same result. My loss curve is normal.
environment: Tensorflow, Python3, TITAN X , Cuda 8.

loss

error 1

error3

@paulcx
Copy link

paulcx commented Sep 5, 2017

The resnet 101 is definitely a good choice to try for better performance.

@foxfromworld
Copy link

Hi,
Can we train our model from scratch without the pretrained AlexNet model?
Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants