Transfer learning on a large dataset #50
Comments
Hello :) If you are adding the new DocTamper dataset (120k images) to the existing dataset setup, the smallest dataset is still IMD, so self.smallest should be 1869 (the number of images in IMD).
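For readers following along, here is a minimal sketch of what this answer implies, assuming self.smallest is the number of images drawn from each dataset per epoch so that all datasets contribute equally. The class name BalancedIndexSampler and its shuffle method are hypothetical and are not taken from data_core.py; only the sizes 1869 and 120000 come from this thread.

```python
import random


class BalancedIndexSampler:
    """Hypothetical stand-in for per-epoch balanced subsampling (not the repo's code)."""

    def __init__(self, dataset_sizes, smallest, seed=0):
        self.dataset_sizes = dict(dataset_sizes)
        self.smallest = smallest
        self.rng = random.Random(seed)
        # A subset of `smallest` images cannot be drawn from a smaller dataset.
        if any(size < smallest for size in self.dataset_sizes.values()):
            raise ValueError("smallest must not exceed the size of any dataset")

    def __len__(self):
        # Every dataset contributes exactly `smallest` images per epoch.
        return self.smallest * len(self.dataset_sizes)

    def shuffle(self):
        """Draw a fresh random subset of `smallest` indices from every dataset."""
        epoch = []
        for name, size in self.dataset_sizes.items():
            for idx in self.rng.sample(range(size), self.smallest):
                epoch.append((name, idx))
        self.rng.shuffle(epoch)
        return epoch


# Sizes quoted in this thread: IMD has 1869 images, DocTamper has 120000.
sampler = BalancedIndexSampler({"IMD": 1869, "DocTamper": 120000}, smallest=1869)
epoch = sampler.shuffle()
```

Under this assumption, DocTamper is not truncated permanently: each call to shuffle draws a different random 1869-image subset, so more of the large dataset is seen as training proceeds.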
Thank you very much for your answer. I'm using only DocTamper as a dataset. My validation and test sets contain 10,000 and 30,000 images respectively. Considering training efficiency and training performance, I want to know whether setting self.smallest to 10,000 or to 1869*10 will give better results. Looking forward to your answer.
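The thread does not settle which value performs better, but the efficiency side of the trade-off can be estimated with simple arithmetic. The sketch below assumes a single dataset from which self.smallest images are drawn per epoch; the helper epoch_stats and the batch size of 24 are illustrative assumptions, not values from the repository.

```python
import math


def epoch_stats(dataset_size, smallest, batch_size):
    """Return (iterations per epoch, epochs needed to draw dataset_size samples)."""
    iterations_per_epoch = math.ceil(smallest / batch_size)
    # Lower bound only: random per-epoch subsets can repeat images,
    # so full coverage of the dataset may take longer than this.
    epochs_to_draw_dataset_size = math.ceil(dataset_size / smallest)
    return iterations_per_epoch, epochs_to_draw_dataset_size


DOCTAMPER_TRAIN_SIZE = 120_000  # figure quoted in this thread
BATCH_SIZE = 24                 # hypothetical, adjust to your configuration

for smallest in (1869, 10_000, 1869 * 10, DOCTAMPER_TRAIN_SIZE):
    iters, epochs = epoch_stats(DOCTAMPER_TRAIN_SIZE, smallest, BATCH_SIZE)
    print(f"smallest={smallest}: {iters} iterations/epoch, "
          f"at least {epochs} epochs to draw {DOCTAMPER_TRAIN_SIZE} samples")
```

A larger value mostly changes how often validation and checkpointing happen relative to the number of training iterations; the total number of images seen depends on smallest multiplied by the number of epochs.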
Hello, I am also training CAT-Net with DocTamper. Could you leave me your contact information so we can discuss it together? Thank you.
Hey. I want to try the same. Can we connect to discuss?
Hello author. I tried to train CAT-Net on the DocTamper dataset (120,000 images). Should I change self.smallest = 1869 to self.smallest = 120000 in data_core.py, or should I train on a subset of the full dataset in each round? I look forward to your answer.