Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can you offer the link for downloading the training data and validation data? Thank you very much #11

Open
young169 opened this issue Sep 30, 2020 · 21 comments

Comments

@young169
Copy link

No description provided.

@young169
Copy link
Author

And what's more, the link for testdata: https://drive.google.com/uc?export=download&id=1z_fD3UKgWQyOTZIBbYSaQ-hz4AzUrLC1 failed somehow..

@PeterWang512
Copy link
Owner

The link for training data: https://drive.google.com/file/d/1iVNBV0glknyTYGA9bCxT_d0CVTOgGcKh/view
The link for validation data: https://drive.google.com/file/d/1FU7xF8Wl_F8b0tgL0529qg2nZ_RpdVNL/view

Would you mind trying downloading the files manually? I tested it and it still worked for me.

@liulai
Copy link

liulai commented Oct 14, 2020

The link for training data: https://drive.google.com/file/d/1iVNBV0glknyTYGA9bCxT_d0CVTOgGcKh/view
The link for validation data: https://drive.google.com/file/d/1FU7xF8Wl_F8b0tgL0529qg2nZ_RpdVNL/view

Would you mind trying downloading the files manually? I tested it and it still worked for me.

training data is not worked, validation data is worked
QQ图片20201014224725

@young169
Copy link
Author

The link for training data: https://drive.google.com/file/d/1iVNBV0glknyTYGA9bCxT_d0CVTOgGcKh/view
The link for validation data: https://drive.google.com/file/d/1FU7xF8Wl_F8b0tgL0529qg2nZ_RpdVNL/view
Would you mind trying downloading the files manually? I tested it and it still worked for me.

training data is not worked, validation data is worked
QQ图片20201014224725

And once the connection is break, we can only download it from the beginning

@Queuecumber
Copy link

Can you please host the data somewhere else? Google drive isn't really made for this kind of high traffic stuff:

Sorry, you can't view or download this file at this time.

Too many users have viewed or downloaded this file recently. Please try accessing the file again later. If the file you are trying to access is particularly large or is shared with many people, it may take up to 24 hours to be able to view or download the file. If you still can't access a file after 24 hours, contact your domain administrator.

Berkeley must have a place for you to host it

@PeterWang512
Copy link
Owner

I see, I'll work on a new download link soon. Sorry that this causes trouble

@PeterWang512
Copy link
Owner

PeterWang512 commented Oct 22, 2020

Sorry for the wait. Currently I'm working on storing the file into a different cloud server, and will update the link when it's available.
In the meantime, I've made a temporary link with splitted training data here: https://drive.google.com/drive/u/2/folders/14E_R19lqIE9JgotGz09fLPQ4NVqlYbVc
Let me know if this works for you, and I'll let you know when a new link is available.

@PeterWang512
Copy link
Owner

Also made a copy in Box, here's the link: https://cmu.app.box.com/folder/124997172518?s=4syr4womrggfin0tsfhxohaec5dh6n48

@young169
Copy link
Author

Also made a copy in Box, here's the link: https://cmu.app.box.com/folder/124997172518?s=4syr4womrggfin0tsfhxohaec5dh6n48

Thanks, Box version works for me!

@pratyushmaini
Copy link

Hey! Can you explain to me the tool you are using to unzip this? I have used multiple tools and the extraction is not happening in the desired format. The exact command you use would be helpful :)

@PeterWang512
Copy link
Owner

I'm using unzip in ubuntu.

@pratyushmaini
Copy link

pratyushmaini commented Sep 30, 2021

This does not work for me on the command line. are you sure this is working in the command line

I have also tried other methods like 7za but they do not extract it correctly (the directory structure gets muddled)

@PeterWang512
Copy link
Owner

Would you mind trying the script here to see if it works? https://github.com/PeterWang512/CNNDetection#download-the-dataset
I tried it again and it works on my end.

@pratyushmaini
Copy link

pratyushmaini commented Oct 10, 2021

: ~/projects/CNNDetection> $ cd dataset/test
: ~/projects/CNNDetection/dataset/test> $ ls
download_testset.sh
: ~/projects/CNNDetection/dataset/test> $ bash download_testset.sh
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
100 3118 0 3118 0 0 7913 0 --:--:-- --:--:-- --:--:-- 7893
% Total % Received % Xferd Average Speed Time Time Time Current
Dload Upload Total Spent Left Speed
0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0
100 89936 0 89936 0 0 266k 0 --:--:-- --:--:-- --:--:-- 266k
Archive: testset.zip
End-of-central-directory signature not found. Either this file is not
a zipfile, or it constitutes one disk of a multi-part archive. In the
latter case the central directory and zipfile comment will be found on
the last disk(s) of this archive.
unzip: cannot find zipfile directory in one of testset.zip or
testset.zip.zip, and cannot find testset.zip.ZIP, period.

@PeterWang512
Copy link
Owner

Never seen this error before. Doing a quick google search found this: https://askubuntu.com/questions/54904/unzip-error-end-of-central-directory-signature-not-found

@pratyushmaini
Copy link

As you can see in the comment I posted, the script downloads just 89936KB of data which is clearly not the test set.

@PeterWang512
Copy link
Owner

PeterWang512 commented Oct 11, 2021

Sorry but not sure if I can help any further if I cannot reproduce the error. Will look into this further if you can provide the exact details for others, including me, to reproduce this.

@pratyushmaini
Copy link

Did you try for the train set or for val and test? Now val and test work, but train does not. I think its an issue with where it is hosted that it allows downloading sometimes.

@PeterWang512
Copy link
Owner

Yes, please try this link and this link for training images. These are the mirrors of the dataset

@pratyushmaini
Copy link

Can you tell me the command such as this to do that?
https://stackoverflow.com/questions/46239248/how-to-download-a-file-from-box-using-wget
This and many other links don't work.

@PeterWang512
Copy link
Owner

PeterWang512 commented Oct 26, 2021

Sorry I also haven't figured out a way to download from Box using bash scripts. Would it be possible to download it manually first? Meanwhile, I'll look for other ways to host or download the dataset

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants