Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

VideoChat2 Raw Training video download #176

Closed
mingzeG opened this issue May 17, 2024 · 14 comments
Closed

VideoChat2 Raw Training video download #176

mingzeG opened this issue May 17, 2024 · 14 comments

Comments

@mingzeG
Copy link

mingzeG commented May 17, 2024

Great Work! I was hoping to quickly get the raw video dataset used and try to train a videochat2, how could I get a filtered raw video dataset rather than downloading all the video datasets that the data.md mentioned. It's too memory consuming

@yinanhe
Copy link
Member

yinanhe commented May 20, 2024

Most of the videos are common and relatively easy to obtain. For some datasets that are more difficult to access, for EgoQA videos, download them from this link. For VideoChat2 conversation videos, download them from link. For Youcook, you can download it from link

@pengzhiliang
Copy link

Hello, @yinanhe, how to download the CLEVRER dataset? The link shows noting.

@yinanhe
Copy link
Member

yinanhe commented May 21, 2024

@pengzhiliang
Copy link

pengzhiliang commented May 21, 2024

@yinanhe Thanks very much 🍻

@schopra8
Copy link

@yinanhe Do you have alternative links? The videos links aren't working for me.

@yinanhe
Copy link
Member

yinanhe commented May 29, 2024

@schopra8 We no longer have any other links. If you are still having difficulties in getting the data, please let me know which dataset it is.

@schopra8
Copy link

Apologies for the lack of specificity! I mean the CLEVRER dataset specifically. The question+answers and README load for me -- but the videos don't, when I click on those links.

I reached out to the original authors of the CLEVRER dataset as well, but they haven't responded.

@schopra8
Copy link

@yinanhe -- I'm also struggling to find the VideoChat videos used in the conversation and caption annotations. Do you have any pointers to where I can download this data? Thank you in advance!

@yinanhe
Copy link
Member

yinanhe commented May 29, 2024

Apologies for the lack of specificity! I mean the CLEVRER dataset specifically. The question+answers and README load for me -- but the videos don't, when I click on those links.

I reached out to the original authors of the CLEVRER dataset as well, but they haven't responded.

@schopra8 how about use link in #176 (comment). In my network environment, the download is normal.

@yinanhe
Copy link
Member

yinanhe commented May 29, 2024

@yinanhe -- I'm also struggling to find the VideoChat videos used in the conversation and caption annotations. Do you have any pointers to where I can download this data? Thank you in advance!

The videos used here are from YouTube, and some videos might be found in the InternVid dataset on opendatlab.com.

@schopra8
Copy link

Thank you @yinanhe!

@schopra8
Copy link

I now see that the VideoChat data names correspond to WebVid like VideoChat2 -- and can resolve the videos.

@yinanhe - Thanks for all the help! It might be helpful to update the Data.md file to clarify that VideoChat corresponds to videos from WebVid. The link to InternVid threw me off -- and had me incorrectly looking at the InternVid-10M Dataset .

@xiaoqian-shen
Copy link

Most of the videos are common and relatively easy to obtain. For some datasets that are more difficult to access, for EgoQA videos, download them from this link. For VideoChat2 conversation videos, download them from link. For Youcook, you can download it from link

Hi, the links seems not work. Could you check whether the bucket exists or not?

@yinanhe
Copy link
Member

yinanhe commented Oct 14, 2024

Now you can get the latest download link according to the issue #223

egoqa conversation youcook_split_videos_parta youcook_split_videos_partb

@yinanhe yinanhe closed this as completed Oct 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants