Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Preprocessing Code #2

Open
lousiming opened this issue Nov 23, 2024 · 4 comments
Open

Preprocessing Code #2

lousiming opened this issue Nov 23, 2024 · 4 comments

Comments

@lousiming
Copy link

Could you please help me? I followed your preprocessing code and used the data format from the h5py file, but the results of my training are not very good. Could you provide some code for creating an h5py file? Thank you.

@thswodnjs3
Copy link
Owner

I'm very sorry but which data do you mean? The public dataset (SumMe, TVSum) or your custom dataset?

@lousiming
Copy link
Author

I would like to know the code for processing two datasets, as they are quite different from each other. I haven't found any code online that handles both datasets. I went through the content in the h5 file and processed it myself, but when I trained the model, the scores were very low. I suspect that there might be an issue with my code or that the dataset I downloaded is slightly incomplete. Thank you for your reply. I am a student from China, and I would greatly appreciate your help in solving this problem.

@thswodnjs3
Copy link
Owner

thswodnjs3 commented Nov 23, 2024

I understand that you want to know two things.
1)The code preprocesses two public datasets and creates an h5py file.
2)Why is there a different result when you preprocess data yourself.

1)I've found that there is something quite similar.

https://github.com/li-plus/DSNet?tab=readme-ov-file#using-custom-videos

You can reference the link above.
But as you know it is not the official code and I haven't found the official code too.
So you need to consider it.

2)I suppose it is because the current h5py files were created using GoogleNet which had been implemented differently.
I'm not sure but I guess Ke Zhang (the author of vsLSTM, ECCV 2016) created h5py files using two datasets.
From his GitHub repository, he said that he used GoogleNet which was implemented by Caffe, not PyTorch.
Because of different implementations for GoogleNet, you might see different results.
You can reference below.

kezhang-cs/Video-Summarization-with-LSTM#2

@lousiming
Copy link
Author

Okay, thank you for your answer, I really appreciate it! I will take a look at the code first.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants