Request for external caption files #52

Open
mmderakhshani opened this issue Nov 19, 2024 · 5 comments

Comments

@mmderakhshani

mmderakhshani commented Nov 19, 2024

Hi there,

Thank you for sharing this excellent GitHub repository.

The LAION-aesthetics-12m and JourneyDB datasets were recaptioned with the ShareGPT4V model for both the second and third stages of training.

We are working on reproducing your results and have successfully completed the first stage. To continue with the training, we would like to request the following three annotations:

external_journeydb_caption_path: "/mnt/bn/vgfm2/test_mlx/xavier/code/3062/open_muse/train_journeydb_anno.json"

external_laion12m_caption_path: "/mnt/bn/vgfm/laion5b/laion-aesthetics-12m-captions"

and

/mnt/bn/vgfm2/test_dit/LlmDiffuser_phi1.5/LlmDiffuser/questions.json

Could you please share these files with us? Their absence is blocking our reproduction of your results.

If sharing these files is not possible, could you at least provide the code to regenerate them? That way, we can handle the recaptioning internally. Much appreciated.

@Sierkinhane
Collaborator

Sierkinhane commented Nov 20, 2024

Hi, you can find the JourneyDB annotation here and questions.json in the directory ./training. For laion12m, you can recaption it using off-the-shelf MLLMs such as the Qwen series or ShareGPT4V.

@mmderakhshani
Author

Perfect, thanks a lot for this. Could you please let me know what your prompt is for recaptioning?

@Sierkinhane
Collaborator

Hi, maybe you can try "Describe this image and its style in a very detailed manner" or "Describe this image in as much detail as possible".
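For anyone following along, here is a rough sketch of how one of these prompts could drive a recaptioning loop. It is not the repository's code: `recaption`, `save_captions`, and the `caption_fn` callable are hypothetical names, `caption_fn` stands in for whatever MLLM inference call you wire up (Qwen, ShareGPT4V, or similar), and the flat filename-to-caption JSON layout is an assumption rather than the format the training config expects.

```python
import json
from pathlib import Path
from typing import Callable, Dict, Iterable

# One of the prompts suggested in this thread.
PROMPT = "Describe this image in as much detail as possible"


def recaption(
    image_paths: Iterable[Path],
    caption_fn: Callable[[Path, str], str],
    prompt: str = PROMPT,
) -> Dict[str, str]:
    """Caption every image with the same prompt.

    caption_fn is a placeholder (assumption): it takes an image path and a
    prompt, runs your MLLM of choice, and returns the generated caption.
    """
    captions: Dict[str, str] = {}
    for path in image_paths:
        # Key by file name; strip stray whitespace from the model output.
        captions[path.name] = caption_fn(path, prompt).strip()
    return captions


def save_captions(captions: Dict[str, str], out_path: Path) -> None:
    """Write the captions as a JSON object (assumed layout, not the repo's)."""
    Path(out_path).write_text(
        json.dumps(captions, ensure_ascii=False, indent=2),
        encoding="utf-8",
    )
```

To use it, pass a function that wraps your model's inference call as `caption_fn`, then adapt the output layout to whatever the data loader behind `external_laion12m_caption_path` actually reads.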

@mmderakhshani
Author

Perfect. Thanks for this. I will try it and get back to you, if you do not mind.

@Sierkinhane
Collaborator

Feel free to ask :)
