-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Where can I find the the-stack-v2-train-extras and LHQ datasets? #7
Comments
Hi, all the extras will be available in a few weeks along with the stack v2's content |
Hi, any updates for |
@loubnabnl Any updates? |
Hi, any updates? |
@loubnabnl @bigximik @anton-l @iNeil77 @lvwerra Hi, any updates? |
@loubnabnl Hi, any updates? |
1 similar comment
@loubnabnl Hi, any updates? |
Hello, is this still on your release schedule? |
HI! aNY UPDATE?11 |
Still no updates? Bigcode seems not working well these days... Edit: maybe incorrect. I guess this one is the dataset they used: https://huggingface.co/datasets/bigcode/stack-exchange-preferences-20230914-clean-anonymization |
Thanks for your wonderful work! In https://huggingface.co/datasets/bigcode/the-stack-v2-dedup, I can only find the-stack-v2-train-smol and the-stack-v2-train-full data. I'm wondering where can I find the the-stack-v2-train-extras and LHQ datasets? Do you have a plan to release it?
The text was updated successfully, but these errors were encountered: