Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Inquiry About Issues and Pull Requests Data in The Stack V2 Dataset #13

Open
xiuxiu opened this issue Oct 22, 2024 · 0 comments
Open

Inquiry About Issues and Pull Requests Data in The Stack V2 Dataset #13

xiuxiu opened this issue Oct 22, 2024 · 0 comments

Comments

@xiuxiu
Copy link

xiuxiu commented Oct 22, 2024

Hello,

I hope this message finds you well. I noticed that the recently released The Stack V2 dataset does not include issues and pull request data. I am interested in understanding whether there are any plans to incorporate this information in future releases.

Having access to issues and pull requests would significantly enhance the dataset's utility for research and analysis. Any updates or insights you could provide would be greatly appreciated.

Thank you for your work on this project!

NOTE: repo_licenses_s3 and commit_paris_files_s3 will be released later and we reccomend compilin your own sets for up to date information, those data sets are compiled in other parts of SC2 data pipeline. opt_outs_dataset_name will not be release as it is confidential data, so it is needed to compile such data for your project. Please ask on BigCode comunty genral forums on Slack for more details.

Best regards,
xiuxiu

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant