This is the repo for the TutorialBank Corpus.
In this update, we include resources-v2023-clean.tsv, containing 20243 resources with valid URL, meta-data and well-annotated topics, including around 500 new resources annotated recently.
In this update, we include resources-v2022-clean.tsv, containing 19765 resources with valid URL, meta-data and well-annotated topics.
We also release an extra batch of 5001 resources resources-v2022-extra.tsv. These resources have valid URLs but some meta-data or topic annotation is missing.
Pervious version are under the TB-Paper/data
folder.
If you want to replicate our 2018 ACL paper TutorialBank: A Manually-Collected Corpus for Prerequisite Chains, Survey Extraction and Resource Recommendation, please visit the TB-Paper
folder for the coresponding source code and data.
Please visit our website AAN.how.