You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The SEACrowd Instruct Multi-task Collection is a multi-task, instruction-formatted compilation of 29 preexisting datasets from SEACrowd. The collection comprises 332,040 question-answer pairs, to integrate the permissively licensed datasets in SEACrowd into the Data Provenance Explorer tool as well as multi-task instruction fine-tuning.
Subsets
Bhinneka Korpus - Translated, CebuaNER - Translated, EMoTES-3K - Translated, Fake News Filipino - Translated, Filipino Hate Speech Tiktok - Text - Translated, Filipino Slang Spelling Normalization - Translated, id-vaccines-tweets - Translated, identifikasi-bahasa - Translated, IndoNER-Tourism - Translated, Indonesia-Chinese-MTRobustEval - Translated, LimeSoda - Translated, MABL - Translated, MKQA - Translated, Multilabel Multiclass Sentiment and Emotion Dataset from Indonesian Mobile Application Review - Translated, Myanmar (Burmese) Name Romanization with Alignment on Grapheme-Level - Translated, NTREX-128 - Translated, SPAMID-PAIR - Translated, Tagalog Profanity Dataset - Translated, Thai Depression Dataset - Translated, Thai Toxicity Tweet Corpus - Translated, Typhoon Yolanda Tweets - Translated, UIT-ViCTSD - Translated, UIT-ViOCD - Translated, Vietnamese Social Media Emotion Corpus (UIT-VSMEC) - Translated, ViHealthQA - Translated, Wisesight Thai Sentiment Corpus - Translated, Wongnai Reviews - Translated, XNLI - Translated, XStoryCloze - Translated
Dataloader name:
seacrowd_instruct/seacrowd_instruct.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?seacrowd_instruct
The text was updated successfully, but these errors were encountered: