- Fixed misaligned answers such that `answer_start` corresponded to `text`.
- Removed questions marked unanswerable by two or more answer reviewers.
- Fixed answer spans to be whole words, surrounded by space or punctuation characters.
- Fixed evaluation script to improve text normalization before scoring.
- Fixed space issues in answers and contexts.
- Added additional review answers for some questions.
articles: 48 paras: 2067 qs: 10570 as: 34726
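
The alignment and whole-word fixes listed above can be checked mechanically. The following is a minimal sketch, not the script used to produce this release: it assumes each answer carries an `answer_start` character offset and a `text` string indexed into a `context` string, and the helper names `is_aligned` and `is_whole_word_span` are hypothetical.

```python
import string

# Characters treated as word boundaries (an assumption: ASCII whitespace
# and ASCII punctuation only; the real cleanup may have used a wider set).
BOUNDARY = set(string.whitespace) | set(string.punctuation)


def is_aligned(context: str, answer_start: int, text: str) -> bool:
    """True when `text` occurs in `context` exactly at offset `answer_start`."""
    if answer_start < 0:
        return False
    return context[answer_start:answer_start + len(text)] == text


def is_whole_word_span(context: str, answer_start: int, text: str) -> bool:
    """True when the span is bounded on both sides by a context edge,
    whitespace, or punctuation."""
    end = answer_start + len(text)
    left_ok = answer_start == 0 or context[answer_start - 1] in BOUNDARY
    right_ok = end == len(context) or context[end] in BOUNDARY
    return left_ok and right_ok


if __name__ == "__main__":
    context = "The dataset was released in 2016."
    print(is_aligned(context, 4, "dataset"))          # offset matches the text
    print(is_aligned(context, 5, "dataset"))          # offset is off by one
    print(is_whole_word_span(context, 4, "data"))     # span cuts a word in half
    print(is_whole_word_span(context, 4, "dataset"))  # span covers a whole word
```

Running both checks over every answer would flag entries where the offset and the text disagree, or where a span ends mid-word.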
- First release of dataset.
articles: 48 paras: 2067 qs: 10600 as: 33615