ReAct is a corpus of 1.2k+ carefully curated review comments extracted from ICLR papers reviewed at OpenReview for the year 2018, with human annotations to two types of labels. The review comments were randomly selected from a large pool of English language peer reviews. The annotations were performed by Mechanical Turk Masters located in the U.S. Demographic information about the reviewers and annotators is unknown and may not be representative of global diversity.
Below is a summary of the dataset statistics:
Parameters | Values |
---|---|
Number of examples | 1,250 |
Number of 'Label1' classes | 2 |
Number of 'Label2' classes | 7 |
Average token length of text | ~23 |
No. of samples w/ 4+ raters agreeing on 'Label1' | 988 (79.04%) |
No. of samples w/ 4+ raters agreeing on 'Label2' | 876 (70.08%) |
The 'Label1' categories are: actionable, non_actionable.
The 'Label2' categories are: agreement, disagreement, suggestion, question, shortcoming, fact and other.
Our processed dataset includes all annotations for a single review comment aggregated over all the turkers. Each row represents an annotation for a single review comment. This file includes the following columns:
ReviewId: The unique id of the review.
SentenceId: The unique id of sentence within a review.
Text: The text of the review sentence (or comment).
Label1: The annotated class for 'Label1'.
Label2: The annotated class for 'Label2'.
set: The (train-test) split, a comments belongs to.
Our raw annotated dataset includes all annotations as well as metadata on the comments. Each row represents a single rater's annotation for a single review. This file includes an addition column:
TurkerId: The unique id of the turker.
Finally, the unlabelled dataset includes all the unlabelled review comments (52k+), a subset of which is used for annotation. Each new line represents a review comment.
Gautam Choudhary, Natwar Modani, Nitish Maurya
This work is accepted at International conference on Web Information Systems Engineering (WISE), 2021. If you use this dataset for your publication, please cite the original paper:
@inproceedings{choudhary2021react,
title={ReAct: A Review Comment Dataset for Actionability (and more)},
author={Choudhary, Gautam and Modani, Natwar and Maurya, Nitish},
booktitle={International Conference on Web Information Systems Engineering},
pages={336--343},
year={2021},
organization={Springer}
}
ReAct is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
NOTE: Please refer to the LICENSE file for detailed information.