Skip to content
/ ReAct Public

A Review Comment Dataset for Actionability and fine grained Taxonomy of Labels

License

Notifications You must be signed in to change notification settings

gtmdotme/ReAct

Repository files navigation

ReAct

ReAct is a corpus of 1.2k+ carefully curated review comments extracted from ICLR papers reviewed at OpenReview for the year 2018, with human annotations to two types of labels. The review comments were randomly selected from a large pool of English language peer reviews. The annotations were performed by Mechanical Turk Masters located in the U.S. Demographic information about the reviewers and annotators is unknown and may not be representative of global diversity.

Below is a summary of the dataset statistics:

Parameters Values
Number of examples 1,250
Number of 'Label1' classes 2
Number of 'Label2' classes 7
Average token length of text ~23
No. of samples w/ 4+ raters agreeing on 'Label1' 988 (79.04%)
No. of samples w/ 4+ raters agreeing on 'Label2' 876 (70.08%)

The 'Label1' categories are: actionable, non_actionable.
The 'Label2' categories are: agreement, disagreement, suggestion, question, shortcoming, fact and other.

Data

Our processed dataset includes all annotations for a single review comment aggregated over all the turkers. Each row represents an annotation for a single review comment. This file includes the following columns:

ReviewId: The unique id of the review.
SentenceId: The unique id of sentence within a review.
Text: The text of the review sentence (or comment).
Label1: The annotated class for 'Label1'.
Label2: The annotated class for 'Label2'.
set: The (train-test) split, a comments belongs to.

Our raw annotated dataset includes all annotations as well as metadata on the comments. Each row represents a single rater's annotation for a single review. This file includes an addition column:

TurkerId: The unique id of the turker.

Finally, the unlabelled dataset includes all the unlabelled review comments (52k+), a subset of which is used for annotation. Each new line represents a review comment.

Contact

Gautam Choudhary, Natwar Modani, Nitish Maurya

Citation

This work is accepted at International conference on Web Information Systems Engineering (WISE), 2021. If you use this dataset for your publication, please cite the original paper:

@inproceedings{choudhary2021react,
  title={ReAct: A Review Comment Dataset for Actionability (and more)},
  author={Choudhary, Gautam and Modani, Natwar and Maurya, Nitish},
  booktitle={International Conference on Web Information Systems Engineering},
  pages={336--343},
  year={2021},
  organization={Springer}
}

License Information

Creative Commons License
ReAct is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.

NOTE: Please refer to the LICENSE file for detailed information.

About

A Review Comment Dataset for Actionability and fine grained Taxonomy of Labels

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published