Verbal-MWE-annotations

We provide users with annotations of verbal multiword expressions (VMWEs) on English Ontonotes. Regarding methods we exploit for annotations, please refer to [1].

Files

VMWE annotations (vmwe_indices.txt)

Indices of components of VMWEs in sentences.

2 tab-separated columns:

MWE (e.g., "break silence")
Instance_id (e.g., wsj_1279_1:12_17)

A format of instance id is [filename]_[sentence_id]:[token_indices].
- Filename (e.g., wsj_1279)
- Sentence ID (0-origin)
- Token indices of components of a VMWE (1-origin)

Dependency (ontonotes_wsj_00_24_masked.conll)

Stanford dependencies [2] of the Wall Street Journal portion of Ontonotes Release 5.0 (LDC2013T19).

Conll Format

1 token per line, with blank lines separating sentences.

9 tab-separated columns (columns 1-8 are based on CoNLL-X Format [3]):

ID
FORM (masked)
LEMMA (masked)
CPOSTAG
POSTAG
FEATS
HEAD
DEPREL
Filename

References

[1] Akihiko Kato, Hiroyuki Shindo and Yuji Matsumoto. 2018. Construction of Large-scale English Verbal Multiword Expression Annotated Corpus. LREC 2018 (to appear)
[2] Marie-Catherine de Marneffe, Christopher D. Manning. 2008. The Stanford Typed Dependencies Representation. Coling 2008: Proceedings of the workshop on Cross-Framework and Cross-Domain Parser Evaluation, pages 1–8, Manchester, UK. Coling 2008 Organizing Committee. (http://www.aclweb.org/anthology/W08-1301)
[3] CoNLL-X Shared Task: Multi-lingual Dependency Parsing (http://ilk.uvt.nl/conll/)

History

Ver 1.0: 2018-03-04.

Contact

Please e-mail kato.akihiko.ju6 /at/ is.naist.jp with questions.

Contributors

Akihiko Kato
Hiroyuki Shindo
Yuji Matsumoto

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
README.md		README.md
ontonotes_wsj_00_24_masked.conll		ontonotes_wsj_00_24_masked.conll
readme.txt		readme.txt
vmwe_indices.txt		vmwe_indices.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Verbal-MWE-annotations

Files

VMWE annotations (vmwe_indices.txt)

Dependency (ontonotes_wsj_00_24_masked.conll)

Conll Format

References

History

Contact

Contributors

About

Releases

Packages

naist-cl-parsing/Verbal-MWE-annotations

Folders and files

Latest commit

History

Repository files navigation

Verbal-MWE-annotations

Files

VMWE annotations (vmwe_indices.txt)

Dependency (ontonotes_wsj_00_24_masked.conll)

Conll Format

References

History

Contact

Contributors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages