Skip to content

LLOD-Ru/OpenCorpora-RuWordNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

Sense-Annotated Corpus for Russian

The resource was obtained my manually annotating texts from the OpenCorpora corpus by senses of the Russian wordnet RuWordNet.

Statistics:

Entity Count
Documents 807
Sentences 6,751
Tokens 109,893
Annotated tokens 46,320
Lexical entries 17,126
Annotated lexical entries 10,683
RuWordNet synsets 8,619

Please, cite as:

Alexander Kirillovich, Natalia Loukachevitch, Maksim Kulaev, Angelina Bolshina, Dmitry Ilvovsky. Sense-Annotated Corpus for Russian // Proceedings of the 5th International Conference on Computational Linguistics in Bulgaria (CLIB 2022), Sofia, Bulgaria, 8–9 September 2022. Bulgarian Academy of Sciences (forthcoming).

Distributed under:

Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0).

For any questions:

[email protected].