Skip to content

Aatlantise/k-snacs

 
 

Repository files navigation

K-SNACS Dataset and Guidelines

Korean Semantic Network of Adposition and Case Supersense

Dataset

File: little_prince_ko.tsv

Data Version:

  • Current: 1.0
  • Compatible with K-SNACS Guidelines v0.9

Data Info:

  • Title: 어린 왕자 (erin wangca) "The Little Prince"
  • Author: Atoine de Saint-Exupéry
  • Original Language: French (Le Petit Prince)
  • Genre: Childrens literature, Novella

Column Description:

  • doc_id: chapter number, starts at 1
  • sent_id: sentence id, starts at 1
  • token_id: token id, starts at 1. Stacked postpositions receive token_id-N designation, where N is a sequencial value starting at 1 (as an example see doc_id 1, sent_id 14, token_id 1).
  • form: word form
  • morph: morphological analysis obtained from KOMA Tagger.
  • p: postposition
  • gold_scene: gold adjudicated scene role label
  • gold_function: gold adjudicated function label

License: This dataset's supersense annotations are licensed under CC BY 4.0 (Creative Commons Attribution-ShareAlike 4.0 International license).

K-SNACS Guidelines

File: k-snacs-guideline-appendix-v0.9.pdf

Guideline Version:

  • Current: 0.9
  • Compatible with English SNACS v2.5
  • Please note that this document is an appendix to the above English SNACS guidelines, including only language-specific information that merits further detailing. For full definitions of labels and use cases, please refer to English guidelines.

Paper

Please cite the following when using this data:

Hwang et al., 2020:

Hwang, Jena D., Hanwool Choe, Na-Rae Han, and Nathan Schneider. "K-SNACS: Annotating Korean adposition semantics." In Proceedings of the Second International Workshop on Designing Meaning Representations. 2020.

K-SNACS Team

Key Collaborators:

Special Thanks to:

This research was supported in part by:

  • NSF award IIS-1812778
  • BSF grant 2016375

About

K-SNACS dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%