Representation of morphosyntactic units and coordination structures in the Turkish dependency treebank

Umut Sulubacak, Gülşen Eryiğit

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

2 Citations (Scopus)

Abstract

This paper presents our preliminary conclusions from an ongoing effort to construct a new dependency representation framework for Turkish. We aim for this framework to accommodate the highly agglutinative morphology of Turkish and to allow the annotation of unedited web data, and we shape our decisions around these considerations. First, we describe a novel syntactic representation for morphosyntactic sub-word units (namely, inflectional groups (IGs) in Turkish) that allows inter-IG relations to be discerned with perfect accuracy without having to hide lexical information. Second, we investigate alternative annotation schemes for coordination structures and present a scheme that improves on the one in the Turkish Treebank (Oflazer et al., 2003), with a nearly 11% increase in recall scores, in terms of both parsing accuracy and compatibility with colloquial language.

Original language: English
Title of host publication: SPMRL 2013 - 4th Workshop on Statistical Parsing of Morphologically Rich Languages, Proceedings of the Workshop
Publisher: Association for Computational Linguistics (ACL)
Pages: 129-134
Number of pages: 6
ISBN (Electronic): 9781937284978
Publication status: Published - 2013
Event: 4th Workshop on Statistical Parsing of Morphologically Rich Languages, SPMRL 2013 - Seattle, United States
Duration: 18 Oct 2013 → …

Publication series

Name: SPMRL 2013 - 4th Workshop on Statistical Parsing of Morphologically Rich Languages, Proceedings of the Workshop

Conference

Conference: 4th Workshop on Statistical Parsing of Morphologically Rich Languages, SPMRL 2013
Country/Territory: United States
City: Seattle
Period: 18/10/13 → …

Bibliographical note

Publisher Copyright:
© 2013 Association for Computational Linguistics.
