The incremental use of morphological information and lexicalization in data-driven dependency parsing

Gülşen Eryiǧit*, Joakim Nivre, Kemal Oflazer

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Citations (Scopus)

Abstract

Typological diversity among the natural languages of the world poses interesting challenges for the models and algorithms used in syntactic parsing. In this paper, we apply a data-driven dependency parser to Turkish, a language characterized by rich morphology and flexible constituent order, and study the effect of employing varying amounts of morpholexical information on parsing performance. The investigations show that accuracy can be improved by using representations based on inflectional groups rather than word forms, confirming earlier studies. In addition, lexicalization and the use of rich morphological features are found to have a positive effect. By combining all these techniques, we obtain the highest reported accuracy for parsing the Turkish Treebank.

Original languageEnglish
Title of host publicationComputer Processing of Oriental Languages - Beyond the Orient
Subtitle of host publicationThe Research Challenges Ahead - 21st International Conference, ICCPOL 2006, Proceedings
Pages498-508
Number of pages11
DOIs
Publication statusPublished - 2006
Event21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006 - Singapore, Singapore
Duration: 17 Dec 200619 Dec 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4285 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006
Country/TerritorySingapore
CitySingapore
Period17/12/0619/12/06

Fingerprint

Dive into the research topics of 'The incremental use of morphological information and lexicalization in data-driven dependency parsing'. Together they form a unique fingerprint.

Cite this