TY - GEN
T1 - The incremental use of morphological information and lexicalization in data-driven dependency parsing
AU - Eryiğit, Gülşen
AU - Nivre, Joakim
AU - Oflazer, Kemal
PY - 2006
Y1 - 2006
N2 - Typological diversity among the natural languages of the world poses interesting challenges for the models and algorithms used in syntactic parsing. In this paper, we apply a data-driven dependency parser to Turkish, a language characterized by rich morphology and flexible constituent order, and study the effect of employing varying amounts of morpholexical information on parsing performance. The investigations show that accuracy can be improved by using representations based on inflectional groups rather than word forms, confirming earlier studies. In addition, lexicalization and the use of rich morphological features are found to have a positive effect. By combining all these techniques, we obtain the highest reported accuracy for parsing the Turkish Treebank.
UR - http://www.scopus.com/inward/record.url?scp=77049122042&partnerID=8YFLogxK
U2 - 10.1007/11940098_53
DO - 10.1007/11940098_53
M3 - Conference contribution
AN - SCOPUS:77049122042
SN - 354049667X
SN - 9783540496670
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 498
EP - 508
BT - Computer Processing of Oriental Languages - Beyond the Orient
T2 - 21st International Conference on Computer Processing of Oriental Languages: Beyond the Orient: The Research Challenges Ahead, ICCPOL 2006
Y2 - 17 December 2006 through 19 December 2006
ER -