TY - GEN
T1 - Disambiguating main POS tags for Turkish
AU - Ehsani, Razieh
AU - Alper, Muzaffer Ege
AU - Eryiǧit, Gülşen
AU - Adali, Eşref
PY - 2012
Y1 - 2012
N2 - This paper presents the results of main part-of-speech tagging of Turkish sentences using Conditional Random Fields (CRFs). Although CRFs are applied to many different languages for part-of-speech (POS) tagging, Turkish poses interesting challenges to be modeled with them. The challenges include issues related to the statistical model of the problem as well as issues related to computational complexity and scaling. In this paper, we propose a novel model for main-POS tagging in Turkish. Furthermore, we propose some approaches to reduce the computational complexity and allow better scaling characteristics or improve the performance without increased complexity. These approaches are discussed with respect to their advantages and disadvantages. We show that the best approach is competitive with the current state of the art in accuracy and also in training and test durations. The good results obtained imply a good first step towards full morphological disambiguation.
AB - This paper presents the results of main part-of-speech tagging of Turkish sentences using Conditional Random Fields (CRFs). Although CRFs are applied to many different languages for part-of-speech (POS) tagging, Turkish poses interesting challenges to be modeled with them. The challenges include issues related to the statistical model of the problem as well as issues related to computational complexity and scaling. In this paper, we propose a novel model for main-POS tagging in Turkish. Furthermore, we propose some approaches to reduce the computational complexity and allow better scaling characteristics or improve the performance without increased complexity. These approaches are discussed with respect to their advantages and disadvantages. We show that the best approach is competitive with the current state of the art in accuracy and also in training and test durations. The good results obtained imply a good first step towards full morphological disambiguation.
UR - http://www.scopus.com/inward/record.url?scp=84882971166&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84882971166
SN - 9789573079255
T3 - Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012
SP - 202
EP - 213
BT - Proceedings of the 24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012
T2 - 24th Conference on Computational Linguistics and Speech Processing, ROCLING 2012
Y2 - 21 September 2012 through 22 September 2012
ER -