Özet
This paper presents the state-of-the-art results in Turkish coreference resolution (CR) which is a task of determining sets of mentions which identify the same real-world entity (e.g. a person, a place, a thing, an event). The proposed system uses support vector machines and solves the CR task with a mention-pair model that basically accepts mention couples and decides on whether they are coreferential with each other or not. The results are evaluated on Marmara Turkish Coreference Corpus by using well-known evaluation metrics (viz. MUC, B3, BLANC and LEA). The introduced approach obtains F1 scores of 90.68% (MUC), 86.89% (B3), 85.13% (BLANC) and 78.34% (LEA) yielding an improvement of 9.12, 16.06, 13.08 and 12.57 percentage points respectively over a recent baseline system on Turkish CR. The paper introduces the system setup (SVM parameters and negative sampling strategy) as well as the selected features and analyzes the impact of these features on the Turkish CR task.
Orijinal dil | İngilizce |
---|---|
Ana bilgisayar yayını başlığı | 2018 IEEE (SMC) International Conference on Innovations in Intelligent Systems and Applications, INISTA 2018 |
Editörler | Plamen Angelov, Tulay Yildirim, Lazaros Iliadis, Yannis Manolopoulos |
Yayınlayan | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Elektronik) | 9781538651506 |
DOI'lar | |
Yayın durumu | Yayınlandı - 14 Eyl 2018 |
Etkinlik | 2018 IEEE International Conference on Innovations in Intelligent Systems and Applications, INISTA 2018 - Thessaloniki, Greece Süre: 3 Tem 2018 → 5 Tem 2018 |
Yayın serisi
Adı | 2018 IEEE (SMC) International Conference on Innovations in Intelligent Systems and Applications, INISTA 2018 |
---|
???event.eventtypes.event.conference???
???event.eventtypes.event.conference??? | 2018 IEEE International Conference on Innovations in Intelligent Systems and Applications, INISTA 2018 |
---|---|
Ülke/Bölge | Greece |
Şehir | Thessaloniki |
Periyot | 3/07/18 → 5/07/18 |
Bibliyografik not
Publisher Copyright:© 2018 IEEE.
Finansman
This work is part of a research project supported by ITU Scientific Research Projects Grant no: MYL-2017-40971.
Finansörler | Finansör numarası |
---|---|
International Technological University | MYL-2017-40971 |