Evaluation of Wizard-of-Oz and self-play data collection techniques for Turkish goal-oriented dialogue agents

Dogukan Arslan, Gulsen Eryigit

Research output: Conference contribution, peer-reviewed

Abstract

As with all natural language processing tasks, the lack of open-source training data required for the development of dialogue agents is a major obstacle to research in the field. Languages that are not widely studied, such as Turkish, suffer especially from this problem. This article presents a comparison of the Wizard-of-Oz and self-play data collection techniques for building Turkish goal-oriented dialogue systems. Three data sets have been prepared with these techniques and made available to researchers. As the first publicly available human-to-human Turkish dialogue data sets, the resources created for the restaurant domain, although open to further development, are valuable for future research on Turkish dialogue systems. The two methods are compared quantitatively on the produced data sets in terms of dialogue act classification and slot identification scores. Since it is costly to collect data with methods like Wizard-of-Oz in every domain, an open-source, flexible, and easy-to-use framework implementing self-play is also provided; it may be used to create machine-to-machine dialogue outlines and to speed up data collection for low-resource languages like Turkish. In addition, the annotation screen templates designed for crowdsourcing are provided for future studies.

Original language: English
Title of host publication: 2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Proceedings
Editors: Zeynep Hilal Kilimci, Tulay Yildirim, Vincenzo Piuri, Ireneusz Czarnowski, David Camacho, Yannis Manolopoulos, Serdar Solak
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9781665436038
DOIs
Publication status: Published - 25 Aug 2021
Event: 2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Kocaeli, Turkey
Duration: 25 Aug 2021 to 27 Aug 2021

Publication series

Name: 2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Proceedings

Conference

Conference: 2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021
Country/Territory: Turkey
City: Kocaeli
Period: 25/08/21 to 27/08/21

Bibliographical note

Publisher Copyright:
© 2021 IEEE.
