Abstract
To be able to use supervised machine learning methods in natural language processing, there is a need of labeled data in large quantities. In some cases, especially when there are multiple tasks conducted on the same data, the annotation process may become exhausting and time consuming for both the annotatore and interpreters. Thus, an effective annotation tool becomes crucial in order to both increase the annotation quality and reduce the annotation time. In this paper, a semi-automatic annotation tool, which aims to decrease the manual work and user faults, is proposed. The interface of the tool is designed in a user-friendly manner in order to ease the process. The characteristics and input/output formats of the tool is explained in detail within the paper. The effects on the speed and accuracy of the users are analyzed as well as automatic labeling accuracy with conducted performance tests. It is noted that a deep-learning model trained with a small dataset can decrease the manual entity annotation workload up to 78, 43%.
Translated title of the contribution | A Semi-Automatic Annotation Interface for Named Entity and Relation Annotation on Document Images |
---|---|
Original language | Turkish |
Title of host publication | UBMK 2019 - Proceedings, 4th International Conference on Computer Science and Engineering |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 47-52 |
Number of pages | 6 |
ISBN (Electronic) | 9781728139647 |
DOIs | |
Publication status | Published - Sept 2019 |
Event | 4th International Conference on Computer Science and Engineering, UBMK 2019 - Samsun, Turkey Duration: 11 Sept 2019 → 15 Sept 2019 |
Publication series
Name | UBMK 2019 - Proceedings, 4th International Conference on Computer Science and Engineering |
---|
Conference
Conference | 4th International Conference on Computer Science and Engineering, UBMK 2019 |
---|---|
Country/Territory | Turkey |
City | Samsun |
Period | 11/09/19 → 15/09/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.