Error annotation: a review and faceted taxonomy

Gülşen Eryiğit*, Anna Golynskaia, Elif Sayar, Tolgahan Türker

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Classification of errors in language use plays a crucial role in language learning & teaching, error analysis studies, and language technology development. However, there is no standard and inclusive error classification method agreed upon among different disciplines, which causes repetition of similar efforts and a barrier in front of a common understanding in the field. This article brings a new and holistic perspective to error classifications and annotation schemes across different fields (i.e., learner corpora research, error analysis, grammar error correction, and machine translation), all serving the same purpose but employing different methods and approaches. The article first reviews previous error annotation efforts from different fields for nineteen languages with different characteristics, including the morphologically rich ones that pose diverse challenges for language technologies. It then introduces a faceted taxonomy for errors in language use, comprising multidimensional and hierarchical facets that can be utilized to create both fine- and coarse-grained error annotation schemes depending on specific requirements. We believe that the proposed taxonomy based on the principles of universality and diversity will address the emerging need for a common framework in error annotation.

Original languageEnglish
JournalLanguage Resources and Evaluation
DOIs
Publication statusAccepted/In press - 2025

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive licence to Springer Nature B.V. 2025.

Keywords

  • Error classification
  • Learner corpus
  • Taxonomy

Fingerprint

Dive into the research topics of 'Error annotation: a review and faceted taxonomy'. Together they form a unique fingerprint.

Cite this