LLMs for Document-Level Text Simplification in Turkish Foreign Language Learning

Fatih Bektaş, Kutay Arda Dinç, Gülşen Eryiǧit

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study presents the first investigation into the use of large language models (LLMs) for document-level text simplification targeting Turkish foreign language learning. ChatGPT-40 is utilized to simplify six Turkish stories to A1, A2, and B1 proficiency levels and evaluated on a parallel corpus ( i.e., these stories and their simplified versions at A1, A2, and B1 proficiency levels). The model is prompted with specific simplification rules and examples for these rules. We evaluate the performance using multiple metrics including BLEU, SARI, D-SARI, and BERTScore. Our results show that ChatGPT-40 can generate simplified texts comparable inlength and content to human-simplified references. This research addresses the scarcity of reading materials for Turkish learners, a challenge that many other languages also face. It demonstrates the potential of LLMs in producing level-appropriate simplified texts, opening new avenues for automated text simplification in language education.

Original languageEnglish
Title of host publicationUBMK 2024 - Proceedings
Subtitle of host publication9th International Conference on Computer Science and Engineering
EditorsEsref Adali
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages193-197
Number of pages5
ISBN (Electronic)9798350365887
DOIs
Publication statusPublished - 2024
Event9th International Conference on Computer Science and Engineering, UBMK 2024 - Antalya, Turkey
Duration: 26 Oct 202428 Oct 2024

Publication series

NameUBMK 2024 - Proceedings: 9th International Conference on Computer Science and Engineering

Conference

Conference9th International Conference on Computer Science and Engineering, UBMK 2024
Country/TerritoryTurkey
CityAntalya
Period26/10/2428/10/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • ChatGPT-40
  • Document-level text simplification
  • Large language models (LLMs)
  • Turkish as a foreign language

Fingerprint

Dive into the research topics of 'LLMs for Document-Level Text Simplification in Turkish Foreign Language Learning'. Together they form a unique fingerprint.

Cite this