TUR2SQL: A Cross-Domain Turkish Dataset For Text-to-SQL

Ali Bugra Kanburoglu, F. Boray Tek

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Citation (Scopus)

Abstract

Converting natural language into corresponding SQL queries with deep learning techniques has attracted significant attention in recent years. Existing Text-to-SQL datasets primarily focus on English and other languages such as Chinese, but resources for the Turkish language are lacking. In this study, we introduce TUR2SQL, the first publicly available cross-domain Turkish Text-to-SQL dataset. The dataset consists of 10,809 pairs of natural language statements and their corresponding SQL queries. We conducted experiments with SQLNet and ChatGPT on TUR2SQL. The results show that SQLNet achieves limited performance on the dataset, while ChatGPT achieves superior performance. We believe that TUR2SQL provides a foundation for further research on Turkish Text-to-SQL.

Original language: English
Title of host publication: UBMK 2023 - Proceedings
Subtitle of host publication: 8th International Conference on Computer Science and Engineering
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 206-211
Number of pages: 6
ISBN (Electronic): 9798350340815
Publication status: Published - 2023
Event: 8th International Conference on Computer Science and Engineering, UBMK 2023 - Burdur, Turkey
Duration: 13 Sept 2023 to 15 Sept 2023

Publication series

Name: UBMK 2023 - Proceedings: 8th International Conference on Computer Science and Engineering

Conference

Conference: 8th International Conference on Computer Science and Engineering, UBMK 2023
Country/Territory: Turkey
City: Burdur
Period: 13/09/23 to 15/09/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • ChatGPT
  • Dataset
  • SQLNet
  • Text-to-SQL
