The feasibility analysis of re-ranking for N-best lists on English-Turkish machine translation

Ezgi Yildirim, Ahmet Cuneyd Tantug

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we present the results of re-ranking for N-best list on machine translations. The main purpose of this research is to determine the upper bound of MT success that can be gained by reordering possible candidate translations. We use Google Translate Research API1 as our Statistical Machine Translation (SMT) system to get the N-best lists consisting of possible Turkish translations for a given English sentence. We evaluate the effect of reordering using three simple methods: unigram count (UC), unigram ratio (UR), and first four characters match (FFCM). We collected 720 sentences in order to give to the SMT system, and then we used 3 different sets of Turkish translations of them to evaluate our work on the N-best lists. Success of re-ranking is determined by using BLEU metric, besides an inclusive investigation which is necessary especially for agglutinative languages (e.g. Turkish, Czech, Hungarian, and Finnish) is performed by using BLEU+ MT scoring tool. We observe an improvement in BLEU score from 31.71 for the baseline system to 35.46 which is about 11.81% relative for the re-ranked model using UR.

Original languageEnglish
Title of host publication2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013
DOIs
Publication statusPublished - 2013
Event2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013 - Albena, Bulgaria
Duration: 19 Jun 201321 Jun 2013

Publication series

Name2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013

Conference

Conference2013 IEEE International Symposium on Innovations in Intelligent Systems and Applications, IEEE INISTA 2013
Country/TerritoryBulgaria
CityAlbena
Period19/06/1321/06/13

Keywords

  • BLEU+
  • machine translation
  • n-best list
  • re-ranking

Fingerprint

Dive into the research topics of 'The feasibility analysis of re-ranking for N-best lists on English-Turkish machine translation'. Together they form a unique fingerprint.

Cite this