TY - GEN
T1 - BLEU+
T2 - 6th International Conference on Language Resources and Evaluation, LREC 2008
AU - Tantuğ, A. Cüneyd
AU - Oflazer, Kemal
AU - El-Kahlout, İlknur D.
PY - 2008
Y1 - 2008
N2 - We present a tool, BLEU+, which implements various extensions to BLEU computation to allow for a better understanding of translation performance, especially for morphologically complex languages. BLEU+ takes into account both "closeness" in morphological structure and "closeness" of the root words in the WordNet hierarchy when comparing tokens in the candidate and reference sentences. In addition to gauging performance at a finer level of granularity, BLEU+ also allows the computation of various upper-bound oracle scores: comparing all tokens considering only the roots gives an upper bound for the case where all errors due to morphological structure are fixed, while comparing tokens in an error-tolerant way, considering minor morpheme edit operations, gives a (more realistic) upper bound for the case where tokens that differ in morpheme insertions, deletions, and substitutions are fixed. We use BLEU+ in the fine-grained evaluation of the output of our English-to-Turkish statistical MT system.
AB - We present a tool, BLEU+, which implements various extensions to BLEU computation to allow for a better understanding of translation performance, especially for morphologically complex languages. BLEU+ takes into account both "closeness" in morphological structure and "closeness" of the root words in the WordNet hierarchy when comparing tokens in the candidate and reference sentences. In addition to gauging performance at a finer level of granularity, BLEU+ also allows the computation of various upper-bound oracle scores: comparing all tokens considering only the roots gives an upper bound for the case where all errors due to morphological structure are fixed, while comparing tokens in an error-tolerant way, considering minor morpheme edit operations, gives a (more realistic) upper bound for the case where tokens that differ in morpheme insertions, deletions, and substitutions are fixed. We use BLEU+ in the fine-grained evaluation of the output of our English-to-Turkish statistical MT system.
UR - http://www.scopus.com/inward/record.url?scp=84925992419&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84925992419
T3 - Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008
SP - 1493
EP - 1499
BT - Proceedings of the 6th International Conference on Language Resources and Evaluation, LREC 2008
PB - European Language Resources Association (ELRA)
Y2 - 28 May 2008 through 30 May 2008
ER -