pith. machine review for the scientific record. sign in

arxiv: 1509.09088 · v1 · submitted 2015-09-30 · 💻 cs.CL · stat.ML

Recognition: unknown

Enhanced Bilingual Evaluation Understudy

Authors on Pith no claims yet
classification 💻 cs.CL stat.ML
keywords evaluationtechniquebilingualbleuexistinghumanmethodsunderstudy
0
0 comments X
read the original abstract

Our research extends the Bilingual Evaluation Understudy (BLEU) evaluation technique for statistical machine translation to make it more adjustable and robust. We intend to adapt it to resemble human evaluation more. We perform experiments to evaluate the performance of our technique against the primary existing evaluation methods. We describe and show the improvements it makes over existing methods as well as correlation to them. When human translators translate a text, they often use synonyms, different word orders or style, and other similar variations. We propose an SMT evaluation technique that enhances the BLEU metric to consider variations such as those.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Analyzing Chain of Thought (CoT) Approaches in Control Flow Code Deobfuscation Tasks

    cs.SE 2026-04 unverdicted novelty 4.0

    CoT prompting improves LLM performance on control-flow deobfuscation of C benchmarks, yielding ~16% better CFG reconstruction and ~20.5% better semantic preservation for GPT5 versus zero-shot prompting.