Crucially, RATE consistently maintains superior performance over the baseline, confirming that our framework’s effectiveness holds across different backbone models.

As shown in the table, replacing the backbone model yields a slight performance improvement for both methods on the average meta scores · 2023

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation

cs.CL · 2026-01-12 · conditional · novelty 7.0

MENT benchmark plus RATE agentic evaluator raise combined system- and segment-level correlation with human judgments by at least 3.2 points over prior MT metrics and LLM judges.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation cs.CL · 2026-01-12 · conditional · none · ref 7
