- Do not penalize Plausibility just because it differs from the source; penalize only if it becomes illogical or self-contradictory

Plausibility (internal coherence / naturalness): - Whether the candidate is internally consistent, reads like a natural sentence that makes sense on its own

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation

cs.CL · 2026-04-06 · unverdicted · novelty 6.0

OmniScore is a family of lightweight deterministic learned metrics that approximate LLM-judge behavior for reliable multilingual evaluation of generative text in tasks such as QA, translation, and summarization.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond LLM-as-a-Judge: Deterministic Metrics for Multilingual Generative Text Evaluation cs.CL · 2026-04-06 · unverdicted · none · ref 23
OmniScore is a family of lightweight deterministic learned metrics that approximate LLM-judge behavior for reliable multilingual evaluation of generative text in tasks such as QA, translation, and summarization.

- Do not penalize Plausibility just because it differs from the source; penalize only if it becomes illogical or self-contradictory

fields

years

verdicts

representative citing papers

citing papers explorer