Beyond Human-Only: Evaluating Human-Machine Collaboration for Collecting High-Quality Translation Data

Zhongtao Liu, Parker Riley, Daniel Deutsch, Alison Lui, Mengmeng Niu, Apurva Shah · 2024 · DOI 10.18653/v1/2024.wmt-1.110

4 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Measuring User's Mental Models of Speech Translation in Human-AI Collaboration

cs.CL · 2026-06-23 · unverdicted · novelty 6.0

A cross-lingual QA framework shows that users build stronger mental models of MT systems through practice and source-language knowledge, mainly by spotting surface-level errors; transcriptions help further.

Breaking the Likelihood Trap: Variance-Calibrated Modulation for Large Language Model Decoding

cs.CL · 2026-06-21 · unverdicted · novelty 6.0

VCM is a training-free decoding intervention that applies PMI-driven token elevation and variance-adaptive penalization to reduce repetitive degeneration in LLM open-ended generation.

Reinforcement Learning for Compositional Generalization with Outcome-Level Optimization

cs.LG · 2026-05-06 · unverdicted · novelty 4.0

Outcome-level RL with binary or composite rewards improves compositional generalization over supervised fine-tuning by avoiding overfitting to frequent training patterns.

SemEval-2026 Task 7: Everyday Knowledge Across Diverse Languages and Cultures

cs.CL · 2026-05-04 · unverdicted · novelty 4.0

SemEval-2026 Task 7 presents a benchmark and two evaluation tracks for assessing LLMs on everyday knowledge in diverse languages and cultures without allowing training on the test data.
