arXiv preprint arXiv:2412.15035 , year=

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps , author= · 2024 · arXiv 2412.15035

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

RedVox: Safety and Fairness Gaps in Speech Models Across Languages

cs.CL · 2026-06-25 · unverdicted · novelty 7.0

RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.

Multilingual Refusal Alignment for Safer Large Language Models

cs.CL · 2026-04-24 · conditional · novelty 5.0

English-only safety alignment fails to transfer cross-lingually, while multilingual DPO training on the new RefusEU dataset improves safety across 12 European languages without degrading Global MMLU performance.

citing papers explorer

Showing 2 of 2 citing papers.

RedVox: Safety and Fairness Gaps in Speech Models Across Languages cs.CL · 2026-06-25 · unverdicted · none · ref 104
RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.
Multilingual Refusal Alignment for Safer Large Language Models cs.CL · 2026-04-24 · conditional · none · ref 17
English-only safety alignment fails to transfer cross-lingually, while multilingual DPO training on the new RefusEU dataset improves safety across 12 European languages without degrading Global MMLU performance.

arXiv preprint arXiv:2412.15035 , year=

fields

years

verdicts

representative citing papers

citing papers explorer