CoRR, abs/2506.11094

The Scales of Justitia: A Comprehensive Survey on Safety Evaluation of LLMs · 2025 · arXiv 2506.11094

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Safe for Whom? Rethinking How We Evaluate the Safety of LLMs for Real Users

cs.AI · 2025-12-11 · unverdicted · novelty 6.0

LLM safety evaluations for personal advice must test responses against diverse user vulnerability profiles, since context-blind ratings overestimate safety and realistic prompt context does not fix the problem.

LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models

cs.CL · 2025-08-07 · conditional · novelty 6.0

LLMEval-Fair introduces a dynamic, contamination-resistant evaluation framework for LLMs based on a large question bank and validates it via a 30-month study of nearly 60 models showing performance ceilings and hidden contamination issues.

citing papers explorer

Showing 2 of 2 citing papers.

Safe for Whom? Rethinking How We Evaluate the Safety of LLMs for Real Users cs.AI · 2025-12-11 · unverdicted · none · ref 11
LLM safety evaluations for personal advice must test responses against diverse user vulnerability profiles, since context-blind ratings overestimate safety and realistic prompt context does not fix the problem.
LLMEval-Fair: A Large-Scale Longitudinal Study on Robust and Fair Evaluation of Large Language Models cs.CL · 2025-08-07 · conditional · none · ref 8
LLMEval-Fair introduces a dynamic, contamination-resistant evaluation framework for LLMs based on a large question bank and validates it via a 30-month study of nearly 60 models showing performance ceilings and hidden contamination issues.

CoRR, abs/2506.11094

fields

years

verdicts

representative citing papers

citing papers explorer