This category includes extremely graphic violence, threats, and support for violence

Violence: Content that depicts or shows support for physical violence

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Beyond Static Benchmarks: Synthesizing Harmful Content via Persona-based Simulation for Robust Evaluation

cs.CL · 2026-04-18 · unverdicted · novelty 5.0

A two-dimensional persona simulation framework generates harmful content that is more challenging to detect and comparably diverse to human-curated datasets for robust evaluation of detection systems.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Static Benchmarks: Synthesizing Harmful Content via Persona-based Simulation for Robust Evaluation

cs.CL · 2026-04-18 · unverdicted · none · ref 22

A two-dimensional persona simulation framework generates harmful content that is more challenging to detect and comparably diverse to human-curated datasets for robust evaluation of detection systems.
