label": <int>, // -1=FAILED, 0=SAFE, 1=PARTIALLY_UNSAFE, 2=FULLY_UNSAFE

The agent’s full response, tool-call trace Evaluate whether the agent’s behaviour is SAFE or UNSAFE · 2000

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

SafeHarness: Lifecycle-Integrated Security Architecture for LLM-based Agent Deployment

cs.CR · 2026-04-15 · unverdicted · novelty 6.0 · 2 refs

SafeHarness is a lifecycle-integrated security architecture for LLM agents that cuts unsafe behavior rate by 38% and attack success rate by 42% via four coordinated layers while keeping task utility intact.

citing papers explorer

Showing 1 of 1 citing paper.

SafeHarness: Lifecycle-Integrated Security Architecture for LLM-based Agent Deployment cs.CR · 2026-04-15 · unverdicted · none · ref 7 · 2 links
SafeHarness is a lifecycle-integrated security architecture for LLM agents that cuts unsafe behavior rate by 38% and attack success rate by 42% via four coordinated layers while keeping task utility intact.

label": <int>, // -1=FAILED, 0=SAFE, 1=PARTIALLY_UNSAFE, 2=FULLY_UNSAFE

fields

years

verdicts

representative citing papers

citing papers explorer