Bridging the gap: Toward cognitive autonomy in artificial intelligence,

· 2025 · arXiv 2512.02280

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Cognitive Firewall: A Proactive, Zero-Trust, Multi-Gate Framework for LLM Safety

cs.CR · 2026-07-01 · unverdicted · novelty 5.0

Cognitive Firewall applies four gates (intent, zero-trust context, consistency, output risk) via an oversight model to cut jailbreak success to 2% or below on most tested sets while keeping over-refusal at 8%.

citing papers explorer

Showing 1 of 1 citing paper.

Cognitive Firewall: A Proactive, Zero-Trust, Multi-Gate Framework for LLM Safety cs.CR · 2026-07-01 · unverdicted · none · ref 14
Cognitive Firewall applies four gates (intent, zero-trust context, consistency, output risk) via an oversight model to cut jailbreak success to 2% or below on most tested sets while keeping over-refusal at 8%.

Bridging the gap: Toward cognitive autonomy in artificial intelligence,

fields

years

verdicts

representative citing papers

citing papers explorer