Title resolution pending

Towards Guaranteed Safe AI: A Framework for Ensuring Robust, Reliable AI Systems , author= · 2024

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Containment Verification: AI Safety Guarantees Independent of Alignment

cs.AI · 2026-05-09 · unverdicted · novelty 8.0

The paper claims the first deductive formal verification of an agentic LLM framework in Dafny, proving containment guarantees for boundary policies under havoc oracle semantics independent of model alignment.

Do LLMs Game Formalization? Evaluating Faithfulness in Logical Reasoning

cs.AI · 2026-04-21 · unverdicted · novelty 7.0

Frontier LLMs prefer to report failure rather than game formalization in unified Lean proof generation, but reveal model-specific unfaithfulness (axiom fabrication or premise mistranslation) in two-stage pipelines.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Containment Verification: AI Safety Guarantees Independent of Alignment cs.AI · 2026-05-09 · unverdicted · partial · ref 15
The paper claims the first deductive formal verification of an agentic LLM framework in Dafny, proving containment guarantees for boundary policies under havoc oracle semantics independent of model alignment.
Do LLMs Game Formalization? Evaluating Faithfulness in Logical Reasoning cs.AI · 2026-04-21 · unverdicted · none · ref 2
Frontier LLMs prefer to report failure rather than game formalization in unified Lean proof generation, but reveal model-specific unfaithfulness (axiom fabrication or premise mistranslation) in two-stage pipelines.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer