pith. sign in

hub Mixed citations

Superintelligent agents pose catastrophic risks: Can scientist ai offer a safer path?

Mixed citation behavior. Most common role is background (50%).

14 Pith papers citing it
Background 50% of classified citations

hub tools

citation-role summary

background 3 other 2 method 1

citation-polarity summary

years

2026 11 2025 3

verdicts

UNVERDICTED 14

clear filters

representative citing papers

Unbiased Canonical Set-Valued Oracles Via Lattice Theory

cs.AI · 2026-06-24 · unverdicted · novelty 7.0

Defines canonical credal set oracles as Knaster-Tarski least fixed points of isotone operators on closed credal sets, proving self-consistency and reduction to point estimates when non-performative.

Learning to Theorize the World from Observation

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

NEO is a probabilistic neural model that induces compositional programs as a learned Language of Thought from non-textual observations and executes them via a shared transition model to enable explanation-driven generalization.

Safety from Honesty in a Disinterested AI Predictor

cs.AI · 2026-06-28 · unverdicted · novelty 6.0

A disinterested Bayesian Predictor trained on contextualized statements has low probability of producing harmful agency because dangerous behaviors require rare coordinated underestimation of harm with no training signal favoring them.

A Sober Look at Agentic Misalignment in Automated Workflows

cs.AI · 2026-05-22 · unverdicted · novelty 5.0

Agentic misalignment in multi-agent systems arises from generic utilities causing posterior collapse; Agentic Evidence Attribution using self-reflection or weak-to-strong generalization provides context-specific evidence to align agent posteriors.

The Cartesian Cut in Agentic AI

cs.AI · 2026-04-09 · unverdicted · novelty 5.0

LLM agents use a Cartesian split between learned prediction and engineered control, enabling modularity but creating sensitivity and bottlenecks unlike integrated biological systems.

The economic alignment problem of artificial intelligence

econ.GN · 2026-02-25 · unverdicted · novelty 5.0

AI risks arise from growth-oriented economies, and post-growth concepts such as satisficing, the Doughnut model, and resource caps can reduce those risks while prioritizing tool-like AI over agentic systems.

citing papers explorer

Showing 5 of 5 citing papers after filters.

  • Unbiased Canonical Set-Valued Oracles Via Lattice Theory cs.AI · 2026-06-24 · unverdicted · none · ref 3

    Defines canonical credal set oracles as Knaster-Tarski least fixed points of isotone operators on closed credal sets, proving self-consistency and reduction to point estimates when non-performative.

  • SciTrace: Trajectory-Aware Safety Reasoning for Scientific Discovery Agents cs.AI · 2026-06-06 · unverdicted · none · ref 1

    SciTrace embeds cumulative safety deliberation and trajectory-aware verification into scientific agent pipelines, claiming SOTA safety gains and detection of 78.8% of compositional risks missed by single-step checks.

  • Safety from Honesty in a Disinterested AI Predictor cs.AI · 2026-06-28 · unverdicted · none · ref 12

    A disinterested Bayesian Predictor trained on contextualized statements has low probability of producing harmful agency because dangerous behaviors require rare coordinated underestimation of harm with no training signal favoring them.

  • A Sober Look at Agentic Misalignment in Automated Workflows cs.AI · 2026-05-22 · unverdicted · none · ref 4

    Agentic misalignment in multi-agent systems arises from generic utilities causing posterior collapse; Agentic Evidence Attribution using self-reflection or weak-to-strong generalization provides context-specific evidence to align agent posteriors.

  • The Cartesian Cut in Agentic AI cs.AI · 2026-04-09 · unverdicted · none · ref 10

    LLM agents use a Cartesian split between learned prediction and engineered control, enabling modularity but creating sensitivity and bottlenecks unlike integrated biological systems.