pith. sign in

General agents contain world models, 2025

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

years

2026 4

verdicts

UNVERDICTED 4

clear filters

representative citing papers

The Impossibility of Eliciting Latent Knowledge

cs.AI · 2026-06-10 · unverdicted · novelty 7.0

Proves that no behavior-dependent feedback training strategy can guarantee an honest agent for latent knowledge even with perfect training feedback.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • The Impossibility of Eliciting Latent Knowledge cs.AI · 2026-06-10 · unverdicted · none · ref 31

    Proves that no behavior-dependent feedback training strategy can guarantee an honest agent for latent knowledge even with perfect training feedback.