Eight AI models show split value priorities at the top layer, divergent evidence preferences in the middle, and broad convergence on institutional sources at the bottom, with substantial sensitivity to scenario framing.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.AI 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
PRISM defines 27 behavioral risk signals from structural anomalies in AI value, evidence, and source hierarchies, evaluated via dual thresholds on forced-choice data from seven models.
citing papers explorer
-
Measuring the Authority Stack of AI Systems: Empirical Analysis of 366,120 Forced-Choice Responses Across 8 AI Models
Eight AI models show split value priorities at the top layer, divergent evidence preferences in the middle, and broad convergence on institutional sources at the bottom, with substantial sensitivity to scenario framing.
-
PRISM Risk Signal Framework: Hierarchy-Based Red Lines for AI Behavioral Risk
PRISM defines 27 behavioral risk signals from structural anomalies in AI value, evidence, and source hierarchies, evaluated via dual thresholds on forced-choice data from seven models.