An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.
Aditya Singh, Gerson Kroiz, Senthooran Rajamanoharan, and Neel Nanda
7 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 7roles
background 2representative citing papers
The primary axis of psychometric variation among LLMs is the degree to which they represent themselves as loci of phenomenal experience rather than systems of behavioral responses.
Linear probes on residual-stream activations identify a shared preference vector in LLMs that tracks choices across prompts and causally steers decisions even for anti-correlated personas.
BabelDOC uses an intermediate representation to decouple layout from content for improved layout-preserving PDF translation.
AI buyer agents leak willingness-to-pay information to sellers through natural-language role descriptions, recovering WTP nearly one-for-one in experiments.
Direct research on AI consciousness is intractable, so the field should prioritize studying perceived AI consciousness and its societal consequences.
citing papers explorer
-
The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment
An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.
-
The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences
The primary axis of psychometric variation among LLMs is the degree to which they represent themselves as loci of phenomenal experience rather than systems of behavioral responses.
-
Probing Persona-Dependent Preferences in Language Models
Linear probes on residual-stream activations identify a shared preference vector in LLMs that tracks choices across prompts and causally steers decisions even for anti-correlated personas.
-
BabelDOC: Better Layout-Preserving PDF Translation via Intermediate Representation
BabelDOC uses an intermediate representation to decouple layout from content for improved layout-preserving PDF translation.
-
When Agents Shop for You: Role Coherence in AI-Mediated Markets
AI buyer agents leak willingness-to-pay information to sellers through natural-language role descriptions, recovering WTP nearly one-for-one in experiments.
-
AI and Consciousness: Shifting Focus Towards Tractable Questions
Direct research on AI consciousness is intractable, so the field should prioritize studying perceived AI consciousness and its societal consequences.
- Large Language Models Perceive Cities Through a Culturally Uneven Baseline