Aditya Singh, Gerson Kroiz, Senthooran Rajamanoharan, and Neel Nanda

doi: 10 · 2023 · DOI 10.1038/s41586-023-06647-8

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

open at publisher browse 7 citing papers

citation-role summary

background 2

citation-polarity summary

background 1 unclear 1

representative citing papers

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

cs.CL · 2026-05-08 · unverdicted · novelty 7.0

An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.

The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences

cs.CL · 2026-05-06 · unverdicted · novelty 7.0

The primary axis of psychometric variation among LLMs is the degree to which they represent themselves as loci of phenomenal experience rather than systems of behavioral responses.

Probing Persona-Dependent Preferences in Language Models

cs.CL · 2026-05-13 · unverdicted · novelty 6.0 · 2 refs

Linear probes on residual-stream activations identify a shared preference vector in LLMs that tracks choices across prompts and causally steers decisions even for anti-correlated personas.

BabelDOC: Better Layout-Preserving PDF Translation via Intermediate Representation

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

BabelDOC uses an intermediate representation to decouple layout from content for improved layout-preserving PDF translation.

When Agents Shop for You: Role Coherence in AI-Mediated Markets

cs.MA · 2026-04-29 · unverdicted · novelty 5.0

AI buyer agents leak willingness-to-pay information to sellers through natural-language role descriptions, recovering WTP nearly one-for-one in experiments.

AI and Consciousness: Shifting Focus Towards Tractable Questions

cs.CY · 2026-05-07 · unverdicted · novelty 3.0

Direct research on AI consciousness is intractable, so the field should prioritize studying perceived AI consciousness and its societal consequences.

Large Language Models Perceive Cities Through a Culturally Uneven Baseline

cs.CL · 2026-04-21

citing papers explorer

Showing 7 of 7 citing papers.

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment cs.CL · 2026-05-08 · unverdicted · none · ref 142
An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.
The Pinocchio Dimension: Phenomenality of Experience as the Primary Axis of LLM Psychometric Differences cs.CL · 2026-05-06 · unverdicted · none · ref 36
The primary axis of psychometric variation among LLMs is the degree to which they represent themselves as loci of phenomenal experience rather than systems of behavioral responses.
Probing Persona-Dependent Preferences in Language Models cs.CL · 2026-05-13 · unverdicted · none · ref 5 · 2 links
Linear probes on residual-stream activations identify a shared preference vector in LLMs that tracks choices across prompts and causally steers decisions even for anti-correlated personas.
BabelDOC: Better Layout-Preserving PDF Translation via Intermediate Representation cs.CV · 2026-05-11 · unverdicted · none · ref 21
BabelDOC uses an intermediate representation to decouple layout from content for improved layout-preserving PDF translation.
When Agents Shop for You: Role Coherence in AI-Mediated Markets cs.MA · 2026-04-29 · unverdicted · none · ref 2
AI buyer agents leak willingness-to-pay information to sellers through natural-language role descriptions, recovering WTP nearly one-for-one in experiments.
AI and Consciousness: Shifting Focus Towards Tractable Questions cs.CY · 2026-05-07 · unverdicted · none · ref 122
Direct research on AI consciousness is intractable, so the field should prioritize studying perceived AI consciousness and its societal consequences.
Large Language Models Perceive Cities Through a Culturally Uneven Baseline cs.CL · 2026-04-21 · unreviewed · ref 23

Aditya Singh, Gerson Kroiz, Senthooran Rajamanoharan, and Neel Nanda

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer