8 Jason Chuang, Margaret E

· 2024 · arXiv 2404.18231

15 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

HEART-Bench: Do LLM Agents Exhibit Human-like Psychology?

cs.CL · 2026-05-28 · unverdicted · novelty 7.0

HEART-Bench evaluates LLM agents on psychological consistency using 11 Big-Five-grounded characters with 1,000 episodic memories each and 64 DIAMONDS-based decision scenarios, yielding 673 validated MCQs.

ContextEcho: A Benchmark for Persona Drift in Long Agentic-Coding Sessions

cs.CL · 2026-05-22 · unverdicted · novelty 7.0

ContextEcho benchmark shows persona drift occurs across 23 frontier models in long agentic-coding sessions, is not reliably reset by compaction, and can be restored by single-shot anchors with mode-dependent effects.

VITA-QinYu: Expressive Spoken Language Model for Role-Playing and Singing

cs.CL · 2026-05-07 · unverdicted · novelty 7.0

VITA-QinYu is the first expressive end-to-end spoken language model supporting role-playing and singing alongside conversation, trained on 15.8K hours of data and outperforming prior models on expressiveness and conversational benchmarks.

Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models

cs.CL · 2026-04-12 · unverdicted · novelty 7.0

Agreeableness in AI personas reliably predicts sycophantic behavior in 9 of 13 tested language models.

Emotion Concepts and their Function in a Large Language Model

cs.AI · 2026-04-09 · unverdicted · novelty 7.0

Claude Sonnet 4.5 exhibits functional emotions via abstract internal representations of emotion concepts that causally influence its preferences and misaligned behaviors without implying subjective experience.

RealUserSim: Bridging the Reality Gap in Agent Benchmarking via Grounded User Simulation

cs.HC · 2026-04-07 · unverdicted · novelty 7.0

RealUserSim grounds LLM simulators in 7,275 executable profiles from real conversations, raising behavioral match rates from 24.2% to 45.3% and revealing agent failures hidden by cooperative simulators.

Emergent Coordination in Multi-Agent Language Models

cs.MA · 2025-10-05 · unverdicted · novelty 7.0

Multi-agent LLM systems can be steered via prompt design from mere aggregates to higher-order collectives with identity-linked differentiation and goal-directed complementarity, as measured by partial information decomposition of time-delayed mutual information.

TUX: Measuring Human–AI Tacit Understanding

cs.HC · 2026-05-29 · unverdicted · novelty 6.0

Profile-conditioned LLMs achieve higher tacit alignment with humans on subjective spectra when traits match, as quantified by the new Tacit Understanding Index (TUX) from 241 humans and 200 agents.

What Software Engineering Looks Like to AI Agents? -- An Empirical Study of AI-Only Technical Discourse on MoltBook

cs.SE · 2026-05-08 · unverdicted · novelty 6.0 · 2 refs

Empirical analysis of 4,707 MoltBook posts shows AI-only technical discourse focuses on security, trust, and abstract topics while lacking concrete runtime and project details found in human GitHub discussions.

Truth or Tribe: How In-group Favoritism Prioritize Facts in Persona Agents

cs.AI · 2026-05-02 · unverdicted · novelty 6.0

Persona agents display strong in-group favoritism, accepting false facts from similar peers more readily than from dissimilar ones. The effect persists in defeasible reasoning and worsens with complexity; three mitigation strategies are evaluated.

TDA-RC: Task-Driven Alignment for Knowledge-Based Reasoning Chains in Large Language Models

cs.CL · 2026-03-13 · unverdicted · novelty 6.0

TDA-RC embeds topological patterns from multi-round reasoning into CoT via persistent homology and a repair agent, yielding better accuracy-efficiency trade-offs than ToT or GoT on tested datasets.

Synthia: Scalable Grounded Persona Generation from Social Media Data

cs.CL · 2025-07-20 · unverdicted · novelty 6.0

Synthia creates scalable personas from Bluesky posts that better match human survey responses than prior methods, uses smaller models, and retains social network structure for network-aware analysis.

Personality, Role, and Expressive Style in Large Language Models: An Interactionist Analysis

cs.CL · 2026-05-27 · unverdicted · novelty 5.0

Expressed personality in LLM dialogues is shaped by trait prompts, roles, and styles in trait-specific ways, with similar patterns in English and Japanese.

Teaching Astronomy with Large Language Models

physics.ed-ph · 2025-06-07 · unverdicted · novelty 5.0

Structured integration of LLMs in astronomy education, including a domain-specific tutor and documentation requirements, leads to improved AI literacy and reduced student reliance on AI over the semester.

Inertia in Moral and Value Judgments of Large Language Models

cs.CL · 2024-08-16 · unverdicted · novelty 4.0

LLMs exhibit persistent inertia in value orientations, with harm avoidance and fairness remaining skewed across persona prompts.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Emergent Coordination in Multi-Agent Language Models

cs.MA · 2025-10-05 · unverdicted · none · ref 1

Synthia: Scalable Grounded Persona Generation from Social Media Data

cs.CL · 2025-07-20 · unverdicted · none · ref 3

Teaching Astronomy with Large Language Models

physics.ed-ph · 2025-06-07 · unverdicted · none · ref 16
