pith. sign in

arXiv preprint arXiv:2312.15198 , year =

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

years

2026 3 2025 2

verdicts

UNVERDICTED 5

representative citing papers

Explicit Trait Inference for Multi-Agent Coordination

cs.AI · 2026-04-21 · unverdicted · novelty 6.0

ETI lets LLM agents infer and track partners' psychological traits (warmth and competence) from histories, cutting payoff loss 45-77% in games and boosting performance 3-29% on MultiAgentBench versus CoT baselines.

Understanding the Mechanism of Altruism in Large Language Models

econ.GN · 2026-04-21 · unverdicted · novelty 6.0

A small set of sparse autoencoder features in LLMs drives shifts between generous and selfish allocations in dictator games, with causal patching and steering confirming their role and generalization to other social games.

Extreme Self-Preference in Language Models

cs.AI · 2025-09-30 · unverdicted · novelty 6.0

Eight LLMs exhibited massive self-preference that followed assigned identities rather than true ones, appearing in both simple word tasks and consequential evaluations of job candidates and AI technologies.

citing papers explorer

Showing 5 of 5 citing papers.