arXiv preprint arXiv:2307.00184 (2023)

19 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 2 · baseline: 1

citation-polarity summary

background: 1 · baseline: 1 · support: 1

representative citing papers

The Invitation Trap: Proactive Availability Backdoor in LLMs via Conversational Induction

cs.CR · 2026-05-30 · unverdicted · novelty 7.0

The paper presents Proactive Availability Backdoor (PAB) attacks on LLMs that achieve a 73.1% effective success rate by proactively inducing users via suggestions in a Five-Factor Model simulation.

HEART-Bench: Do LLM Agents Exhibit Human-like Psychology?

cs.CL · 2026-05-28 · unverdicted · novelty 7.0

HEART-Bench evaluates LLM agents on psychological consistency using 11 Big-Five-grounded characters with 1,000 episodic memories each and 64 DIAMONDS-based decision scenarios, yielding 673 validated MCQs.

ActTraitBench: Quantifying the Knowledge-Decision Gap in Large Language Models via Human-Grounded Behavioral Validation

cs.CL · 2026-05-28 · unverdicted · novelty 7.0

ActTraitBench is a human-grounded benchmark using psychometric-to-behavior mappings and quantile calibration that reveals pervasive knowledge-decision gaps in 14 LLMs, larger in capable models, with CoCA proposed as mitigation.

The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment

cs.CL · 2026-05-08 · unverdicted · novelty 7.0

An AI-agent social platform generated mostly neutral content whose use in fine-tuning reduced model truthfulness comparably to human Reddit data, suggesting limited unique harm but flagging tail risks like secret leaks.

Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models

cs.CL · 2026-04-12 · unverdicted · novelty 7.0

Agreeableness in AI personas reliably predicts sycophantic behavior in 9 of 13 tested language models.

GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing

cs.SI · 2026-05-30 · unverdicted · novelty 6.0

GenPT applies generative projective testing to LLM agents and reports lower directional bias plus greater longitudinal sensitivity than self-report questionnaires.

Evaluation Drift in LLM Personality Induction: Are We Moving the Goalpost?

cs.CL · 2026-05-16 · unverdicted · novelty 6.0

Fine-tuning LLMs on essays reduces variance in IPIP-NEO responses across models but does not raise full five-trait profile accuracy above near-chance levels from unguided text.

A Survey on LLM-based Conversational User Simulation

cs.CL · 2026-04-27 · unverdicted · novelty 6.0

A survey that introduces a taxonomy for LLM-based conversational user simulation, analyzes core techniques and evaluation methods, and identifies open challenges in the field.

Stabilising Generative Models of Attitude Change

cs.AI · 2026-04-02 · unverdicted · novelty 6.0

Researchers rendered cognitive dissonance, self-consistency, and self-perception theories as generative simulations that reproduce classic experimental behavioral patterns after iterative manual stabilization.

Exploring a Gamified Personality Assessment Method through Interaction with LLM Agents Embodying Different Personalities

cs.HC · 2025-07-05 · unverdicted · novelty 6.0

A gamified system with multiple LLM agents of varied personalities gathers interaction data to produce more effective and interpretable Big Five personality assessments than single-context methods.

A Survey on Large Language Model based Autonomous Agents

cs.AI · 2023-08-22 · accept · novelty 6.0

A survey of LLM-based autonomous agents that proposes a unified framework for their construction and reviews applications in social science, natural science, and engineering along with evaluation methods and future directions.

A Survey of Large Language Models for Perception and Measurement of Human Psychology

cs.CY · 2026-05-20 · unverdicted · novelty 5.0

A survey proposing a three-pillar framework to evaluate LLMs as tools for measuring latent psychological constructs and reviewing applications in personality and mental health.

Elder-Sim: A Psychometrically Validated Platform for Personality-Stable Elderly Digital Twins

cs.HC · 2026-03-16 · unverdicted · novelty 5.0

ELDER-SIM builds personality-stable elderly digital twins via LLM orchestration with OCEAN traits, Beck CBT diagrams, long-term memory, and LoRA fine-tuning on CHARLS data, validated by Cronbach's alpha 0.70-0.94 and ICC 0.85-0.96.

The Differential Effects of Agreeableness and Extraversion on Older Adults' Perceptions of Conversational AI Explanations in Assistive Settings

cs.HC · 2026-03-09 · unverdicted · novelty 5.0

High agreeableness in LLM voice assistants increases older adults' empathy perceptions, and real-time explanations outperform history-based ones, but personality does not affect perceived intelligence.

Vibe Check: Understanding the Effects of LLM-Based Conversational Agents' Personality and Alignment on User Perceptions in Goal-Oriented Tasks

cs.HC · 2025-09-11 · unverdicted · novelty 5.0

Medium personality expression in LLM agents yields the most positive user perceptions in goal-oriented tasks, further improved by trait alignment.

Mechanistic Personality Analysis of LLMs Steering Personality via Latent Feature Interventions

cs.AI · 2026-06-27 · unverdicted · novelty 4.0

The paper applies sparse autoencoders to locate and steer latent features for OCEAN personality traits in LLMs while preserving benchmark performance.

The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey

cs.AI · 2024-04-17 · unverdicted · novelty 3.0

A survey of emerging AI agent architectures that organizes single and multi-agent designs around reasoning, planning, tool use, communication, and reflection phases.

Dr. Jekyll and Mr. Hyde: Two Faces of LLMs

cs.CR · 2023-12-06 · unverdicted · novelty 3.0

Impersonating complex misaligned personas via biographies and role-play bypasses safety in ChatGPT, Gemini, and DeepSeek, succeeding on 38-40 out of 40 illicit questions across tested models.

Human Psychometric Questionnaires Mischaracterize LLM Behavior

cs.CL · 2025-09-12

citing papers explorer

The Invitation Trap: Proactive Availability Backdoor in LLMs via Conversational Induction · cs.CR · 2026-05-30 · unverdicted · none · ref 40
HEART-Bench: Do LLM Agents Exhibit Human-like Psychology? · cs.CL · 2026-05-28 · unverdicted · none · ref 36
ActTraitBench: Quantifying the Knowledge-Decision Gap in Large Language Models via Human-Grounded Behavioral Validation · cs.CL · 2026-05-28 · unverdicted · none · ref 5
The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment · cs.CL · 2026-05-08 · unverdicted · none · ref 8
Too Nice to Tell the Truth: Quantifying Agreeableness-Driven Sycophancy in Role-Playing Language Models · cs.CL · 2026-04-12 · unverdicted · none · ref 39
GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing · cs.SI · 2026-05-30 · unverdicted · none · ref 59
Evaluation Drift in LLM Personality Induction: Are We Moving the Goalpost? · cs.CL · 2026-05-16 · unverdicted · none · ref 14
A Survey on LLM-based Conversational User Simulation · cs.CL · 2026-04-27 · unverdicted · none · ref 27
Stabilising Generative Models of Attitude Change · cs.AI · 2026-04-02 · unverdicted · none · ref 6
Exploring a Gamified Personality Assessment Method through Interaction with LLM Agents Embodying Different Personalities · cs.HC · 2025-07-05 · unverdicted · none · ref 112
A Survey on Large Language Model based Autonomous Agents · cs.AI · 2023-08-22 · accept · none · ref 25
A Survey of Large Language Models for Perception and Measurement of Human Psychology · cs.CY · 2026-05-20 · unverdicted · none · ref 104
Elder-Sim: A Psychometrically Validated Platform for Personality-Stable Elderly Digital Twins · cs.HC · 2026-03-16 · unverdicted · none · ref 33
The Differential Effects of Agreeableness and Extraversion on Older Adults' Perceptions of Conversational AI Explanations in Assistive Settings · cs.HC · 2026-03-09 · unverdicted · none · ref 138
Vibe Check: Understanding the Effects of LLM-Based Conversational Agents' Personality and Alignment on User Perceptions in Goal-Oriented Tasks · cs.HC · 2025-09-11 · unverdicted · none · ref 90
Mechanistic Personality Analysis of LLMs Steering Personality via Latent Feature Interventions · cs.AI · 2026-06-27 · unverdicted · none · ref 14
The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and Tool Calling: A Survey · cs.AI · 2024-04-17 · unverdicted · none · ref 22
Dr. Jekyll and Mr. Hyde: Two Faces of LLMs · cs.CR · 2023-12-06 · unverdicted · none · ref 15
Human Psychometric Questionnaires Mischaracterize LLM Behavior · cs.CL · 2025-09-12 · unreviewed · ref 30
