How far are LLMs from being our digital twins? a benchmark for persona-based behavior chain simulation

Rui Li, Heming Xia, Xinfeng Yuan, Qingxiu Dong, Lei Sha, Wenjie Li, Zhifang Sui · arXiv 2502.14642

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

BehaviorBench: Modeling Real-World User Decisions from Behavioral Traces

cs.AI · 2026-06-01 · unverdicted · novelty 7.0

BehaviorBench reconstructs 2,000 real wallets into 141k belief and 1.4M trade prediction tasks to test if personalization from history improves model performance over non-personalized baselines.

citing papers explorer

Showing 1 of 1 citing paper after filters.

BehaviorBench: Modeling Real-World User Decisions from Behavioral Traces cs.AI · 2026-06-01 · unverdicted · none · ref 6
BehaviorBench reconstructs 2,000 real wallets into 141k belief and 1.4M trade prediction tasks to test if personalization from history improves model performance over non-personalized baselines.

How far are LLMs from being our digital twins? a benchmark for persona-based behavior chain simulation

fields

years

verdicts

representative citing papers

citing papers explorer