arXiv preprint arXiv:2508.03905 , year=

Haofei Yu, Zhengyang Qi, Yining Zhao, Kolby Nottingham, Keyang Xuan, Bodhisattwa Prasad Majumder, Hao Zhu, Paul Pu Liang, Jiaxuan You · 2025 · arXiv 2508.03905

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

representative citing papers

GRAPHIA: Harnessing Social Graph Data to Enhance LLM-Based Social Simulation

cs.SI · 2025-10-28 · unverdicted · novelty 7.0

Graphia is an LLM post-training framework that uses real social graphs and GNN rewards to improve micro-level interaction prediction and macro-level network property replication in dynamic social simulations.

Reinforcing Human Behavior Simulation via Verbal Feedback

cs.LG · 2026-05-19 · unverdicted · novelty 6.0

DITTO uses RL with verbal feedback to train LLMs for human behavior simulation, reporting 36% average gains over base models and outperforming GPT-5.4 on 6 of 10 SOUL benchmark tasks.

SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution

cs.AI · 2026-04-21 · unverdicted · novelty 6.0

SAVOIR combines prospective expected utility valuation with Shapley values for fair credit assignment in social dialogue RL, achieving SOTA on SOTOPIA where a 7B model matches or exceeds GPT-4o and Claude-3.5-Sonnet.

SocialCoach: Personalized Social Skill Learning with RL-based Agentic Tutoring and Practice

cs.HC · 2026-06-02 · unverdicted · novelty 4.0

SocialCoach combines multi-agent corpus construction, RL-optimized adaptive scheduling in simulation, and immersive LLM tutoring to deliver personalized social-skill training, reporting gains in simulated pathway quality and judge-rated tutoring quality.

citing papers explorer

Showing 4 of 4 citing papers.

GRAPHIA: Harnessing Social Graph Data to Enhance LLM-Based Social Simulation cs.SI · 2025-10-28 · unverdicted · none · ref 15
Graphia is an LLM post-training framework that uses real social graphs and GNN rewards to improve micro-level interaction prediction and macro-level network property replication in dynamic social simulations.
Reinforcing Human Behavior Simulation via Verbal Feedback cs.LG · 2026-05-19 · unverdicted · none · ref 66
DITTO uses RL with verbal feedback to train LLMs for human behavior simulation, reporting 36% average gains over base models and outperforming GPT-5.4 on 6 of 10 SOUL benchmark tasks.
SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution cs.AI · 2026-04-21 · unverdicted · none · ref 15
SAVOIR combines prospective expected utility valuation with Shapley values for fair credit assignment in social dialogue RL, achieving SOTA on SOTOPIA where a 7B model matches or exceeds GPT-4o and Claude-3.5-Sonnet.
SocialCoach: Personalized Social Skill Learning with RL-based Agentic Tutoring and Practice cs.HC · 2026-06-02 · unverdicted · none · ref 45
SocialCoach combines multi-agent corpus construction, RL-optimized adaptive scheduling in simulation, and immersive LLM tutoring to deliver personalized social-skill training, reporting gains in simulated pathway quality and judge-rated tutoring quality.

arXiv preprint arXiv:2508.03905 , year=

fields

years

verdicts

representative citing papers

citing papers explorer