MUSE generates realistic, persona-consistent Chinese user responses across domains via self-evolving profiles, role-reversal fine-tuning, and rubric-guided multi-turn RL, outperforming baselines in utterance and session evaluations.
No attraction
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers:
MUSE: Multi-Domain Chinese User Simulation via Self-Evolving Profiles and Rubric-Guided Alignment