2 Pith papers cite this work. Polarity classification is still indexing.
citing papers explorer
- RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents
  RPA-Check is a new multi-stage framework using dimension definition, boolean checklist augmentation, semantic filtering, and LLM-as-judge verification to assess role-playing agents, with tests on a legal training game showing smaller instruction-tuned models can be more consistent than larger ones.
- Beyond "Hallucinations": A Framework for Stable Human-AI Reasoning
  Introduces the Rose-Frame framework to diagnose three reasoning traps in human-AI interactions and proposes human-side interventions to reduce epistemic drift.