ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents

Deyi Xiong; Jianfeng Li; Jianxiang Peng; Jing Xiao; Linxi Su; Minghui Zhang; Shang Wu; Shaojun Wang; Tianhao Shen; Wei Hu

arxiv: 2407.03884 · v4 · pith:DOMMWXICnew · submitted 2024-07-04 · 💻 cs.CL · cs.AI

ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents

Zhigen Li , Jianxiang Peng , Yanmeng Wang , Yong Cao , Tianhao Shen , Minghui Zhang , Linxi Su , Shang Wu

show 8 more authors

Yihang Wu Yuqian Wang Ye Wang Wei Hu Jianfeng Li Shaojun Wang Jing Xiao Deyi Xiong

This is my paper

classification 💻 cs.CL cs.AI

keywords dialogueagentsmodelsplanningsop-guidedactioncarlochatsop

0 comments

read the original abstract

Dialogue agents powered by Large Language Models (LLMs) show superior performance in various tasks. Despite the better user understanding and human-like responses, their **lack of controllability** remains a key challenge, often leading to unfocused conversations or task failure. To address this, we introduce Standard Operating Procedure (SOP) to regulate dialogue flow. Specifically, we propose **ChatSOP**, a novel SOP-guided Monte Carlo Tree Search (MCTS) planning framework designed to enhance the controllability of LLM-driven dialogue agents. To enable this, we curate a dataset comprising SOP-annotated multi-scenario dialogues, generated using a semi-automated role-playing system with GPT-4o and validated through strict manual quality control. Additionally, we propose a novel method that integrates Chain of Thought reasoning with supervised fine-tuning for SOP prediction and utilizes SOP-guided Monte Carlo Tree Search for optimal action planning during dialogues. Experimental results demonstrate the effectiveness of our method, such as achieving a 27.95% improvement in action accuracy compared to baseline models based on GPT-3.5 and also showing notable gains for open-source models. Dataset and codes are publicly available.

This paper has not been read by Pith yet.

ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents

discussion (0)