Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios

Andrea Sikora; Dajiang Zhu; Hanqi Jiang; Huaqin Zhao; Jin Lu; Junhao Chen; Lu Zhang; Quanzheng Li; Shaochen Xu; Tianming Liu

arxiv: 2411.14461 · v1 · pith:SNZ3XPZPnew · submitted 2024-11-16 · 💻 cs.CL · cs.AI· cs.CY

Towards Next-Generation Medical Agent: How o1 is Reshaping Decision-Making in Medical Scenarios

Shaochen Xu , Yifan Zhou , Zhengliang Liu , Zihao Wu , Tianyang Zhong , Huaqin Zhao , Yiwei Li , Hanqi Jiang

show 14 more authors

Yi Pan Junhao Chen Jin Lu Wei Zhang Tuo Zhang Lu Zhang Dajiang Zhu Xiang Li Wei Liu Quanzheng Li Andrea Sikora Xiaoming Zhai Zhen Xiang Tianming Liu

This is my paper

classification 💻 cs.CL cs.AIcs.CY

keywords medicaldecision-makingreasoningagentagentsclinicalcomplexreal-time

0 comments

read the original abstract

Artificial Intelligence (AI) has become essential in modern healthcare, with large language models (LLMs) offering promising advances in clinical decision-making. Traditional model-based approaches, including those leveraging in-context demonstrations and those with specialized medical fine-tuning, have demonstrated strong performance in medical language processing but struggle with real-time adaptability, multi-step reasoning, and handling complex medical tasks. Agent-based AI systems address these limitations by incorporating reasoning traces, tool selection based on context, knowledge retrieval, and both short- and long-term memory. These additional features enable the medical AI agent to handle complex medical scenarios where decision-making should be built on real-time interaction with the environment. Therefore, unlike conventional model-based approaches that treat medical queries as isolated questions, medical AI agents approach them as complex tasks and behave more like human doctors. In this paper, we study the choice of the backbone LLM for medical AI agents, which is the foundation for the agent's overall reasoning and action generation. In particular, we consider the emergent o1 model and examine its impact on agents' reasoning, tool-use adaptability, and real-time information retrieval across diverse clinical scenarios, including high-stakes settings such as intensive care units (ICUs). Our findings demonstrate o1's ability to enhance diagnostic accuracy and consistency, paving the way for smarter, more responsive AI tools that support better patient outcomes and decision-making efficacy in clinical practice.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
cs.CL 2024-12 unverdicted novelty 6.0

HuatuoGPT-o1 achieves superior medical complex reasoning by using a verifier to curate reasoning trajectories for fine-tuning and then applying RL with verifier-based rewards.
HiMed: Incentivizing Hindi Reasoning in Medical LLMs
cs.CL 2026-05 unverdicted novelty 5.0

HiMed releases a Hindi medical reasoning corpus and benchmark and shows that training an 8B LLM with decaying scaffolding reward improves Hindi performance and narrows the English-Hindi accuracy gap.