CandorMD: An AI-Assisted Audio Simulation and Feedback System for Training Clinicians for Medical Error Disclosure
Pith reviewed 2026-05-21 04:04 UTC · model grok-4.3
The pith
CandorMD offers AI simulations for clinicians to practice disclosing medical errors with real-time feedback.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors designed CandorMD as an AI-assisted audio simulation and feedback system that supplies real-time practice, actionable feedback, and diverse environments tailored to individual learning needs, after gathering input through stakeholder interviews on existing disclosure training shortfalls.
What carries the argument
CandorMD, the AI-assisted audio simulation system that supplies real-time practice and personalized feedback for medical error disclosure conversations.
Load-bearing premise
That interviews with stakeholders alone confirm the AI simulation will improve disclosure skills without any direct tests of learning outcomes.
What would settle it
A controlled comparison of how often clinicians avoid error disclosures or how patients rate the conversations, measured before and after training with CandorMD versus standard methods.
Figures
read the original abstract
Clinicians are expected to disclose harmful medical errors to patients and families in line with ethical, regulatory, and patient care standards, yet these conversations remain challenging because of their emotional complexity and limited training opportunities. Most physicians still learn primarily through lectures and observation, while static video tools-though available-are underused, lack adaptability across specialties, and deliver delayed, generic feedback. These gaps restrict skill development, reduce self-efficacy, and contribute to avoidance of disclosure conversations, ultimately compromising patient care and eroding trust. To address these needs, we designed CandorMD -- an AI-assisted simulation system that provides real-time practice, actionable feedback, and diverse practice environments tailored to individual learning needs. We conducted semi-structured interviews with physicians, risk managers, patient advocates, and communication experts to understand current practices, identify gaps, and collect feedback on CandorMD. Based on these insights, we present findings and design recommendations for the future of AI-supported medical communication training.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents the design of CandorMD, an AI-assisted audio simulation and feedback system intended to train clinicians in disclosing harmful medical errors. It identifies limitations in current approaches (lectures, observation, static videos) and describes how the system supplies real-time practice, actionable feedback, and tailored environments. The design is grounded in semi-structured interviews with physicians, risk managers, patient advocates, and communication experts, from which the authors derive findings and design recommendations for AI-supported medical communication training.
Significance. If the interview-derived design recommendations prove robust, the work could usefully inform development of adaptive simulation tools for high-stakes, emotionally complex medical conversations. The qualitative stakeholder input is a clear strength, supplying concrete needs and gaps that future systems can target. Because the manuscript advances no efficacy claims or outcome measures, its contribution is primarily conceptual and design-oriented rather than demonstrative of improved clinician performance.
minor comments (3)
- [Abstract] Abstract: the claim that CandorMD 'provides real-time practice, actionable feedback, and diverse practice environments' is presented without any accompanying description of the underlying AI models, speech recognition pipeline, or feedback-generation logic; adding a brief technical overview in the system-design section would clarify feasibility.
- [Methods / Interview Study] Interview methodology: the manuscript does not report the number of participants, recruitment strategy, interview protocol, or analysis method (e.g., thematic analysis steps); these details are needed to let readers evaluate the strength of the resulting design recommendations.
- [System Design] Figure or system diagram: if a visual of the CandorMD interface or feedback loop exists, ensure it is referenced explicitly in the text and includes labels for the AI components so the 'actionable feedback' mechanism is traceable.
Simulated Author's Rebuttal
We thank the referee for their positive and constructive review of our manuscript on CandorMD. We appreciate the recognition that the qualitative stakeholder input is a clear strength and that the work makes a conceptual and design-oriented contribution. The recommendation for minor revision is noted, and we are prepared to incorporate any specific suggestions. No major comments were provided in the report, so we have no point-by-point revisions to detail at this stage.
Circularity Check
No significant circularity; derivation is self-contained design work
full rationale
The paper presents a system-design and qualitative interview study. Its central claim is that CandorMD was designed to supply real-time practice, actionable feedback, and tailored environments, with stakeholder interviews used to surface needs, gaps, and design recommendations. No equations, fitted parameters, predictions, or self-referential derivations appear. The argument does not reduce any result to its own inputs by construction, nor does it rely on load-bearing self-citations or uniqueness theorems. The work is descriptive and interview-driven; therefore the derivation chain is independent of the target claims and receives score 0.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Ehsan Ahsani-Estahbanati, Vladimir Sergeevich Gordeev, and Leila Doshmangir
-
[2]
https://doi.org/10.3389/fmed.2022.875426
Interventions to reduce the incidence of medical error and its financial bur- den in health care systems: A systematic review of systematic reviews.Frontiers in Medicine9 (July 2022). https://doi.org/10.3389/fmed.2022.875426
-
[3]
Saleema Amershi, Dan Weld, Mihaela Vorvoreanu, Adam Fourney, Besmira Nushi, Penny Collisson, Jina Suh, Shamsi Iqbal, Paul N Bennett, Kori Inkpen, et al. 2019. Guidelines for human-AI interaction. InCHI
work page 2019
-
[4]
Walter F Baile, Robert Buckman, Renato Lenzi, Gary Glober, Estela A Beale, and Andrzej P Kudelka. 2000. SPIKES—a six-step protocol for delivering bad news: application to the patient with cancer. , 302–311 pages
work page 2000
-
[5]
Jeanette R Bauchat, Michael Seropian, and Pamela R Jeffries. 2016. Communi- cation and empathy in the patient-centered care model—why simulation-based training is not optional.Clinical Simulation in Nursing12, 8 (2016), 356–359
work page 2016
-
[6]
Sigall K. Bell, Donald W. Moorman, and Tom Delbanco. 2010. Improving the Patient, Family, and Clinician Experience After Harmful Events: The “When Things Go Wrong” Curriculum.Academic Medicine85, 6 (June 2010), 1010–1017. https://doi.org/10.1097/acm.0b013e3181dbedd7
-
[7]
On the Opportunities and Risks of Foundation Models
Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Dem- szky, Chris Donahue, Moussa Doumbouya, Esin Durmus, St...
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[8]
Centers for Medicare & Medicaid Services. 2024. Medicare and Medicaid Programs and the Children’s Health Insurance Program; Hospital Inpatient Prospective Payment Systems for Acute Care Hospitals and the Long-Term Care Hospital Prospective Payment System and Policy Changes and Fiscal Year 2025 Rates; Quality Programs Requirements; and Other Policy Changes...
work page 2024
-
[9]
Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde De Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, et al. 2021. Evaluating large language models trained on code. arXiv preprint arXiv:2107.03374(2021)
work page internal anchor Pith review Pith/arXiv arXiv 2021
-
[10]
Elizabeth Clark, Anne Spencer Ross, Chenhao Tan, Yangfeng Ji, and Noah A Smith. 2018. Creative writing with a machine in the loop: Case studies on slogans and stories. InIUI
work page 2018
-
[11]
K Anders Ericsson and Herbert A Simon. 1998. How to study thinking in everyday life: Contrasting think-aloud protocols with descriptions and explanations of thinking.Mind, Culture, and Activity5, 3 (1998), 178–186
work page 1998
-
[12]
Thomas H Gallagher, Richard C Boothman, Leilani Schweitzer, and Evan M Benjamin. 2020. Making communication and resolution programmes mission critical in healthcare organisations.BMJ Quality & Safety29, 11 (May 2020), 875–878. https://doi.org/10.1136/bmjqs-2020-010855
-
[13]
Thomas H Gallagher, Jane M Garbutt, Amy D Waterman, David R Flum, Eric B Larson, Brian M Waterman, W Claiborne Dunagan, Victoria J Fraser, and Wendy Levinson. 2006. Choosing your words carefully: how physicians would disclose harmful medical errors to patients.Archives of internal medicine166, 15 (2006), 1585–1593
work page 2006
-
[14]
Gallagher, David Studdert, and Wendy Levinson
Thomas H. Gallagher, David Studdert, and Wendy Levinson. 2007. Disclosing Harmful Medical Errors to Patients.New England Journal of Medicine356, 26 (June 2007), 2713–2719. https://doi.org/10.1056/nejmra070568
-
[15]
Thomas H Gallagher, Amy D Waterman, Alison G Ebers, Victoria J Fraser, and Wendy Levinson. 2003. Patients’ and physicians’ attitudes regarding the disclo- sure of medical errors.Jama289, 8 (2003), 1001–1007
work page 2003
-
[16]
Andreas Gegenfurtner and Marko Seppänen. 2013. Transfer of expertise: An eye tracking and think aloud study using dynamic medical visualizations.Computers & Education63 (2013), 393–403
work page 2013
-
[17]
2017.Many Hands Make Light Work: Crowdsourced Ratings of Medical Student OSCE Performance
Mark Grichanik. 2017.Many Hands Make Light Work: Crowdsourced Ratings of Medical Student OSCE Performance. Ph. D. Dissertation. University of South Florida, Tampa, FL. https://digitalcommons.usf.edu/etd/6706/
work page 2017
-
[18]
Emily Grossniklaus, Angelo D’Addario, Ann King, Thomas H Gallagher, Kathleen Mazor, and Andrew A White. 2025. Error disclosure: what residents say and what patients find effective.Frontiers in Health Services5 (2025), 1577092
work page 2025
-
[19]
Emily Grossniklaus, Ann King, Angelo D’Addario, Karen Berg Brigham, Thomas Gallagher, Thea Musselman, Kendra Hester, Kathleen Mazor, and Andrew A White. 2025. Incorporating the video communication assessment for error dis- closure in residency curricula: a mixed methods study of faculty perceptions. Frontiers in Health Services5 (2025), 1503922
work page 2025
-
[20]
Emily Grossniklaus, Ann M. King, Angelo E. D’Addario, Karen Berg Brigham, Thomas H. Gallagher, Thea G. Musselman, Kendra Hester, Kathleen M. Mazor, and Andrew A. White. 2025. Incorporating the video communication assessment for error disclosure in residency curricula: a mixed methods study of faculty perceptions.Frontiers in Health Services5 (Aug. 2025). ...
-
[21]
John Hattie and Helen Timperley. 2007. The power of feedback.Review of educational research77, 1 (2007), 81–112
work page 2007
-
[22]
Kaufman, Richard Boothman, Susan Anderson, Kath- leen Welch, Sanjay Saint, and Mary A.M
Allen Kachalia, Samuel R. Kaufman, Richard Boothman, Susan Anderson, Kath- leen Welch, Sanjay Saint, and Mary A.M. Rogers. 2010. Liability Claims and Costs Before and After Implementation of a Medical Error Disclosure Program.Annals of Internal Medicine153, 4 (Aug. 2010), 213–221. https://doi.org/10.7326/0003- 4819-153-4-201008170-00002
-
[23]
Allen Kachalia, Kenneth Sands, Melinda Van Niel, Suzanne Dodson, Stephanie Roche, Victor Novack, Maayan Yitshak-Sade, Patricia Folcarelli, Evan M. Ben- jamin, Alan C. Woodward, and Michelle M. Mello. 2018. Effects Of A Communication-And-Resolution Program On Hospitals’ Malpractice Claims And Costs.Health Affairs37, 11 (Nov. 2018), 1836–1844. https://doi.o...
-
[24]
Lauris Christopher Kaldjian. 2021. Communication about medical errors.Patient education and counseling104, 5 (2021), 989–993
work page 2021
-
[25]
Lauris C Kaldjian, Elizabeth W Jones, Barry J Wu, Valerie L Forman-Hoffman, Benjamin H Levi, and Gary E Rosenthal. 2007. Disclosing medical errors to patients: attitudes and practices of physicians and trainees.Journal of general internal medicine22, 7 (2007), 988–996
work page 2007
-
[26]
2021.CLER National Report of Findings 2021
NJ Koh, R Wagner, CM Kuhn, JPT Co, and KB Weiss. 2021.CLER National Report of Findings 2021. Accreditation Council for Graduate Medical Education. https://doi.org/10.35425/acgme.0008
-
[27]
Kononowicz, Nabil Zary, Samuel Edelbring, Jorge Corral, and Inga Hege
Andrzej A. Kononowicz, Nabil Zary, Samuel Edelbring, Jorge Corral, and Inga Hege. 2019. Virtual Patient Simulations in Health Professions Education: Sys- tematic Review and Meta-Analysis by the Digital Health Education Collab- oration.Journal of Medical Internet Research21, 7 (2019), e14676. https: //doi.org/10.2196/14676
-
[28]
Liliana Laranjo, Adam G Dunn, Huong Ly Tong, Ahmet Baki Kocaballi, Jessica Chen, Rabia Bashir, Didi Surian, Blanca Gallego, Farah Magrabi, Annie YS Lau, et al. 2018. Conversational agents in healthcare: a systematic review.Journal of the American Medical Informatics Association25, 9 (2018), 1248–1258
work page 2018
- [29]
- [30]
-
[31]
Rose Luckin, Wayne Holmes, Mark Griffiths, and Laurie B. Forcier
-
[32]
Intelligence Unleashed: An argument for AI in Educa- tion. https://www.pearson.com/corporate/about-pearson/what-we- do/innovation/smarter-digital-tools/intelligence-unleashed.html
-
[33]
Kathleen M Mazor, Steven R Simon, and Jerry H Gurwitz. 2004. Communicating with patients about medical errors: a review of the literature.Archives of internal medicine164, 15 (2004), 1690–1697
work page 2004
-
[34]
Kathleen M Mazor, Steven R Simon, Robert A Yood, Brian C Martinson, Margaret J Gunter, George W Reed, and Jerry H Gurwitz. 2004. Health plan members’ views about disclosure of medical errors.Annals of internal medicine140, 6 (2004), 409–418
work page 2004
-
[35]
Kathleen M. Mazor, Steven R. Simon, Robert A. Yood, Brian C. Martinson, Mar- garet J. Gunter, George W. Reed, and Jerry H. Gurwitz. 2004. Health Plan Members’ Views about Disclosure of Medical Errors.Annals of Internal Medicine140, 6 (March 2004), 409–418. https://doi.org/10.7326/0003-4819-140-6-200403160- 00006
-
[36]
Karen A. McDonough, Andrew A. White, Peggy Soule Odegard, and Sarah E. Shannon. 2017. Interprofessional Error Disclosure Training for Medical, Nursing, Pharmacy, Dental, and Physician Assistant Students.MedEdPORTAL(July 2017). https://doi.org/10.15766/mep_2374-8265.10606
-
[37]
Ivan Dieb Miziara and Carmen Silvia Molleis Galego Miziara. 2025. Recognition of medical error: It is not too late for an open disclosure–a narrative review. Clinics80 (2025), 100622
work page 2025
-
[38]
Jakob Nielsen. 1994.Usability engineering. Morgan Kaufmann
work page 1994
-
[39]
Rikke G. Olsen, Malthe F. Genét, Lars Konge, and Flemming Bjerrum. 2022. Crowdsourced Assessment of Surgical Skills: A Systematic Review.American Journal of Surgery224, 5 (2022), 1229–1237. https://doi.org/10.1016/j.amjsurg. 2022.07.008
-
[40]
Grace L. Paley, Rebecca Grove, T. C. Sekhar, and et al. 2021. Crowdsourced Assessment of Surgical Skill Proficiency in Cataract Surgery.Journal of Surgical Education78, 4 (2021), 1077–1088. https://doi.org/10.1016/j.jsurg.2021.02.024
-
[41]
Noelle Junod Perron, Martine Louis-Simonet, Bernard Cerutti, Eva Pfarrwaller, Johanna Sommer, and Mathieu Nendaz. 2016. Feedback in formative OSCEs: com- parison between direct observation and video-based formats.Medical education online21, 1 (2016), 32160
work page 2016
-
[42]
2021.Artificial intelligence in education
Ido Roll, Danielle McNamara, Sergey Sosnovsky, Rose Luckin, and Vania Dim- itrova. 2021.Artificial intelligence in education. Springer
work page 2021
-
[43]
Ido Roll and Ruth Wylie. 2016. Evolution and revolution in artificial intelligence in education.International journal of artificial intelligence in education26, 2 (2016), 582–599
work page 2016
-
[44]
Omar Shaikh, Valentino Emil Chai, Michele Gelfand, Diyi Yang, and Michael S Bernstein. 2024. Rehearsal: Simulating conflict to teach conflict resolution. In Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems. 1–20
work page 2024
-
[45]
Ashish Sharma, Kevin Rushton, Inna Lin, David Wadden, Khendra Lucas, Adam Miner, Theresa Nguyen, and Tim Althoff. 2023. Cognitive Reframing of Negative Thoughts through Human-Language Model Interaction. InACL
work page 2023
-
[46]
Wong, Elisa Hollenberg, and Wendy Levinson
Lynfa Stroud, Brian M. Wong, Elisa Hollenberg, and Wendy Levinson. 2013. Teaching Medical Error Disclosure to Physicians-in-Training: A Scoping Review. Academic Medicine88, 6 (June 2013), 884–892. https://doi.org/10.1097/acm. 0b013e31828f898f
work page doi:10.1097/acm 2013
-
[47]
Antonella Surbone. 2012. Onclogists’ difficulties in facing and disclosing med- ical errors: suggestions for the clinic.American Society of Clinical Oncology Educational Book32, 1 (2012), e24–e27
work page 2012
-
[48]
Eric J Topol. 2019. High-performance medicine: the convergence of human and artificial intelligence.Nature medicine25, 1 (2019), 44–56
work page 2019
-
[49]
Marc Triola, Hal Feldman, Adina L. Kalet, Susan Zabar, Elaine K. Kachur, Car- olyn Gillespie, Mary Anderson, Craig Griesser, and Michael Lipkin. 2006. A Randomized Trial of Teaching Clinical Skills Using Virtual and Live Stan- dardized Patients.Journal of General Internal Medicine21, 5 (2006), 424–429. https://doi.org/10.1111/j.1525-1497.2006.00421.x
-
[50]
Maarten W Van Someren, Yvonne F Barnard, Jacobijn AC Sandberg, et al. 1994. The think aloud method: a practical approach to modelling cognitive processes. London: AcademicPress11, 6 (1994)
work page 1994
-
[51]
Yinying Wang. 2021. Artificial intelligence in educational leadership: a symbiotic role of human-artificial intelligence decision-making.Journal of Educational Administration59, 3 (2021), 256–270
work page 2021
-
[52]
Andrew A. White, Douglas M. Brock, Patricia I. McCotter, Sarah E. Shannon, and Thomas H. Gallagher. 2017. Implementing an error disclosure coaching model: A multicenter case study.Journal of Healthcare Risk Management36, 3 (Jan. 2017), 34–45. https://doi.org/10.1002/jhrm.21260
-
[53]
Andrew A White and Thomas H Gallagher. 2011. After the apology: Coping and recovery after errors.AMA Journal of Ethics13, 9 (2011), 593–600
work page 2011
-
[54]
Andrew A. White, Thomas H. Gallagher, Melissa J. Krauss, Jane Garbutt, Amy D. Waterman, W Claiborne Dunagan, Victoria J. Fraser, Wendy Levinson, and Eric B. Larson. 2008. The Attitudes and Experiences of Trainees Regarding Disclosing Medical Errors to Patients.Academic Medicine83, 3 (March 2008), 250–256. https://doi.org/10.1097/acm.0b013e3181636e96
-
[55]
Andrew A White, Ann M King, Angelo E D’Addario, Karen Berg Brigham, Joel M Bradley, Thomas H Gallagher, and Kathleen M Mazor. 2024. Crowdsourced feedback to improve resident physician error disclosure skills: a randomized clinical trial.JAMA Network Open7, 8 (2024), e2425923
work page 2024
-
[57]
Effects of Practicing With and Obtaining Crowdsourced Feedback From the Video-Based Communication Assessment App on Resident Physicians’ Adverse Event Communication Skills: Pre-post Trial.JMIR Medical Education8, 4 (Oct. 2022), e40758. https://doi.org/10.2196/40758
-
[58]
Andrew A White, Ann M King, Angelo E D’Addario, Karen Berg Brigham, Suzanne Dintzis, Emily E Fay, Thomas H Gallagher, and Kathleen M Mazor
-
[59]
Video-based communication assessment of physician error disclosure skills by crowdsourced laypeople and patient advocates who experienced medical harm: reliability assessment with generalizability theory.JMIR Medical Education 8, 2 (2022), e30988
work page 2022
-
[60]
Bryan Wilder, Eric Horvitz, and Ece Kamar. 2021. Learning to complement humans. InProceedings of the Twenty-Ninth International Joint Conference on Ar- tificial Intelligence(Yokohama, Yokohama, Japan)(IJCAI’20). Article 212, 8 pages
work page 2021
-
[61]
Albert W Wu. 1999. Handling hospital errors: is disclosure the best defense? Annals of Internal Medicine131, 12 (1999), 970–972
work page 1999
-
[62]
Albert W Wu, Dennis J Boyle, Gordon Wallace, and Kathleen M Mazor. 2013. Disclosure of adverse events in the United States and Canada: an update, and a proposed framework for improvement.Journal of Public Health Research2, 3 (2013), jphr–2013
work page 2013
-
[63]
Han Yang, Xiang Xiao, Xuanyu Wu, Xiaoxu Fu, Quanyu Du, Yan Luo, Bin Li, Jinhao Zeng, and Yi Zhang. 2023. Virtual Standardized Patients Versus Tradi- tional Academic Training for Improving Clinical Competence Among Traditional Chinese Medicine Students: Prospective Randomized Controlled Trial.Journal of Medical Internet Research25 (2023), e43763. https://d...
-
[64]
Qian Yang, Aaron Steinfeld, Carolyn Rosé, and John Zimmerman. 2020. Re- examining whether, why, and how human-AI interaction is uniquely difficult to design. InProceedings of the 2020 chi conference on human factors in computing systems. 1–13
work page 2020
-
[65]
Bo Young Yoon, Ikseon Choi, Seokjin Choi, Tae-Hee Kim, Hyerin Roh, By- oung Doo Rhee, and Jong-Tae Lee. 2016. Using standardized patients versus video cases for representing clinical problems in problem-based learning.Korean journal of medical education28, 2 (2016), 169
work page 2016
-
[66]
N Zary, G Johnson, and U Fors. 2009. Web-based virtual patients in dentistry: factors influencing the use of cases in the Web-SP system.European Journal of Dental Education13, 1 (2009), 2–9. 16 7 Appendices A Screenshot of CandorMD Figure 3: Landing page of CandorMD. Figure 4: Page to choose a pre-determined case or describe a bespoke case. Users can use ...
work page 2009
-
[67]
The'stages'field must be a string
- [68]
- [69]
-
[70]
Maximum two stages allowed
-
[72]
START and END are special control stages and cannot exist with other stages
-
[73]
START and END cannot exist together
-
[74]
If message doesn't fit any stage , use the most applicable one or " IS " if uncertain
-
[75]
When multiple stages apply , prioritize the dominant theme in the message feedback : frameworks : IS : | Feedback Framework for Incident Acknowledgement & Explanation 21 Rate how well the physician from 0 -5: - Explains what happened clearly and specifically - Discloses errors transparently - Explains system factors and team involvement - Addresses missin...
-
[76]
Keep the evaluation brief and focused
-
[77]
Include both positive and constructive feedback
-
[78]
Base the evaluation on the provided feedback framework
-
[79]
Do not include any additional fields
-
[80]
FOCUS ONLY ON THE MOST RECENT PHYSICIAN MESSAGE for your feedback and phrase identification
-
[81]
Only identify specific phrases from the CURRENT physician message that support your feedback Feedback Structure : Overall Score : Provide total score out of 5 for each criterion Strengths (1 -2 bullet points ) : Highlight what the physician did particularly well in their most recent message Priority Improvement Areas : Provide 1 -2 key areas for improveme...
-
[82]
IS - Information Sharing / Incident Acknowledgement & Explanation
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.