Physician oversight reveals high error rates in LLM-generated labels for a clinical benchmark and demonstrates that corrected labels improve both evaluation accuracy and downstream model training.
Ambient Artificial Intelligence Scribes to Alleviate the Burden of Clinical Documentation
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
representative citing papers
MDwAIstScheduler is a low-cost, hidden voice assistant that automatically creates calendar events from spoken commands using Raspberry Pi hardware and cloud LLM processing.
citing papers explorer
-
Scalable Stewardship of an LLM-Assisted Clinical Benchmark with Physician Oversight
Physician oversight reveals high error rates in LLM-generated labels for a clinical benchmark and demonstrates that corrected labels improve both evaluation accuracy and downstream model training.
-
MDwAIstScheduler: A Low-Cost, Voice-Activated Device for Hands-Free Clinical Scheduling
MDwAIstScheduler is a low-cost, hidden voice assistant that automatically creates calendar events from spoken commands using Raspberry Pi hardware and cloud LLM processing.