Low information density is identified as the root cause of NER failures on user-generated content, with the Window-Aware Optimization Module delivering up to 4.5% F1 gains and new SOTA on WNUT2017.
LLMs in biomedicine: a study on clinical named entity recognition
4 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CL 4verdicts
UNVERDICTED 4representative citing papers
Physicians use substantially more risk-focused framing in counseling notes for repeat cesarean than for VBAC among patients clinically eligible for both.
Few-shot prompting with GPT-4.1 achieves an F1 score of 0.65 for extracting and classifying substance use mentions in Spanish clinical texts as part of the ToxHabits shared task.
Fine-tuning and data augmentation improve LLM performance on medical jargon extraction and prioritization from EHR notes, with augmented open-source models sometimes outperforming closed-source ones on 106 annotated notes.
citing papers explorer
-
A Mechanism and Optimization Study on the Impact of Information Density on User-Generated Content Named Entity Recognition
Low information density is identified as the root cause of NER failures on user-generated content, with the Window-Aware Optimization Module delivering up to 4.5% F1 gains and new SOTA on WNUT2017.
-
Implicit Framing in Obstetric Counseling Notes: A Grounded LLM Pipeline on a VBAC-Eligible Cohort
Physicians use substantially more risk-focused framing in counseling notes for repeat cesarean than for VBAC among patients clinically eligible for both.
-
FMI@SU ToxHabits: Evaluating LLMs Performance on Toxic Habit Extraction in Spanish Clinical Texts
Few-shot prompting with GPT-4.1 achieves an F1 score of 0.65 for extracting and classifying substance use mentions in Spanish clinical texts as part of the ToxHabits shared task.
-
Enhancing LLMs for Identifying and Prioritizing Important Medical Jargons from Electronic Health Record Notes Utilizing Data Augmentation
Fine-tuning and data augmentation improve LLM performance on medical jargon extraction and prioritization from EHR notes, with augmented open-source models sometimes outperforming closed-source ones on 106 annotated notes.