SHIELD is a new diverse clinical note dataset paired with distilled small language models that achieve 0.89 span-level precision and 0.88 recall for on-premise PHI de-identification.
Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/uthealth corpus
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
SHIELD: A Diverse Clinical Note Dataset and Distilled Small Language Models for Enterprise-Scale De-identification
SHIELD is a new diverse clinical note dataset paired with distilled small language models that achieve 0.89 span-level precision and 0.88 recall for on-premise PHI de-identification.