The text local- ization module identifies text areas within images, while the text extraction module serves as an OCR engine, converting pixel-level text into machine-encoded text

METHODS In this study, we employ the PHI pipeline put forward by [7], which comprises three integral components: text localization, text extraction, text analysis (Figure 1)

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Towards Selection of Large Multimodal Models as Engines for Burned-in Protected Health Information Detection in Medical Images

cs.CV · 2025-11-03 · unverdicted · novelty 4.0

Empirical benchmark of GPT-4o, Gemini 2.5 Flash, and Qwen 2.5 7B finds superior OCR performance over EasyOCR but inconsistent gains in overall PHI detection accuracy, with strongest improvements on complex imprint patterns.

citing papers explorer

Showing 1 of 1 citing paper.

Towards Selection of Large Multimodal Models as Engines for Burned-in Protected Health Information Detection in Medical Images cs.CV · 2025-11-03 · unverdicted · none · ref 2
Empirical benchmark of GPT-4o, Gemini 2.5 Flash, and Qwen 2.5 7B finds superior OCR performance over EasyOCR but inconsistent gains in overall PHI detection accuracy, with strongest improvements on complex imprint patterns.

The text local- ization module identifies text areas within images, while the text extraction module serves as an OCR engine, converting pixel-level text into machine-encoded text

fields

years

verdicts

representative citing papers

citing papers explorer