What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis

· 1904 · arXiv 1904.01906

2 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

cs.CL · 2026-02-02 · unverdicted · novelty 7.0

Multimodal LLMs process code as images to achieve up to 8x token compression, with visual cues like syntax highlighting aiding tasks and clone detection remaining resilient or even improving under compression.

ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019

cs.CV · 2019-07-01 · unverdicted · novelty 3.0

The RRC-MLT-2019 report describes an expanded multi-lingual scene text challenge with new tasks, a 20k-image real dataset, synthetic data, and competition outcomes from 60 submissions.

citing papers explorer

Showing 2 of 2 citing papers.

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding cs.CL · 2026-02-02 · unverdicted · none · ref 10
ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019 cs.CV · 2019-07-01 · unverdicted · none · ref 23
