TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Baoguang Shi; Minghui Liao; Wenyu Liu; Xiang Bai; Xinggang Wang

arxiv: 1611.06779 · v1 · pith:XVAYCRT5new · submitted 2016-11-21 · 💻 cs.CV

TextBoxes: A Fast Text Detector with a Single Deep Neural Network

Minghui Liao , Baoguang Shi , Xiang Bai , Xinggang Wang , Wenyu Liu This is my paper

classification 💻 cs.CV

keywords texttextboxesfastaccuracydetectorend-to-endnetworkoutperforms

0 comments

read the original abstract

This paper presents an end-to-end trainable fast scene text detector, named TextBoxes, which detects scene text with both high accuracy and efficiency in a single network forward pass, involving no post-process except for a standard non-maximum suppression. TextBoxes outperforms competing methods in terms of text localization accuracy and is much faster, taking only 0.09s per image in a fast implementation. Furthermore, combined with a text recognizer, TextBoxes significantly outperforms state-of-the-art approaches on word spotting and end-to-end text recognition tasks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Multitask Network for Localization and Recognition of Text in Images
cs.CL 2019-06 unverdicted novelty 6.0

Presents an end-to-end multitask CNN with FPN, dynamic RoI pooling, and convolutional attention for simultaneous lexicon-free text localization and recognition in complex images.