PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents

Gernot A. Fink; Sebastian Sudholt

arxiv: 1604.00187 · v3 · pith:RQCGSGR2new · submitted 2016-04-01 · 💻 cs.CV

PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents

Sebastian Sudholt , Gernot A. Fink This is my paper

classification 💻 cs.CV

keywords architectureconvolutionaldeepneuralperformancespottingstatevarious

0 comments

read the original abstract

In recent years, deep convolutional neural networks have achieved state of the art performance in various computer vision task such as classification, detection or segmentation. Due to their outstanding performance, CNNs are more and more used in the field of document image analysis as well. In this work, we present a CNN architecture that is trained with the recently proposed PHOC representation. We show empirically that our CNN architecture is able to outperform state of the art results for various word spotting benchmarks while exhibiting short training and test times.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Pattern Spotting in Historical Documents Using Convolutional Models
cs.CV 2019-06 unverdicted novelty 4.0

RetinaNet multiscale embeddings improve pattern location accuracy and reduce storage needs versus prior methods on the DocExplore dataset, though they fail on pages with multiple query instances.