Multi-Task Handwritten Document Layout Analysis
read the original abstract
Document Layout Analysis is a fundamental step in Handwritten Text Processing systems, from the extraction of the text lines to the type of zone it belongs to. We present a system based on artificial neural networks which is able to determine not only the baselines of text lines present in the document, but also performs geometric and logic layout analysis of the document. Experiments in three different datasets demonstrate the potential of the method and show competitive results with respect to state-of-the-art methods.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
BADAM: A Public Dataset for Baseline Detection in Arabic-script Manuscripts
BADAM is a public dataset of 400 annotated Arabic manuscript images paired with a fully convolutional network for baseline detection and text line extraction.
-
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction
Survey proposing a taxonomy for document parsing into pipeline-based systems and VLM-driven unified models, reviewing components, metrics, benchmarks, and challenges.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.