BADAM is a public dataset of 400 annotated Arabic manuscript images paired with a fully convolutional network for baseline detection and text line extraction.
Multi-Task Handwritten Document Layout Analysis
2 Pith papers cite this work. Polarity classification is still indexing.
abstract
Document Layout Analysis is a fundamental step in Handwritten Text Processing systems, from the extraction of the text lines to the type of zone it belongs to. We present a system based on artificial neural networks which is able to determine not only the baselines of text lines present in the document, but also performs geometric and logic layout analysis of the document. Experiments in three different datasets demonstrate the potential of the method and show competitive results with respect to state-of-the-art methods.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
Survey proposing a taxonomy for document parsing into pipeline-based systems and VLM-driven unified models, reviewing components, metrics, benchmarks, and challenges.
citing papers explorer
-
BADAM: A Public Dataset for Baseline Detection in Arabic-script Manuscripts
BADAM is a public dataset of 400 annotated Arabic manuscript images paired with a fully convolutional network for baseline detection and text line extraction.
-
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction
Survey proposing a taxonomy for document parsing into pipeline-based systems and VLM-driven unified models, reviewing components, metrics, benchmarks, and challenges.