pith. sign in

arxiv: 1806.08852 · v3 · pith:HDUAMNBSnew · submitted 2018-06-22 · 💻 cs.CV

Multi-Task Handwritten Document Layout Analysis

classification 💻 cs.CV
keywords documentanalysislayouttexthandwrittenlinesableartificial
0
0 comments X
read the original abstract

Document Layout Analysis is a fundamental step in Handwritten Text Processing systems, from the extraction of the text lines to the type of zone it belongs to. We present a system based on artificial neural networks which is able to determine not only the baselines of text lines present in the document, but also performs geometric and logic layout analysis of the document. Experiments in three different datasets demonstrate the potential of the method and show competitive results with respect to state-of-the-art methods.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. BADAM: A Public Dataset for Baseline Detection in Arabic-script Manuscripts

    cs.CV 2019-07 unverdicted novelty 6.0

    BADAM is a public dataset of 400 annotated Arabic manuscript images paired with a fully convolutional network for baseline detection and text line extraction.

  2. Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction

    cs.MM 2024-10 unverdicted novelty 3.0

    Survey proposing a taxonomy for document parsing into pipeline-based systems and VLM-driven unified models, reviewing components, metrics, benchmarks, and challenges.