pith. sign in

arxiv: 1710.05703 · v1 · pith:DALZMZ2Gnew · submitted 2017-10-03 · 💻 cs.CV · cs.AI· cs.NI

A Survey on Optical Character Recognition System

classification 💻 cs.CV cs.AIcs.NI
keywords characterrecognitionresearchopticalacademicyearsaimedaims
0
0 comments X
read the original abstract

Optical Character Recognition (OCR) has been a topic of interest for many years. It is defined as the process of digitizing a document image into its constituent characters. Despite decades of intense research, developing OCR with capabilities comparable to that of human still remains an open challenge. Due to this challenging nature, researchers from industry and academic circles have directed their attentions towards Optical Character Recognition. Over the last few years, the number of academic laboratories and companies involved in research on Character Recognition has increased dramatically. This research aims at summarizing the research so far done in the field of OCR. It provides an overview of different aspects of OCR and discusses corresponding proposals aimed at resolving issues of OCR.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning

    cs.AI 2025-09 unverdicted novelty 7.0

    DeFacto trains multimodal models using counterfactual image variants and reinforcement learning rewards to improve both answer accuracy and evidence-answer consistency.

  2. A Catalog of Data Errors

    cs.DB 2026-04 unverdicted novelty 6.0

    A new catalog classifying 35 data error types into missing, incorrect, and redundant categories for tabular data, with definitions and examples to improve data quality management.

  3. Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models

    cs.DL 2026-03 unverdicted novelty 6.0

    A pipeline combining specialized OCR with Vision-Language Models improves transcription quality and speaker identification for Italian parliamentary speeches preserved as scanned documents.

  4. DeFacto: Counterfactual Thinking with Images for Enforcing Evidence-Grounded and Faithful Reasoning

    cs.AI 2025-09 unverdicted novelty 6.0

    DeFacto trains multimodal models with counterfactual image variants and GRPO reinforcement learning to enforce that correct answers are supported by correct visual evidence.