We evaluate both the OCR transcription quality and the speaker tagging accuracy using the benchmark dataset released by the authors

Evaluation. To assess the effectiveness of the proposed pipeline, we conduct a comparative evaluation against IPSA (Frasnelli, Palmero Aprosio · 2024


representative citing papers

Transcription and Recognition of Italian Parliamentary Speeches Using Vision-Language Models

cs.DL · 2026-03-30 · unverdicted · novelty 6.0

A pipeline combining specialized OCR with Vision-Language Models improves transcription quality and speaker identification for Italian parliamentary speeches preserved as scanned documents.


