arXiv preprint arXiv:2510.25440

More than a Moment: Towards Coherent Sequences of Audio Descriptions · arXiv 2510.25440

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

cs.CV · 2026-06-22 · unverdicted · novelty 6.0

READ is the first reinforcement-learning framework for training audio-description generators, using sequence-level rewards for reference match, length, format, and context-aware coherence.

citing papers explorer

Showing 1 of 1 citing paper.

READ More than What You See: Reinforcement Learning for Accurate and Coherent Audio Description Generations · cs.CV · 2026-06-22 · unverdicted · none · ref 11
