arXiv preprint arXiv:2305.09212 (2023)

Hu, Y · 2023 · arXiv 2305.09212

1 Pith paper cites this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation

cs.CV · 2026-05-30 · unverdicted · novelty 5.0

HP-VSR-ResFiLM adds a single residual FiLM modulation block conditioned on head pose to a CNN visual encoder, yielding WER of 25.0% on LRS2 and 33.2% on LRS3 under standard training conditions.

citing papers explorer

Showing 1 of 1 citing paper.

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation cs.CV · 2026-05-30 · unverdicted · none · ref 14
