V oxCeleb2: Deep speaker recognition,

· 2018

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

Ring Mixing with Auxiliary Signal-to-Consistency-Error Ratio Loss for Unsupervised Denoising in Speech Separation

eess.AS · 2026-04-09 · unverdicted · novelty 7.0

Ring mixing and SCER loss break symmetry in noisy speech separation training, allowing models to learn denoising from noisy mixtures alone and halve residual noise on benchmarks.

LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition

eess.AS · 2026-04-30 · unverdicted · novelty 6.0

LRS-VoxMM is a new in-the-wild AVSR benchmark that is harder than LRS3 and demonstrates increasing value of visual information under acoustic degradation.

UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition

eess.AS · 2026-04-28 · unverdicted · novelty 5.0

Feeding noisy and enhanced speech together into a speaker encoder with EMA adaptation from clean pre-training improves recognition accuracy under noise.

citing papers explorer

Showing 3 of 3 citing papers.

Ring Mixing with Auxiliary Signal-to-Consistency-Error Ratio Loss for Unsupervised Denoising in Speech Separation eess.AS · 2026-04-09 · unverdicted · none · ref 28
Ring mixing and SCER loss break symmetry in noisy speech separation training, allowing models to learn denoising from noisy mixtures alone and halve residual noise on benchmarks.
LRS-VoxMM: A benchmark for in-the-wild audio-visual speech recognition eess.AS · 2026-04-30 · unverdicted · none · ref 20
LRS-VoxMM is a new in-the-wild AVSR benchmark that is harder than LRS3 and demonstrates increasing value of visual information under acoustic degradation.
UNet-Based Fusion and Exponential Moving Average Adaptation for Noise-Robust Speaker Recognition eess.AS · 2026-04-28 · unverdicted · none · ref 12
Feeding noisy and enhanced speech together into a speaker encoder with EMA adaptation from clean pre-training improves recognition accuracy under noise.

V oxCeleb2: Deep speaker recognition,

fields

years

verdicts

representative citing papers

citing papers explorer