pith. sign in

arxiv: 2606.03028 · v1 · pith:S6OCW2RDnew · submitted 2026-06-02 · 💻 cs.SD

Audio Spotforming via Post-Filtering Using Cross-Array Non-target Estimates

classification 💻 cs.SD
keywords low-rankspeechspotformingapproximationsarraysnon-targettargetarray
0
0 comments X
read the original abstract

Audio spotforming is a technique for extracting target speech from noisy mixtures by utilizing multiple microphone arrays. Conventional methods estimate a shared target speech component from linearly separated signals obtained by each array using low-rank approximations and apply post filtering (PF) based on this estimated low-rank representation. However, owing to the mismatch between low-rank models and the complex structure of speech signals, directly relying on low-rank approximations for PF can degrade the speech extraction performance. In this study, we leverage the observation that non-target components located in the target speech direction from the perspective of one array can be spatially separated when viewed from other arrays. This insight motivates a new spotforming method for efficient post-filter estimation using non-target estimates across arrays instead of relying on low-rank approximations. Experiments demonstrate that the proposed method outperforms conventional spotforming methods.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.