A Knowledge-Driven Approach to Target Speech Extraction in the Presence of Background Sound Effects for Cinematic Audio Source Separation (CASS)

· 2026 · eess.AS · arXiv 2604.27403

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We propose a knowledge-driven approach to speech target extraction in the presence of background sound effects already recorded in cinematic audio. The specific knowledge sources studied are manners of articulation that are detected in speech frames and adopted to form a knowledge vector as a part of features to enhance speech separation and target speech extraction because some short speech segments are often difficult to separate from mixed background sounds. Testing on the recent Sound Demixing Challenge data for cinematic audio source separation (CASS) shows that utilizing articulator-aware knowledge sources produces better separation results than those obtained without using any knowledge, especially for speech segments buried in unspecified background sound events.

representative citing papers

A Knowledge-Driven Approach to Target Speech Extraction in the Presence of Background Sound Effects for Cinematic Audio Source Separation (CASS)

eess.AS · 2026-04-30 · unverdicted · novelty 4.0

Detecting manners of articulation and adding them as knowledge features improves target speech extraction in cinematic audio with background sounds.

citing papers explorer

Showing 1 of 1 citing paper.

A Knowledge-Driven Approach to Target Speech Extraction in the Presence of Background Sound Effects for Cinematic Audio Source Separation (CASS) eess.AS · 2026-04-30 · unverdicted · none · ref 2 · internal anchor
Detecting manners of articulation and adding them as knowledge features improves target speech extraction in cinematic audio with background sounds.

A Knowledge-Driven Approach to Target Speech Extraction in the Presence of Background Sound Effects for Cinematic Audio Source Separation (CASS)

fields

years

verdicts

representative citing papers

citing papers explorer