An efficient encoder-decoder architecture with top-down attention for speech separation

· 2022 · arXiv 2209.15200

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models

eess.AS · 2026-01-09 · unverdicted · novelty 6.0

A hybrid two-stage framework pairs a discriminative front-end for interference suppression with a generative decoder-only LM back-end to improve perceptual quality and speaker consistency in target speaker extraction and speech enhancement.

CodecSep: Prompt-Driven Universal Sound Separation on Neural Audio Codec Latents

cs.SD · 2025-09-15 · unverdicted · novelty 6.0

CodecSep performs prompt-driven universal sound separation directly in neural audio codec latents by combining a frozen DAC backbone with a lightweight FiLM-conditioned Transformer masker driven by CLAP embeddings, yielding efficiency gains over AudioSep.

citing papers explorer

Showing 2 of 2 citing papers.

Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models

eess.AS · 2026-01-09 · unverdicted · none · ref 24

A hybrid two-stage framework pairs a discriminative front-end for interference suppression with a generative decoder-only LM back-end to improve perceptual quality and speaker consistency in target speaker extraction and speech enhancement.

CodecSep: Prompt-Driven Universal Sound Separation on Neural Audio Codec Latents

cs.SD · 2025-09-15 · unverdicted · none · ref 23

CodecSep performs prompt-driven universal sound separation directly in neural audio codec latents by combining a frozen DAC backbone with a lightweight FiLM-conditioned Transformer masker driven by CLAP embeddings, yielding efficiency gains over AudioSep.
