Tiger: Time-frequency in- terleaved gain extraction and reconstruction for efficient speech separation,

· 2024 · arXiv 2410.01469

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Position-Aware Target Speaker Extraction for Long-Form Multi-Party Conversations: A Diarization-Free Framework for ASR

cs.SD · 2026-06-28 · unverdicted · novelty 6.0

PATSE is a DOA-guided target speaker extraction system that produces speaker-attributed streams for diarization-free ASR in multi-party conversations.

TF-MoE: Time-Frequency Mixture-of-Experts for Efficient Speech Separation

cs.SD · 2026-06-28 · unverdicted · novelty 5.0 · 2 refs

TF-MoE uses dynamic per-frame and per-mel-band expert selection in time and frequency dimensions to improve speech separation performance at comparable compute cost to prior models.

citing papers explorer

Showing 2 of 2 citing papers.

Position-Aware Target Speaker Extraction for Long-Form Multi-Party Conversations: A Diarization-Free Framework for ASR cs.SD · 2026-06-28 · unverdicted · none · ref 31
PATSE is a DOA-guided target speaker extraction system that produces speaker-attributed streams for diarization-free ASR in multi-party conversations.
TF-MoE: Time-Frequency Mixture-of-Experts for Efficient Speech Separation cs.SD · 2026-06-28 · unverdicted · none · ref 25 · 2 links
TF-MoE uses dynamic per-frame and per-mel-band expert selection in time and frequency dimensions to improve speech separation performance at comparable compute cost to prior models.

Tiger: Time-frequency in- terleaved gain extraction and reconstruction for efficient speech separation,

fields

years

verdicts

representative citing papers

citing papers explorer