SelectTSL is an end-to-end model that uses a Prompt-Guided Selective Attention Module and an IPD enhancer to localize only the target sounds specified by a prompt, estimating their count and directions in complex acoustic scenes.
Swg-former: A sliding-window graph convolutional network for simultaneous spatial-temporal information extraction in sound event localization and detection
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.SD (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
SelectTSL: Prompt-Guided Selective Target Sound Localization in Complex Scenarios