pith. sign in

arxiv: 1904.12769 · v1 · pith:M3EJ6HZBnew · submitted 2019-04-29 · 💻 cs.SD · cs.LG· eess.AS

Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network

classification 💻 cs.SD cs.LGeess.AS
keywords sourcestrackingcrnnlocalizationdetectionmovingrecurrentacoustic
0
0 comments X
read the original abstract

This paper investigates the joint localization, detection, and tracking of sound events using a convolutional recurrent neural network (CRNN). We use a CRNN previously proposed for the localization and detection of stationary sources, and show that the recurrent layers enable the spatial tracking of moving sources when trained with dynamic scenes. The tracking performance of the CRNN is compared with a stand-alone tracking method that combines a multi-source (DOA) estimator and a particle filter. Their respective performance is evaluated in various acoustic conditions such as anechoic and reverberant scenarios, stationary and moving sources at several angular velocities, and with a varying number of overlapping sources. The results show that the CRNN manages to track multiple sources more consistently than the parametric method across acoustic scenarios, but at the cost of higher localization error.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. SelectTSL: Prompt-Guided Selective Target Sound Localization in Complex Scenarios

    cs.SD 2026-07 unverdicted novelty 6.0

    SelectTSL is an end-to-end model using a Prompt-Guided Selective Attention Module and IPD enhancer to localize only prompt-specified target sounds and estimate their count and direction in complex acoustic scenes.