Multi-Level Sensor Fusion with Deep Learning
read the original abstract
In the context of deep learning, this article presents an original deep network, namely CentralNet, for the fusion of information coming from different sensors. This approach is designed to efficiently and automatically balance the trade-off between early and late fusion (i.e. between the fusion of low-level vs high-level information). More specifically, at each level of abstraction-the different levels of deep networks-uni-modal representations of the data are fed to a central neural network which combines them into a common embedding. In addition, a multi-objective regularization is also introduced, helping to both optimize the central network and the unimodal networks. Experiments on four multimodal datasets not only show state-of-the-art performance, but also demonstrate that CentralNet can actually choose the best possible fusion strategy for a given problem.
This paper has not been read by Pith yet.
Forward citations
Cited by 2 Pith papers
-
Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals
Dywave applies wavelet-based hierarchical decomposition to build dynamic, event-aligned tokens for heterogeneous IoT signals, cutting token length by up to 75% while raising accuracy up to 12% on sequence models.
-
Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals
Dywave uses wavelet hierarchical decomposition to create event-aligned compact token sequences for heterogeneous IoT signals, yielding up to 12% accuracy gains and 75% shorter inputs on mainstream sequence models acro...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.