
arxiv: 1811.02447 · v1 · pith:26KQLTOGnew · submitted 2018-11-05 · 💻 cs.CV

Multi-Level Sensor Fusion with Deep Learning

classification 💻 cs.CV
keywords fusion, deep, network, central, centralnet, different, information, learning

In the context of deep learning, this article presents an original deep network, namely CentralNet, for the fusion of information coming from different sensors. This approach is designed to efficiently and automatically balance the trade-off between early and late fusion (i.e. between the fusion of low-level vs high-level information). More specifically, at each level of abstraction (the different levels of deep networks), unimodal representations of the data are fed to a central neural network which combines them into a common embedding. In addition, a multi-objective regularization is also introduced, helping to optimize both the central network and the unimodal networks. Experiments on four multimodal datasets not only show state-of-the-art performance, but also demonstrate that CentralNet can actually choose the best possible fusion strategy for a given problem.
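The fusion scheme described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact equations, so everything here is an assumption made for illustration: the weighted-sum fusion rule, the `tanh` dense layers, the helper names (`dense`, `fuse`, `central_forward`, `multi_objective_loss`), and the weights `alpha_central`, `alphas`, and `betas`. It shows only the general shape of the idea: at each level a central network mixes its own state with the unimodal states, and the training objective adds the unimodal losses to the central loss.

```python
import math


def dense(x, weights, bias):
    """Fully connected layer with tanh activation; `weights` is a list of rows."""
    return [math.tanh(sum(w * v for w, v in zip(row, x)) + b)
            for row, b in zip(weights, bias)]


def fuse(central, unimodal, alpha_central, alphas):
    """Weighted sum of the central state and each unimodal state.

    Assumed fusion rule: alpha_central * central + sum_k alphas[k] * unimodal[k].
    """
    fused = [alpha_central * c for c in central]
    for alpha, hidden in zip(alphas, unimodal):
        fused = [f + alpha * v for f, v in zip(fused, hidden)]
    return fused


def central_forward(modal_states, central_layers, alpha_central, alphas):
    """Run the central network over all levels.

    modal_states[i][k] is the hidden vector of modality k at level i.
    central_layers[i] is the (weights, bias) pair of the central layer at level i.
    """
    width = len(modal_states[0][0])
    central = [0.0] * width  # the central state starts empty
    for level, (weights, bias) in enumerate(central_layers):
        fused = fuse(central, modal_states[level],
                     alpha_central[level], alphas[level])
        central = dense(fused, weights, bias)
    return central


def multi_objective_loss(central_loss, unimodal_losses, betas):
    """Central loss plus weighted unimodal losses (assumed form of the regularizer)."""
    return central_loss + sum(b * l for b, l in zip(betas, unimodal_losses))


# Hypothetical toy setup: two modalities, two levels, hidden width 2.
identity = [[1.0, 0.0], [0.0, 1.0]]
layers = [(identity, [0.0, 0.0]), (identity, [0.0, 0.0])]
states = [
    [[0.2, 0.1], [0.4, 0.3]],  # level 0: modality A, modality B
    [[0.5, 0.6], [0.7, 0.8]],  # level 1: modality A, modality B
]
embedding = central_forward(states, layers,
                            alpha_central=[1.0, 1.0],
                            alphas=[[0.5, 0.5], [0.5, 0.5]])
```

Under these assumptions, setting the `alphas` to zero everywhere except the first level mimics early fusion, and setting them to zero everywhere except the last level mimics late fusion. If the weights are learned, the network can settle anywhere between the two, which is one way to read the abstract's claim that the trade-off is balanced automatically.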



Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals

    cs.LG 2026-05 unverdicted novelty 6.0

    Dywave applies wavelet-based hierarchical decomposition to build dynamic, event-aligned tokens for heterogeneous IoT signals, cutting token length by up to 75% while raising accuracy up to 12% on sequence models.

  2. Dywave: Event-Aligned Dynamic Tokenization for Heterogeneous IoT Sensing Signals

    cs.LG 2026-05 unverdicted novelty 6.0

    Dywave uses wavelet hierarchical decomposition to create event-aligned compact token sequences for heterogeneous IoT signals, yielding up to 12% accuracy gains and 75% shorter inputs on mainstream sequence models acro...