OctoSense supplies a large multimodal robotics dataset and a late-fusion masked autoencoder that runs fast and outperforms image-only models on optical flow, depth, segmentation, and ego-motion tasks while remaining robust under sensor degradation.
arXiv preprint arXiv:2509.25146 (2025)
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it