ContextNet: Exploring Context and Detail for Semantic Segmentation in Real-time
Modern deep learning architectures produce highly accurate results on many challenging semantic segmentation datasets. State-of-the-art methods are, however, not directly transferable to real-time applications or embedded devices, since naive adaptation of such systems to reduce computational cost (speed, memory and energy) causes a significant drop in accuracy. We propose ContextNet, a new deep neural network architecture which builds on factorized convolution, network compression and pyramid representation to produce competitive semantic segmentation in real-time with low memory requirement. ContextNet combines a deep network branch at low resolution that captures global context information efficiently with a shallow branch that focuses on high-resolution segmentation details. We analyse our network in a thorough ablation study and present results on the Cityscapes dataset, achieving 66.1% accuracy at 18.3 frames per second at full (1024x2048) resolution (41.9 fps with pipelined computations for streamed data).
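As a rough illustration of the figures quoted in the abstract, the sketch below converts the reported frame rates into per-frame time budgets and compares parameter counts for a standard versus a factorized convolution. It assumes that "factorized convolution" means a depthwise separable convolution (a depthwise k x k filter followed by a 1 x 1 pointwise convolution), which the abstract does not spell out. The function names and the example layer size (3 x 3 kernel, 64 input channels, 128 output channels) are hypothetical and chosen only for illustration; biases are ignored.

```python
def frame_time_ms(fps: float) -> float:
    """Average time available per frame, in milliseconds, at a given frame rate."""
    return 1000.0 / fps


def pixels_per_second(height: int, width: int, fps: float) -> float:
    """Number of pixels labelled per second at a given resolution and frame rate."""
    return height * width * fps


def standard_conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a standard k x k convolution (bias ignored)."""
    return k * k * c_in * c_out


def separable_conv_params(k: int, c_in: int, c_out: int) -> int:
    """Weights in a depthwise separable convolution (bias ignored).

    Assumption: this is the kind of factorization meant in the abstract.
    Depthwise part: one k x k filter per input channel.
    Pointwise part: a 1 x 1 convolution mixing c_in channels into c_out.
    """
    return k * k * c_in + c_in * c_out


if __name__ == "__main__":
    # Frame rates and resolution quoted in the abstract.
    for label, fps in (("single frame", 18.3), ("pipelined", 41.9)):
        print(f"{label}: {frame_time_ms(fps):.1f} ms per frame, "
              f"{pixels_per_second(1024, 2048, fps):.3e} pixels/s")

    # Hypothetical layer size, not taken from the paper.
    k, c_in, c_out = 3, 64, 128
    std = standard_conv_params(k, c_in, c_out)
    sep = separable_conv_params(k, c_in, c_out)
    print(f"standard: {std} weights, separable: {sep} weights, "
          f"ratio: {std / sep:.2f}x")
```

The parameter ratio depends on the layer shape, so the printed value applies only to the hypothetical layer above and says nothing about ContextNet's overall size.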
Forward citations
Cited by 2 Pith papers
- Attention-Mamba: A Mamba-Enhanced Multi-Scale Parallel Inference Network for Medical Image Segmentation
  Attention-Mamba uses parallel branches, a Recursive Alignment Module, and Mamba-enhanced attention to report the highest segmentation accuracy on Synapse, ACDC, ISIC-2018, and PH2 with 14.05M parameters and 8.94 GFLOPs.
- Importance-Aware Semantic Segmentation with Efficient Pyramidal Context Network for Navigational Assistant Systems
  Introduces an importance-aware loss and the BiERF-PSPNet extension for semantic segmentation tailored to navigational assistant systems, evaluated on CamVid and Cityscapes.