pith. machine review for the scientific record. sign in

arxiv: 1609.01064 · v2 · submitted 2016-09-05 · 💻 cs.CV

Recognition: unknown

A Deep Multi-Level Network for Saliency Prediction

Authors on Pith no claims yet
classification 💻 cs.CV
keywords saliencynetworkpredictionconvolutionalfeaturearchitecturedatasetdeep
0
0 comments X
read the original abstract

This paper presents a novel deep architecture for saliency prediction. Current state of the art models for saliency prediction employ Fully Convolutional networks that perform a non-linear combination of features extracted from the last convolutional layer to predict saliency maps. We propose an architecture which, instead, combines features extracted at different levels of a Convolutional Neural Network (CNN). Our model is composed of three main blocks: a feature extraction CNN, a feature encoding network, that weights low and high level feature maps, and a prior learning network. We compare our solution with state of the art saliency models on two public benchmarks datasets. Results show that our model outperforms under all evaluation metrics on the SALICON dataset, which is currently the largest public dataset for saliency prediction, and achieves competitive results on the MIT300 benchmark.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Physics-Informed Temporal U-Net for High-Fidelity Fluid Interpolation

    physics.flu-dyn 2026-04 unverdicted novelty 5.0

    A Temporal U-Net with perceptual loss and a physics-informed parabolic bridge interpolates sparse fluid observations, cutting MAE to 0.015 from 0.085 while retaining high-frequency turbulent structures.