pith. sign in

arxiv: 1810.03716 · v3 · pith:YYD6WLOHnew · submitted 2018-10-08 · 💻 cs.CV

Saliency Prediction in the Deep Learning Era: Successes, Limitations, and Future Challenges

classification 💻 cs.CV
keywords modelssaliencydeeplargewhataddressedbenchmarksdatasets
0
0 comments X
read the original abstract

Visual saliency models have enjoyed a big leap in performance in recent years, thanks to advances in deep learning and large scale annotated data. Despite enormous effort and huge breakthroughs, however, models still fall short in reaching human-level accuracy. In this work, I explore the landscape of the field emphasizing on new deep saliency models, benchmarks, and datasets. A large number of image and video saliency models are reviewed and compared over two image benchmarks and two large scale video datasets. Further, I identify factors that contribute to the gap between models and humans and discuss remaining issues that need to be addressed to build the next generation of more powerful saliency models. Some specific questions that are addressed include: in what ways current models fail, how to remedy them, what can be learned from cognitive studies of attention, how explicit saliency judgments relate to fixations, how to conduct fair model comparison, and what are the emerging applications of saliency models.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Deep Saliency Models : The Quest For The Loss Function

    cs.CV 2019-07 conditional novelty 6.0

    Varying and combining loss functions in deep visual saliency prediction models produces significant performance gains on fixed architectures that hold across datasets and networks.