arxiv: 1505.03581 · v1 · pith:TDFT6O6Ynew · submitted 2015-05-14 · 💻 cs.CV

CAT2000: A Large Scale Fixation Dataset for Boosting Saliency Research

Ali Borji, Laurent Itti

classification 💻 cs.CV

keywords models · saliency · dataset · images · been · large · modeling · progress

Saliency modeling has been an active research area in computer vision for about two decades. Existing state-of-the-art models perform very well in predicting where people look in natural scenes. There is, however, a risk that these models have been overfitting to the available small-scale, biased datasets, thus trapping progress in a local minimum. To gain deeper insight into current issues in saliency modeling and to better gauge progress, we recorded eye movements of 120 observers while they freely viewed a large number of naturalistic and artificial images. Our stimuli include 4000 images: 200 from each of 20 categories covering different types of scenes such as Cartoons, Art, Objects, Low resolution images, Indoor, Outdoor, Jumbled, Random, and Line drawings. We analyze some basic properties of this dataset and compare some successful models. We believe that our dataset opens new challenges for the next generation of saliency models and helps conduct behavioral studies on bottom-up visual attention.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

UIGaze: How Closely Can VLMs Approximate Human Visual Attention on User Interfaces?
cs.HC 2026-04 accept novelty 6.0

VLMs achieve moderate alignment with human gaze on UIs that improves with longer viewing durations and varies by UI type, capturing exploratory rather than initial fixation patterns.