Single-image crowd counting via multi-column convolutional neural network

Zhang, Y · 2016 · DOI 10.1109/cvpr.2016.70

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

Unveiling the Visual Counting Bottleneck in Vision-Language Models

cs.MM · 2026-05-28 · unverdicted · novelty 6.0

VLMs fail at visual counting extrapolation because they cannot project visual magnitudes onto symbolic tokens, despite intact perceptual representations, supporting a fractured magnitude hypothesis.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Unveiling the Visual Counting Bottleneck in Vision-Language Models cs.MM · 2026-05-28 · unverdicted · none · ref 46
VLMs fail at visual counting extrapolation because they cannot project visual magnitudes onto symbolic tokens, despite intact perceptual representations, supporting a fractured magnitude hypothesis.

Single-image crowd counting via multi-column convolutional neural network

fields

years

verdicts

representative citing papers

citing papers explorer