Perception of visual numerosity in humans and machines
Pith reviewed 2026-05-24 20:58 UTC · model grok-4.3
The pith
Deep networks replicate human numerosity perception with number as the main driver but continuous magnitudes exerting early influence
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Deep networks trained on numerosity comparison tasks accurately simulate human psychophysics and developmental changes, where discrimination relies primarily on numerosity information while non-numerical features exert significant impact especially early in development; representational similarity analysis shows that both numerosity and continuous magnitudes are spontaneously encoded even without task requirements.
What carries the argument
Deep neural network performing the numerosity comparison task, with representational similarity analysis measuring alignment of internal encodings to human data
If this is right
- Numerosity discrimination improves as the relative weight of non-numerical features declines across development.
- General visual processing networks can produce numerical behavior without requiring a dedicated number module.
- Both numerical and non-numerical magnitudes arise as salient properties in visual representations even when unprompted by task goals.
Where Pith is reading between the lines
- The same modeling approach could be extended to test whether other early mathematical concepts emerge from the statistics of natural visual input.
- Varying the training distribution or architecture would reveal how experience modulates the balance between numerical and continuous cues.
- If similar spontaneous encoding occurs in networks trained only on object recognition, it would strengthen the claim that numerosity is a byproduct of general vision.
Load-bearing premise
The chosen stimulus space and network training regime generate internal representations and decision behavior close enough to human vision that numerical versus non-numerical contributions can be compared directly.
What would settle it
Human performance data on a new stimulus set where numerosity is fully decorrelated from area and density would falsify the model if the network's predicted error patterns or developmental shifts deviate substantially from the observed human patterns.
Figures
read the original abstract
Numerosity perception is foundational to mathematical learning, but its computational bases are strongly debated. Some investigators argue that humans are endowed with a specialized system supporting numerical representation; others argue that visual numerosity is estimated using continuous magnitudes, such as density or area, which usually co-vary with number. Here we reconcile these contrasting perspectives by testing deep networks on the same numerosity comparison task that was administered to humans, using a stimulus space that allows to measure the contribution of non-numerical features. Our model accurately simulated the psychophysics of numerosity perception and the associated developmental changes: discrimination was driven by numerosity information, but non-numerical features had a significant impact, especially early during development. Representational similarity analysis further highlighted that both numerosity and continuous magnitudes were spontaneously encoded even when no task had to be carried out, demonstrating that numerosity is a major, salient property of our visual environment.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper trains deep convolutional networks on a numerosity comparison task using a stimulus set that decorrelates number from continuous magnitudes (area, density, etc.). It reports that the networks reproduce human psychophysical signatures and developmental trends, with numerosity as the dominant cue but non-numerical features exerting measurable influence (especially early in training). Representational similarity analysis on the networks' layers is used to demonstrate spontaneous encoding of both numerosity and continuous magnitudes even in the absence of an explicit task.
Significance. If the reported simulation holds, the work supplies a concrete computational demonstration that numerosity can emerge as a salient visual feature alongside continuous magnitudes, thereby offering a mechanistic reconciliation of the specialized-system versus magnitude-based accounts of numerical perception. The use of the identical task and stimulus space as the human experiments, together with the RSA analysis of spontaneous representations, strengthens the link between model behavior and developmental psychophysics.
minor comments (3)
- The abstract and introduction refer to 'developmental changes' but the methods section does not specify how the training schedule or data curriculum was aligned with human age groups; a brief clarification of this mapping would aid reproducibility.
- Figure 3 (RSA matrices) would benefit from explicit labeling of the layer indices corresponding to the reported correlation values, as the current caption leaves the mapping between network depth and the plotted layers implicit.
- The stimulus-generation procedure is described at a high level; adding a short supplementary table listing the exact ranges and sampling densities for each continuous magnitude would make the decorrelation claim easier to verify.
Simulated Author's Rebuttal
We thank the referee for their positive summary, recognition of the work's significance, and recommendation for minor revision. The report does not list any major comments.
Circularity Check
No significant circularity
full rationale
The paper trains convolutional networks on a numerosity comparison task using a controlled stimulus space that decorrelates number from continuous magnitudes, then evaluates the resulting model behavior against independent human psychophysics data. No parameters are fitted to the target human discrimination thresholds or developmental trajectories; the comparison is purely external. Representational similarity analysis is performed on the network's internal activations without reference to human data. No self-citation chain, uniqueness theorem, or ansatz imported from prior author work is invoked to justify the central claims. The derivation therefore remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Our model accurately simulated the psychophysics of numerosity perception... Representational similarity analysis further highlighted that both numerosity and continuous magnitudes were spontaneously encoded even when no task had to be carried out.
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Deep belief networks were first trained in a completely unsupervised way... using contrastive divergence
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Dehaene S (2011) The number sense: How the mind creates mathematics (Oxford University Press)
work page 2011
-
[2]
Trends Cogn Sci 14(12):542–551
Piazza M (2010) Neurocognitive start-up tools for symbolic number representations. Trends Cogn Sci 14(12):542–551
work page 2010
-
[3]
Butterworth B (1999) The mathematical brain. (Macmillan, London, UK)
work page 1999
-
[4]
Halberda J, Mazzocco MM, Feigenson L (2008) Individual differences in non-verbal number acuity correlate with maths achievement. Nature 455
work page 2008
-
[5]
Dehaene S (2003) The neural basis of the Weber--Fechner law: a logarithmic mental number line. Trends Cogn Sci 7(4):145–147
work page 2003
-
[6]
Piazza M, Izard V, Pinel P, Le Bihan D, Dehaene S (2004) Tuning curves for approximate numerosity in the human intraparietal sulcus. Neuron 44(3):547–555
work page 2004
-
[7]
Agrillo C, Dadda M, Serena G, Bisazza A (2008) Do fish count? Spontaneous discrimination of quantity in female mosquitofish. Anim Cogn 11(3):495–503. 16
work page 2008
-
[8]
Brannon EM, Wusthoff CJ, Gallistel CR, Gibbon J (2001) Numerical Subtraction in the Pigeon: Evidence for a Linear Subjective Number Scale. Psychol Sci 12(3):238–243
work page 2001
-
[9]
(2009) Number-based visual generalisation in the honeybee
Gross HJ, et al. (2009) Number-based visual generalisation in the honeybee. PLoS One 4(1). doi:10.1371/journal.pone.0004263
-
[10]
Cantlon JF, Brannon EM (2006) Shared System for Ordering Small and Large Numbers in Monkeys and Humans. Psychol Sci 17(5):401–406
work page 2006
-
[11]
Nieder A, Dehaene S (2009) Representation of Number in the Brain. Annu Rev Neurosci
work page 2009
-
[12]
Park J, Dewind NK, Woldorff MG, Brannon EM (2015) Rapid and Direct Encoding of Numerosity in the Visual Stream. Cereb Cortex 26(2):748–763
work page 2015
-
[13]
Proc Natl Acad Sci 106(25):10382–5
Izard V, Sann C, Spelke ES, Streri A (2009) Newborn infants perceive abstract numbers. Proc Natl Acad Sci 106(25):10382–5
work page 2009
-
[14]
Xu F, Spelke ES, Goddard S (2005) Number sense in human infants. Dev Sci 8(1):88–101
work page 2005
-
[15]
Halberda J, Feigenson L (2008) Developmental change in the acuity of the “Number Sense”: The Approximate Number System in 3-, 4-, 5-, and 6-year-olds and adults. Dev Psychol 44(5):1457–65
work page 2008
-
[16]
Proc Natl Acad Sci 109(28):11116–20
Halberda J, Ly R, Wilmer JB, Naiman DQ, Germine L (2012) Number sense across the lifespan as revealed by a massive Internet-based sample. Proc Natl Acad Sci 109(28):11116–20
work page 2012
-
[17]
Proc Natl Acad Sci 110(45):18116–18120
Starr A, Libertus ME, Brannon EM (2013) Number sense in infancy predicts mathematical abilities in childhood. Proc Natl Acad Sci 110(45):18116–18120
work page 2013
-
[18]
Schneider M, et al. (2017) Associations of non-symbolic and symbolic numerical magnitude processing with mathematical competence: a meta-analysis. Dev Sci 20(3). doi:10.1111/desc.12372
-
[19]
Piazza M, et al. (2010) Developmental Trajectory of Number Acuity Reveals a Severe Impairment in Developmental Dyscalculia. Cognition 116(1):33–41
work page 2010
-
[20]
Burr DC, Ross J (2008) A Visual Sense of Number. Curr Biol 18(6):425–428
work page 2008
-
[21]
Ferrigno S, Jara-Ettinger J, Piantadosi ST, Cantlon JF (2017) Universal and uniquely human factors in spontaneous number perception. Nat Commun 8:13968
work page 2017
-
[22]
Cicchini GM, Anobile G, Burr DC (2016) Spontaneous perception of numerosity in humans. Nat Commun 7:12536
work page 2016
-
[23]
Feigenson L, Dehaene S, Spelke ES (2004) Core systems of number. Trends Cogn Sci 8(7):307–314
work page 2004
-
[24]
Proc Natl Acad Sci 103(51):19599–19604
Hurewitz F, Gelman R, Schnitzer B (2006) Sometimes area counts more than number. Proc Natl Acad Sci 103(51):19599–19604
work page 2006
-
[25]
Feigenson L, Carey S, Spelke E (2002) Infants’ Discrimination of Number vs. Continuous Extent. Cogn Psychol 44(1):33–66
work page 2002
-
[26]
Clearfield MW, Mix KS (1999) Number Versus Contour Length in Infants’ Discrimination of Small Visual Sets. Psychol Sci 10(5):408–411
work page 1999
-
[27]
Gebuis T, Gevers W (2011) Numerosities and space; indeed a cognitive illusion! A reply to de Hevia and Spelke (2009). Cognition 121(2):248–252
work page 2011
-
[28]
Dehaene S, Izard V, Piazza M (2005) Control over non-numerical parameters in numerosity experiments. 17
work page 2005
-
[29]
Acta Psychol (Amst) 161:177–184
Clayton S, Gilmore C, Inglis M (2015) Dot comparison stimuli are not all alike: The effect of different visual controls on ANS measurement. Acta Psychol (Amst) 161:177–184
work page 2015
-
[30]
J Exp Psychol Gen 141(4):642–8
Gebuis T, Reynvoet B (2012) The interplay between nonsymbolic number and its continuous visual properties. J Exp Psychol Gen 141(4):642–8
work page 2012
-
[31]
Gilmore C, et al. (2013) Individual Differences in Inhibitory Control, Not Non-Verbal Number Acuity, Correlate with Mathematics Achievement. PLoS One 8(6):1–9
work page 2013
-
[32]
Cappelletti M, Didino D, Stoianov I, Zorzi M (2014) Number skills are maintained in healthy ageing. Cogn Psychol 69:25–45
work page 2014
-
[33]
Gebuis T, Cohen Kadosh R, Gevers W (2016) Sensory-integration system rather than approximate number system underlies numerosity processing : A critical review. Acta Psychol (Amst) 171:1–71
work page 2016
-
[34]
Leibovich T, Katzin N, Harel M, Henik A (2017) From “sense of number” to “sense of magnitude” - The role of continuous magnitudes in numerical cognition. Behav Brain Sci 164. doi:10.3389/fpsyg.2017.00652
-
[35]
Starr A, DeWind NK, Brannon EM (2017) The contributions of numerical acuity and non- numerical stimulus features to the development of the number sense and symbolic math achievement. Cognition 168:222–233
work page 2017
-
[36]
Piazza M, Feo V De, Panzeri S, Dehaene S (2018) Learning to focus on number. Cognition 181(June 2017):1–29
work page 2018
-
[37]
Bugden S, Ansari D (2016) Probing the nature of deficits in the ‘Approximate Number System’ in children with persistent Developmental Dyscalculia. Dev Sci 19(5):817–833
work page 2016
-
[38]
LeCun Y, Bengio Y, Hinton GE (2015) Deep learning. Nature 521(7553):436–444
work page 2015
-
[39]
Kriegeskorte N (2015) Deep Neural Networks: A New Framework for Modeling Biological Vision and Brain Information Processing. Annu Rev Vis Sci 1(1):417–446
work page 2015
-
[40]
Testolin A, Stoianov I, Zorzi M (2017) Letter perception emerges from unsupervised deep learning and recycling of natural image features. Nat Hum Behav 1(9):657–664
work page 2017
-
[41]
Hassabis D, Kumaran D, Summerfield C, Botvinick M (2017) Neuroscience-Inspired Artificial Intelligence. Neuron 95(2):245–258
work page 2017
-
[42]
Yamins DLK, DiCarlo JJ (2016) Using goal-driven deep learning models to understand sensory cortex. Nat Neurosci 19(3):356–365
work page 2016
-
[43]
Testolin A, Zorzi M (2016) Probabilistic Models and Generative Neural Networks: Towards an Unified Framework for Modeling Normal and Impaired Neurocognitive Functions. Front Comput Neurosci 10(73). doi:10.3389/fncom.2016.00073
-
[44]
Lake BM, Ullman TD, Tenenbaum JB, Gershman SJ (2017) Building Machines That Learn and Think Like People. Behav Brain Sci 40:1--72
work page 2017
-
[45]
Zorzi M, Testolin A, Stoianov IPIP (2013) Modeling language and cognition with deep unsupervised learning: a tutorial overview. Front Psychol 4(August):515
work page 2013
-
[46]
Hinton GE, Osindero S, Teh Y (2006) A fast learning algorithm for deep belief nets. Neural Comput 18(7):1527–1554
work page 2006
-
[47]
Fiser J, Berkes P, Orbán G, Lengyel M (2010) Statistically optimal perception and learning: from behavior to neural representations. Trends Cogn Sci 14(3):119–30
work page 2010
-
[48]
Friston KJ (2010) The free-energy principle: a unified brain theory? Nat Rev Neurosci 11(2):127–38. 18
work page 2010
-
[49]
Viswanathan P, Nieder A (2013) Neuronal correlates of a visual “sense of number” in primate parietal and prefrontal cortices. Proc Natl Acad Sci 110(27):11187–92
work page 2013
-
[50]
Izard V, Dehaene-Lambertz G, Dehaene S (2008) Distinct cerebral pathways for object identity and number in human infants. PLoS Biol 6(2):0275–0285
work page 2008
-
[51]
Stoianov I, Zorzi M (2012) Emergence of a “visual number sense” in hierarchical generative models. Nat Neurosci 15(2):194–6
work page 2012
-
[52]
Philos Trans R Soc B Biol Sci 373(1740)
Zorzi M, Testolin A (2018) An emergentist perspective on the origin of number sense. Philos Trans R Soc B Biol Sci 373(1740). doi:10.1098/rstb.2017.0043
-
[53]
DeWind NK, Adams GK, Platt ML, Brannon EM (2015) Modeling the approximate number system to quantify the contribution of visual stimulus features. Cognition 142:247–265
work page 2015
-
[54]
Front Syst Neurosci 2(November):1–28
Kriegeskorte N (2008) Representational similarity analysis – connecting the branches of systems neuroscience. Front Syst Neurosci 2(November):1–28
work page 2008
-
[55]
Benjamini Y, Hochberg Y (1995) Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. J R Stat Soc B 57(1):289–300
work page 1995
-
[56]
Maaten L Van Der, Hinton GE (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579– 2605
work page 2008
-
[57]
Cantrell L, Smith LB (2013) Open questions and a proposal: A critical review of the evidence on infant numerical abilities. Cognition 128(3):331–352
work page 2013
-
[58]
Stoianov I, Zorzi M (2017) Computational foundations of the visual number sense. Behav Brain Sci
work page 2017
-
[59]
Nat Rev Neurosci 17(6):366–382
Nieder A (2016) The neuronal code for number. Nat Rev Neurosci 17(6):366–382
work page 2016
-
[60]
(2016) The limitations of deep learning in adversarial settings
Papernot N, et al. (2016) The limitations of deep learning in adversarial settings. Proc - 2016 IEEE Eur Symp Secur Privacy, EURO S P 2016:372–387
work page 2016
-
[61]
Perry C, Zorzi M, Ziegler JC (2019) Understanding Dyslexia Through Personalized Large-Scale Computational Models. Psychol Sci:095679761882354
work page 2019
-
[62]
Selinummi J, Seppälä J, Yli-Harja O, Puhakka JA (2005) Software for quantification of labeled bacteria from digital microscope images by automated image analysis. Biotechniques 39(6):859–862
work page 2005
-
[63]
Understanding Traffic Density from Large-Scale Web Camera Data
Zhang S, Wu G, Costeira JP, Moura JMF (2017) Understanding Traffic Density from Large- Scale Web Camera Data. Available at: http://arxiv.org/abs/1703.05868
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[64]
Zhang C, Li H, Wang X, Yang X (2015) Cross-scene crowd counting via deep convolutional neural networks. Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp 833–841
work page 2015
-
[65]
Adv Neural Inf Process Syst:1324–1332
Lempitsky V, Zisserman A (2010) Learning To Count Objects in Images. Adv Neural Inf Process Syst:1324–1332
work page 2010
-
[66]
(2015) Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks
Weston J, et al. (2015) Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks. International Conference on Representation Learning doi:10.1016/j.jpowsour.2014.09.131
-
[67]
Trask A, et al. (2018) Neural Arithmetic Logic Units Available at: http://arxiv.org/abs/1808.00508
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[68]
Phys D Nonlinar Phenom 42(1–3):335–346
Harnad S (1990) The symbol grounding problem. Phys D Nonlinar Phenom 42(1–3):335–346
work page 1990
-
[69]
Searle JR (1980) Minds, brains, and programs. Behav Brain Sci 3(03):417–424. 19
work page 1980
-
[70]
Leibovich T, Ansari D (2016) The Symbol-Grounding Problem in Numerical Cognition: A Review of Theory, Evidence, and Outstanding Questions. Can J Exp Psychol 70(1):12–23
work page 2016
-
[71]
(2014) A Toolbox for Representational Similarity Analysis
Nili H, et al. (2014) A Toolbox for Representational Similarity Analysis. PLoS Comput Biol 10(4). doi:10.1371/journal.pcbi.1003553
-
[72]
Ackley D, Hinton GE, Sejnowski TJ (1985) A learning algorithm for Boltzmann machines. Cogn Sci 9(1):147–169
work page 1985
-
[73]
Hinton GE (2002) Training products of experts by minimizing contrastive divergence. Neural Comput 14(8):1771–1800
work page 2002
-
[74]
Testolin A, Stoianov I, De Filippo De Grazia M, Zorzi M (2013) Deep unsupervised learning on a desktop PC : A primer for cognitive scientists. Front Psychol 4(May):251
work page 2013
-
[75]
Hertz JA, Krogh AS, Palmer RG (1991) Introduction to the theory of neural computation (Addison-Weasley, Redwood City, CA)
work page 1991
-
[76]
(2014) Generative Adversarial Nets
Goodfellow I, et al. (2014) Generative Adversarial Nets. Adv Neural Inf Process Syst:2672– 2680
work page 2014
-
[77]
Zanetti A, Testolin A, Zorzi M, Wawrzynski P (2019) Numerosity Representation in InfoGAN: An Empirical Study. Advances in Computational Intelligence. IWANN, eds I. R, G. J, A. C (Springer International Publishing, Cham), pp 49–6 20 Supplementary Methods Stimulus space definition The relationship betwee n the three orthogonal dimensions Numerosity, Size an...
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.