Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models

Jiayang Gao; Qihan Ren; Quanshi Zhang; Wen Shen

arxiv: 2305.01939 · v2 · pith:FSFT25MSnew · submitted 2023-05-03 · 💻 cs.LG · cs.AI· cs.CV

Where We Have Arrived in Proving the Emergence of Sparse Symbolic Concepts in AI Models

Qihan Ren , Jiayang Gao , Wen Shen , Quanshi Zhang This is my paper

classification 💻 cs.LG cs.AIcs.CV

keywords conditionsemergenceinferenceinputinteractionsoccludedprovesamples

0 comments

read the original abstract

This study aims to prove the emergence of symbolic concepts (or more precisely, sparse primitive inference patterns) in well-trained deep neural networks (DNNs). Specifically, we prove the following three conditions for the emergence. (i) The high-order derivatives of the network output with respect to the input variables are all zero. (ii) The DNN can be used on occluded samples and when the input sample is less occluded, the DNN will yield higher confidence. (iii) The confidence of the DNN does not significantly degrade on occluded samples. These conditions are quite common, and we prove that under these conditions, the DNN will only encode a relatively small number of sparse interactions between input variables. Moreover, we can consider such interactions as symbolic primitive inference patterns encoded by a DNN, because we show that inference scores of the DNN on an exponentially large number of randomly masked samples can always be well mimicked by numerical effects of just a few interactions.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Towards the Connection between Activation Sparsity and Flat Minima
cs.LG 2026-05 unverdicted novelty 5.0

MLP activation sparsity equals augmented flatness divided by input norm times gradient; the ratio falls during training and can be reduced further by three plug-and-play changes, yielding higher sparsity on ImageNet and C4.