pith. sign in

An analysis for reasoning bias of language models with small initialization

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

fields

cs.LG 2 cs.CL 1

years

2026 2 2025 1

representative citing papers

An overview of condensation phenomenon in deep learning

cs.LG · 2025-04-13 · unverdicted · novelty 2.0

Neural networks exhibit condensation of neurons into clusters with similar outputs whose number increases monotonically during training, facilitated by small initializations or dropout, providing insights into generalization and reasoning.

citing papers explorer

Showing 3 of 3 citing papers.