Step by step network. arXiv preprint arXiv:2511.14329

Han, D · 2025 · arXiv 2511.14329

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

KAN-SAE applies nonlinear per-feature B-spline activations in sparse autoencoders to discover 72% more alive climate features and interpretable patterns such as European heatwaves and Pacific typhoons in deep learning weather models.

SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm

cs.LG · 2026-02-08 · unverdicted · novelty 6.0

SiameseNorm is a two-stream architecture that reconciles Pre-Norm and Post-Norm in Transformers by coupling streams via shared residual blocks, yielding performance gains with maintained stability on language, vision, and diffusion models.

citing papers explorer

Beyond Linear Superposition: Discovering Climate Features in AI Weather Models with KAN-SAE

cs.LG · 2026-05-17 · unverdicted · none · ref 7

SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm

cs.LG · 2026-02-08 · unverdicted · none · ref 10
