Studying large language model generalization with influence functions. arXiv preprint arXiv:2308.03296

16 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 2 · method: 1

representative citing papers

Interaction-Aware Influence Functions for Group Attribution

cs.LG · 2026-05-15 · conditional · novelty 6.0

Extends influence functions with a second-order pairwise interaction term that improves group attribution accuracy over simple summation on multiple model-dataset pairs and instruction-tuning selection tasks.

Feature Identification via the Empirical NTK

cs.LG · 2025-10-01 · unverdicted · novelty 6.0

Eigenanalysis of the empirical NTK surfaces feature directions that align with Fourier features in modular addition networks and grammatical features in Gemma-3-270M, outperforming PCA baselines on activations.
