Detecting Statistical Interactions from Neural Network Weights

Michael Tsang , Dehua Cheng , Yan Liu

Authors on Pith no claims yet

classification 📊 stat.ML cs.LG

keywords interactionsneuraldatasetsdetectinginterpretingmethodnetworkperformance

read the original abstract

Interpreting neural networks is a crucial and challenging task in machine learning. In this paper, we develop a novel framework for detecting statistical interactions captured by a feedforward multilayer neural network by directly interpreting its learned weights. Depending on the desired interactions, our method can achieve significantly better or similar interaction detection performance compared to the state-of-the-art without searching an exponential solution space of possible interactions. We obtain this accuracy and efficiency by observing that interactions between input features are created by the non-additive effect of nonlinear activation functions, and that interacting paths are encoded in weight matrices. We demonstrate the performance of our method and the importance of discovered interactions via experimental results on both synthetic datasets and real-world application datasets.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Evaluating LLMs on Large-Scale Graph Property Estimation via Random Walks
cs.LG 2026-05 unverdicted novelty 7.0

EstGraph benchmark evaluates LLMs on estimating properties of very large graphs from random-walk samples that fit in context limits.
Selecting Feature Interactions for Generalized Additive Models by Distilling Foundation Models
cs.LG 2026-04 unverdicted novelty 7.0

TabDistill distills feature interactions from tabular foundation models via post-hoc attribution and inserts them into GAMs, yielding consistent predictive gains.