Explainable Neural Networks based on Additive Index Models

Agus Sudjianto; Erind Brahimi; Jie Chen; Joel Vaughan; Vijayan N. Nair

arXiv: 1806.01933 · v1 · pith:HEKEJFVInew · submitted 2018-06-05 · stat.ML · cs.LG

Joel Vaughan, Agus Sudjianto, Erind Brahimi, Jie Chen, Vijayan N. Nair

classification: stat.ML, cs.LG

keywords: neural, features, network, explainable, interpretable, model, models, networks

Machine learning algorithms have been used increasingly in recent years because of their flexibility in model fitting and their improved predictive performance. However, the complexity of these models makes it hard for the data analyst to interpret the results and explain them without additional tools. This has led to much research into approaches for understanding model behavior. In this paper, we present the Explainable Neural Network (xNN), a structured neural network designed specifically to learn interpretable features. Unlike in fully connected neural networks, the features engineered by the xNN can be extracted from the network and displayed in a relatively straightforward manner. With appropriate regularization, the xNN provides a parsimonious explanation of the relationship between the features and the output. We illustrate this interpretable feature-engineering property on simulated examples.
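
The abstract does not spell out the architecture, so the following is only a rough sketch of what a structured network based on an additive index model might look like. It assumes the standard additive index form f(x) = mu + sum_k gamma_k * h_k(beta_k . x), where each h_k is a small univariate subnetwork. The function names, the single tanh hidden layer, the random initialisation, and the L1 penalty (standing in for the "appropriate regularization" mentioned above) are all illustrative assumptions, not details taken from the paper.

```python
import math
import random


def make_xnn(n_features, n_subnets, hidden=4, seed=0):
    """Randomly initialise a toy additive-index network (hypothetical helper)."""
    rng = random.Random(seed)

    def rand_vec(n):
        return [rng.uniform(-1.0, 1.0) for _ in range(n)]

    return {
        # One projection vector beta_k per subnetwork.
        "beta": [rand_vec(n_features) for _ in range(n_subnets)],
        # Each subnetwork maps a scalar to a scalar (assumed: one tanh layer).
        "subnets": [
            {"w1": rand_vec(hidden), "b1": rand_vec(hidden), "w2": rand_vec(hidden)}
            for _ in range(n_subnets)
        ],
        # Combination weights gamma_k and intercept mu.
        "gamma": rand_vec(n_subnets),
        "mu": 0.0,
    }


def ridge_function(subnet, z):
    """Univariate subnetwork h_k: scalar in, scalar out."""
    return sum(
        w2 * math.tanh(w1 * z + b1)
        for w1, b1, w2 in zip(subnet["w1"], subnet["b1"], subnet["w2"])
    )


def forward(params, x):
    """Return the prediction plus the intermediate pieces that compose it."""
    # Projection step: one linear combination of the inputs per subnetwork.
    projections = [sum(b * xi for b, xi in zip(beta, x)) for beta in params["beta"]]
    # Each learned feature depends on a single projection only.
    features = [ridge_function(s, z) for s, z in zip(params["subnets"], projections)]
    # Combination step: weighted sum of the learned features.
    prediction = params["mu"] + sum(g * h for g, h in zip(params["gamma"], features))
    return prediction, projections, features


def l1_penalty(params, lam=0.01):
    """Assumed sparsity penalty on projection and combination weights."""
    total = sum(abs(b) for beta in params["beta"] for b in beta)
    total += sum(abs(g) for g in params["gamma"])
    return lam * total
```

Because each subnetwork sees only one scalar projection, the learned function h_k can be inspected by evaluating `ridge_function` on a grid of values, and the weights in `beta` show which inputs feed it. This is one plausible reading of how features could be "extracted from the network and displayed".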

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Sparse Deep Additive Model with Interactions: Enhancing Interpretability and Predictability
stat.ML 2025-09 unverdicted novelty 6.0

SDAMI detects interactions in high-dimensional data via an Effect Footprint principle and models them using sparsity, group lasso, and dedicated deep subnetworks for improved interpretability.

Explainability Methods for Hardware Trojan Detection: A Systematic Comparison
cs.LG 2026-01 unverdicted novelty 4.0

Compares domain-aware, case-based, and feature attribution explainability methods for gate-level hardware Trojan detection on the Trust-Hub benchmark dataset.