What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features

Guy Van den Broeck; Pasha Khosravi; Yitao Liang; YooJung Choi

arxiv: 1903.01620 · v2 · pith:BMCVFJB7new · submitted 2019-03-05 · 💻 cs.LG · cs.AI· stat.ML

What to Expect of Classifiers? Reasoning about Logistic Regression with Missing Features

Pasha Khosravi , Yitao Liang , YooJung Choi , Guy Van den Broeck This is my paper

classification 💻 cs.LG cs.AIstat.ML

keywords featuresmissinglogisticregressionclassifiersdistributionexpectedprediction

0 comments

read the original abstract

While discriminative classifiers often yield strong predictive performance, missing feature values at prediction time can still be a challenge. Classifiers may not behave as expected under certain ways of substituting the missing values, since they inherently make assumptions about the data distribution they were trained on. In this paper, we propose a novel framework that classifies examples with missing features by computing the expected prediction with respect to a feature distribution. Moreover, we use geometric programming to learn a naive Bayes distribution that embeds a given logistic regression classifier and can efficiently take its expected predictions. Empirical evaluations show that our model achieves the same performance as the logistic regression with all features observed, and outperforms standard imputation techniques when features go missing during prediction time. Furthermore, we demonstrate that our method can be used to generate "sufficient explanations" of logistic regression classifications, by removing features that do not affect the classification.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

SeBA: Semi-supervised few-shot learning via Separated-at-Birth Alignment for tabular data
cs.LG 2026-05 unverdicted novelty 7.0

SeBA is a joint-embedding framework that separates tabular data into two complementary views and aligns one view's representations to the nearest-neighbor structure of the other, improving feature-label relationships ...