Hypothesis Only Baselines in Natural Language Inference

Adam Poliak; Aparajita Haldar; Benjamin Van Durme; Jason Naradowsky; Rachel Rudinger

Hypothesis Only Baselines in Natural Language Inference

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1805.01042 v1 pith:CP5NCJXP submitted 2018-05-02 cs.CL

Hypothesis Only Baselines in Natural Language Inference

Adam Poliak , Jason Naradowsky , Aparajita Haldar , Rachel Rudinger , Benjamin Van Durme This is my paper

classification cs.CL

keywords contextdatasetshypothesisinferencebaselinelanguagemodelnatural

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

We propose a hypothesis only baseline for diagnosing Natural Language Inference (NLI). Especially when an NLI dataset assumes inference is occurring based purely on the relationship between a context and a hypothesis, it follows that assessing entailment relations while ignoring the provided context is a degenerate solution. Yet, through experiments on ten distinct NLI datasets, we find that this approach, which we refer to as a hypothesis-only model, is able to significantly outperform a majority class baseline across a number of NLI datasets. Our analysis suggests that statistical irregularities may allow a model to perform NLI in some datasets beyond what should be achievable without access to the context.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Investigating Biases in Textual Entailment Datasets
cs.CL 2019-06 unverdicted novelty 5.0

Hypothesis-only classification reaches 64% accuracy on SNLI, revealing dataset biases in SNLI and MultiNLI that the authors quantify and propose a simple mitigation for.