Embarrassingly Simple Performance Prediction for Abductive Natural Language Inference

Emīls Kadiķis; Roman Klinger; Vaibhav Srivastav

arxiv: 2202.10408 · v3 · pith:GZMRFYRYnew · submitted 2022-02-21 · 💻 cs.CL



keywords: model, performance, similarity, task, when, abductive, accuracy, alpha


Abstract

The task of abductive natural language inference (αNLI), deciding which hypothesis is the more likely explanation for a set of observations, is a particularly difficult type of NLI. Instead of just determining a causal relationship, it requires common sense to also evaluate how reasonable an explanation is. All recent competitive systems build on contextualized representations and use transformer architectures to learn an NLI model. Anyone faced with a particular NLI task needs to select the best available model, which is a time-consuming and resource-intensive endeavour. To solve this practical problem, we propose a simple method for predicting performance without actually fine-tuning the model. We test how well pre-trained models perform on the αNLI task when sentence embeddings are simply compared with cosine similarity, and compare this with the performance achieved when training a classifier on top of these embeddings. We show that the accuracy of the cosine similarity approach correlates strongly with the accuracy of the classification approach, with a Pearson correlation coefficient of 0.65. Since the similarity computation is orders of magnitude faster on a given dataset (less than a minute vs. hours), our method can lead to significant time savings in the process of model selection.
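The idea in the abstract can be illustrated with a minimal sketch in Python. It is not the authors' implementation. It assumes that each example has already been reduced to one embedding vector for the observations and one per hypothesis (how the paper builds these vectors is not stated in the abstract), and that the label is 1 or 2. The function names `cosine`, `pick_hypothesis`, `similarity_accuracy`, and `pearson` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def pick_hypothesis(obs_vec, hyp1_vec, hyp2_vec):
    """Return 1 or 2: the hypothesis whose embedding is closer to the observations.

    Assumption: the observations are represented by a single vector.
    Ties go to hypothesis 1.
    """
    if cosine(obs_vec, hyp1_vec) >= cosine(obs_vec, hyp2_vec):
        return 1
    return 2


def similarity_accuracy(examples):
    """Accuracy of the similarity baseline.

    `examples` is a list of (obs_vec, hyp1_vec, hyp2_vec, label) tuples,
    with label in {1, 2}. No training is involved.
    """
    correct = sum(
        1 for obs, h1, h2, label in examples
        if pick_hypothesis(obs, h1, h2) == label
    )
    return correct / len(examples)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    std_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    std_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (std_x * std_y)
```

Under these assumptions, one would compute `similarity_accuracy` once per pre-trained embedding model, collect the corresponding accuracies of the trained classifiers, and pass the two lists to `pearson` to check how well the cheap score tracks the expensive one.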


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

HypoAgent: An Agentic Framework for Interactive Abductive Hypothesis Generation over Knowledge Graphs
cs.AI 2026-05 unverdicted novelty 6.0

HypoAgent is an agentic framework using intent recognition, hypothesis generation, and root cause analysis agents that reports state-of-the-art semantic similarity on commonsense and biomedical knowledge graphs under ...