Won't Get Fooled Again: Answering Questions with False Premises

Huadong Wang; Maosong Sun; Shengding Hu; Xingyi Cheng; Yifan Luo; Zhiyuan Liu

arxiv: 2307.02394 · v1 · pith:4HSX2MRAnew · submitted 2023-07-05 · 💻 cs.CL

Won't Get Fooled Again: Answering Questions with False Premises

Shengding Hu , Yifan Luo , Huadong Wang , Xingyi Cheng , Zhiyuan Liu , Maosong Sun This is my paper

classification 💻 cs.CL

keywords plmsquestionsfpqsfalseknowledgepremisesexplanationsfalseqa

0 comments

read the original abstract

Pre-trained language models (PLMs) have shown unprecedented potential in various fields, especially as the backbones for question-answering (QA) systems. However, they tend to be easily deceived by tricky questions such as "How many eyes does the sun have?". Such frailties of PLMs often allude to the lack of knowledge within them. In this paper, we find that the PLMs already possess the knowledge required to rebut such questions, and the key is how to activate the knowledge. To systematize this observation, we investigate the PLMs' responses to one kind of tricky questions, i.e., the false premises questions (FPQs). We annotate a FalseQA dataset containing 2365 human-written FPQs, with the corresponding explanations for the false premises and the revised true premise questions. Using FalseQA, we discover that PLMs are capable of discriminating FPQs by fine-tuning on moderate numbers (e.g., 256) of examples. PLMs also generate reasonable explanations for the false premise, which serve as rebuttals. Further replaying a few general questions during training allows PLMs to excel on FPQs and general questions simultaneously. Our work suggests that once the rebuttal ability is stimulated, knowledge inside the PLMs can be effectively utilized to handle FPQs, which incentivizes the research on PLM-based QA systems.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Gemini: A Family of Highly Capable Multimodal Models
cs.CL 2023-12 conditional novelty 6.0

Gemini Ultra reaches human-expert performance on MMLU for the first time and sets new state-of-the-art results on 30 of 32 benchmarks, including all 20 multimodal ones tested.
Scaling with Confidence: Calibrating Confidence of LLMs for Adaptive Test Time Scaling
cs.AI 2026-07 unverdicted novelty 5.0

C3RL is a new RL algorithm combining correctness, calibration, and reference accuracy rewards to improve LLM confidence calibration, enabling CAS to outperform majority voting with up to 12.33x lower inference cost.