is_pun": true/false} IMPORTANT: Output ONLY the JSON object, no additional text or explanation. Note: The biased-to-non-pun variant changes the task description to

Naturalness: Are the caption and visual scenario natural and plausible? Samples are retained if at least 2 out of 3 annotators agree on acceptance. · 2025
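The retention rule quoted above (keep a sample when at least 2 of 3 annotators accept it) can be sketched as a simple vote threshold. This is only an illustrative sketch: the function name `is_retained`, the `min_accept` parameter, and the sample identifiers are hypothetical and do not come from the cited work.

```python
from typing import Sequence


def is_retained(votes: Sequence[bool], min_accept: int = 2) -> bool:
    """Return True when at least `min_accept` annotators accepted the sample.

    `votes` holds one boolean per annotator (True = accept). The default
    threshold of 2 mirrors the "2 out of 3" rule quoted in the text.
    """
    return sum(votes) >= min_accept


# Hypothetical annotation records: three accept/reject votes per sample.
samples = {
    "sample_a": [True, True, False],   # 2 of 3 accept -> retained
    "sample_b": [True, False, False],  # 1 of 3 accept -> dropped
    "sample_c": [True, True, True],    # 3 of 3 accept -> retained
}

retained = [name for name, votes in samples.items() if is_retained(votes)]
print(retained)
```

Keeping the threshold as a parameter makes it easy to check how a stricter rule (for example, unanimous acceptance) would change the retained set.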

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

"I See What You Did There": Can Large Vision-Language Models Understand Multimodal Puns?

cs.CL · 2026-04-07 · unverdicted · novelty 6.0

Vision-language models largely fail to distinguish multimodal puns from adversarial non-puns but gain an average 16.5% F1 improvement from prompt-level and model-level interventions.

citing papers explorer

Showing 1 of 1 citing paper.

"I See What You Did There": Can Large Vision-Language Models Understand Multimodal Puns?

cs.CL · 2026-04-07 · unverdicted · none · ref 11

Vision-language models largely fail to distinguish multimodal puns from adversarial non-puns but gain an average 16.5% F1 improvement from prompt-level and model-level interventions.
