Survey: Multiword Expression Processing: A Survey
3 Pith papers cite this work. Polarity classification is still indexing.
Fields: cs.CL (3)
Years: 2026 (3)
Verdicts: UNVERDICTED (3)
citing papers explorer
- IdioLink: Retrieving Meaning Beyond Words Across Idiomatic and Literal Expressions
IdioLink introduces a benchmark dataset and evaluation showing that strong embedding models struggle to retrieve equivalent meanings across idiomatic and literal forms, relying on shallow cues instead.
- A Systematic Exploration of Text Decomposition and Budget Distribution in Differentially Private Text Obfuscation
Systematic experiments show that text decomposition methods and privacy budget allocation strategies produce significantly different privacy-utility trade-offs even under comparable total epsilon budgets.
- Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models
SemanticQA unifies prior multiword expression datasets into a benchmark that reveals substantial performance variation among language models on semantic reasoning tasks.