AS titch I n L anguage M odels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models

Tayyar Madabushi, Harish, Gow-Smith, Edward, Scarton, Carolina, Villavicencio, Aline · 2021 · DOI 10.18653/v1/2021.findings-emnlp.294

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

Multilingual Idioms in Sentences and Conversations Across High-, Medium-, and Low-Resource Languages

cs.CL · 2026-06-01 · unverdicted · novelty 7.0

MIDI is a new multilingual idiom dataset with sentence and conversational contexts; benchmarking reveals worse performance in low-resource languages and on literal vs. figurative uses.

Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models

cs.CL · 2026-04-17 · unverdicted · novelty 4.0 · 2 refs

SemanticQA unifies prior multiword expression datasets into a benchmark that reveals substantial performance variation among language models on semantic reasoning tasks.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Multilingual Idioms in Sentences and Conversations Across High-, Medium-, and Low-Resource Languages cs.CL · 2026-06-01 · unverdicted · none · ref 22
MIDI is a new multilingual idiom dataset with sentence and conversational contexts; benchmarking reveals worse performance in low-resource languages and on literal vs. figurative uses.
Revisiting a Pain in the Neck: A Semantic Reasoning Benchmark for Language Models cs.CL · 2026-04-17 · unverdicted · none · ref 29 · 2 links
SemanticQA unifies prior multiword expression datasets into a benchmark that reveals substantial performance variation among language models on semantic reasoning tasks.

AS titch I n L anguage M odels: Dataset and Methods for the Exploration of Idiomaticity in Pre-Trained Language Models

fields

years

verdicts

representative citing papers

citing papers explorer