S em E val-2013 Task 4: Free Paraphrases of Noun Compounds

Hendrickx, Iris, Kozareva, Zornitsa, Nakov, Preslav, Szpakowicz, Stan, Veale, Tony · 2013

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

A new benchmark shows LLM first-answer accuracy on procedural arithmetic drops from 63% (5 steps) to 20% (95 steps) due to execution failures like skipped steps and premature answers.

citing papers explorer

Showing 1 of 1 citing paper after filters.

When LLMs Stop Following Steps: A Diagnostic Study of Procedural Execution in Language Models cs.CL · 2026-05-01 · unverdicted · none · ref 140
A new benchmark shows LLM first-answer accuracy on procedural arithmetic drops from 63% (5 steps) to 20% (95 steps) due to execution failures like skipped steps and premature answers.

S em E val-2013 Task 4: Free Paraphrases of Noun Compounds

fields

years

verdicts

representative citing papers

citing papers explorer