pith. sign in

Title resolution pending

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.CL 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

HERO'S JOURNEY: Testing Complex Rule Induction with Text Games

cs.CL · 2026-06-01 · unverdicted · novelty 6.0

HERO'S JOURNEY benchmark evaluates LLMs on attribute and procedural rule induction across four structural forms, finding limited uneven performance with execution as the main bottleneck and steering helping only attribute tasks.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • HERO'S JOURNEY: Testing Complex Rule Induction with Text Games cs.CL · 2026-06-01 · unverdicted · none · ref 52

    HERO'S JOURNEY benchmark evaluates LLMs on attribute and procedural rule induction across four structural forms, finding limited uneven performance with execution as the main bottleneck and steering helping only attribute tasks.