Title resolution pending

If Pam is currently twice as young as Rena, that means that Rena is currently twice as old as Pam is

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Exploration-Driven Optimization for Test-Time Large Language Model Reasoning

cs.LG · 2026-05-11 · unverdicted · novelty 5.0

EDO integrates exploration objectives into RL post-training of LLMs, yielding greater solution diversity, 1.0-1.3% gains on in-distribution reasoning benchmarks, and 1.5% on out-of-distribution tasks when paired with test-time methods.

citing papers explorer

Showing 1 of 1 citing paper.

Exploration-Driven Optimization for Test-Time Large Language Model Reasoning cs.LG · 2026-05-11 · unverdicted · none · ref 11
EDO integrates exploration objectives into RL post-training of LLMs, yielding greater solution diversity, 1.0-1.3% gains on in-distribution reasoning benchmarks, and 1.5% on out-of-distribution tasks when paired with test-time methods.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer