Accessed: 2025-01-30

URL https://deepmind · 2025 · DOI 10.1088/2632-215

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open at publisher browse 1 citing papers

representative citing papers

The Two-Hump Problem: Bridging the Difficulty Gap in Mathematical Reinforcement Learning

cs.LG · 2026-06-19 · unverdicted · novelty 5.0

The authors identify a structural barrier ('two-hump' difficulty distribution) in RL for mathematical search problems like the Andrews-Curtis conjecture and propose data generation plus algorithmic enhancements to create solvable intermediate instances, releasing AC-19 and AC-1M datasets.

citing papers explorer

Showing 1 of 1 citing paper.

The Two-Hump Problem: Bridging the Difficulty Gap in Mathematical Reinforcement Learning cs.LG · 2026-06-19 · unverdicted · none · ref 1
The authors identify a structural barrier ('two-hump' difficulty distribution) in RL for mathematical search problems like the Andrews-Curtis conjecture and propose data generation plus algorithmic enhancements to create solvable intermediate instances, releasing AC-19 and AC-1M datasets.

Accessed: 2025-01-30

fields

years

verdicts

representative citing papers

citing papers explorer