Lake, and Todd M

Solim LeGris, Wai Keen Vong, Brenden M · 2024 · arXiv 2409.01374

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

ARC-AGI-2: A New Challenge for Frontier AI Reasoning Systems

cs.AI · 2025-05-17 · unverdicted · novelty 6.0

ARC-AGI-2 adds a larger, more complex set of tasks to the original ARC-AGI benchmark to give finer-grained measurement of fluid intelligence in AI.

Structural Grid Descriptors Predict Within-Task Solver Success on ARC-AGI

cs.LG · 2026-06-08 · conditional · novelty 5.0

Hand-crafted grid descriptors at 50% trajectory completion predict within-task ARC-AGI solver success (AUC 0.885) and transfer across solvers (AUC 0.75).

Think Fast: Estimating No-CoT Task-Completion Time Horizons of Frontier AI Models

cs.AI · 2026-06-05 · unverdicted · novelty 5.0 · 2 refs

Frontier AI models' no-CoT 50% task-completion time horizons have doubled yearly over six years, reaching over 3 minutes for GPT-5.5 with projections to 25 minutes by 2030.
