Gemini 3 Deep Think Preview Verification on ARC-AGI-2.https:// huggingface.co/datasets/arcprize/arc_agi_v2_public_eval, 2026

ARC Prize Foundation · 2026

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

cs.AI · 2026-03-24 · unverdicted · novelty 6.0

ARC-AGI-3 is a benchmark where humans solve 100% of tasks but frontier AI systems score below 1% as of March 2026, using efficiency-based scoring grounded in human baselines.

citing papers explorer

Showing 1 of 1 citing paper.

ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence cs.AI · 2026-03-24 · unverdicted · none · ref 7
ARC-AGI-3 is a benchmark where humans solve 100% of tasks but frontier AI systems score below 1% as of March 2026, using efficiency-based scoring grounded in human baselines.

Gemini 3 Deep Think Preview Verification on ARC-AGI-2.https:// huggingface.co/datasets/arcprize/arc_agi_v2_public_eval, 2026

fields

years

verdicts

representative citing papers

citing papers explorer