In NeurIPS 2025 AI for Science Workshop

Opendiscovery: A verifiable, creative science problem-solving dataset to forge AI scientists · 2025 · arXiv 2601.20833

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery

cs.AI · 2026-04-28 · accept · novelty 8.0

AutoResearchBench is a new benchmark showing top AI agents achieve under 10% success on complex scientific literature discovery tasks that demand deep comprehension and open-ended search.

Toward Autonomous Long-Horizon Engineering for ML Research

cs.CL · 2026-04-14

Learning to Predict Future-Aligned Research Proposals with Language Models

cs.CL · 2026-03-28

citing papers explorer

Showing 3 of 3 citing papers.

AutoResearchBench: Benchmarking AI Agents on Complex Scientific Literature Discovery cs.AI · 2026-04-28 · accept · none · ref 7
AutoResearchBench is a new benchmark showing top AI agents achieve under 10% success on complex scientific literature discovery tasks that demand deep comprehension and open-ended search.
Toward Autonomous Long-Horizon Engineering for ML Research cs.CL · 2026-04-14 · unreviewed · ref 23
Learning to Predict Future-Aligned Research Proposals with Language Models cs.CL · 2026-03-28 · unreviewed · ref 4

In NeurIPS 2025 AI for Science Workshop

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer