hub

Nature , year =

Gottweis, Juraj, Weng, Wei-Hung, Daryin, Andrey, others , title = · 2026 · DOI 10.1038/s41586-026-10644-y

10 Pith papers cite this work. Polarity classification is still indexing.

10 Pith papers citing it

open at publisher browse 10 citing papers

hub tools

JSON dossier citing papers JSON publisher DOI

representative citing papers

NatureBench: Can Coding Agents Match the Published SOTA of Nature-Family Papers?

cs.CL · 2026-06-23 · unverdicted · novelty 7.0

NatureBench evaluates ten frontier AI coding agents on 90 tasks from Nature papers under web-search-disabled conditions and finds the strongest agent surpasses published SOTA on only 17.8% of tasks, succeeding mainly by translating problems into familiar supervised learning setups.

Closed-loop Auto Research for Molecular Property Prediction: Discovering and Certifying Generalizable Improvements

cs.AI · 2026-06-22 · unverdicted · novelty 6.0

Closed-loop LM-agent auto research finds some transferable gains on molecular property prediction benchmarks via external data but shows non-transfer for model and feature edits selected on validation.

Deterministic Integrity Gates for LLM-Assisted Clinical Manuscript Preparation: An Auditable Biomedical Informatics Architecture

cs.AI · 2026-06-08 · unverdicted · novelty 6.0

Presents MedSci Skills, an open-source toolkit with deterministic integrity gates for verifying LLM-assisted clinical manuscripts against reporting guidelines like STARD, PRISMA, and STROBE.

DN-Hypo-Pipeline: An AI-Driven Workflow for Generating Hypotheses using Large Language Models and Scientific Explanations

cs.AI · 2026-06-07 · unverdicted · novelty 6.0

DN-Hypo-Pipeline operationalizes three philosophy-of-science accounts to direct LLMs toward principle-based hypothesis generation, claims superior performance over direct prompting, and derives two new transformer algorithms from the resulting hypotheses.

Agentic Language-to-Objective Synthesis for Optofluidic Assembly

cs.RO · 2026-05-26 · unverdicted · novelty 6.0

Speak-to-Objective is a modular agentic pipeline that translates spoken or written commands into fully differentiable objective functions for optofluidic microparticle assembly using LLMs, inverse solvers, and experimental platforms.

Ontology-constrained multi-LLM scoring of hypothesis support in the predictive processing literature

q-bio.NC · 2026-05-23 · unverdicted · novelty 6.0

A multi-LLM council scores predictive processing papers on an expert ontology, maps results in 3D hypothesis space, and introduces a dispersion metric showing greater spread in global versus local oddball paradigms.

From Meta Idea to Advanced Mathematical Discovery -- Human-AI Co-Discovery of Sign-Embedding Quantum Algorithms

cs.LG · 2026-06-12 · unverdicted · novelty 5.0

Human-AI collaboration expanded a meta-idea on rational approximation into sign-embedding quantum algorithms for matrix problems, with humans retaining final judgment on routes and refinements.

Cross-domain benchmarks reveal when coordinated AI agents improve scientific inference from partial evidence

cs.AI · 2026-05-21 · unverdicted · novelty 5.0

Coordinated AI agents improve scientific inference from partial evidence in cross-domain tasks when single sources are incomplete, as demonstrated by AUROC gains in vector-borne disease and exoplanet benchmarks but tied performance in others.

The Calibration Turn in AI-Assisted Research: A Conceptual and Methodological Framework for Evidence-Licensed Claims

cs.LG · 2026-06-30 · unverdicted · novelty 4.0

Develops a framework representing AI-assisted research via five operators and principles for evidence-licensed claims, distinguishing claim semantics and introducing epistemic debt.

Hephaestus: Toward a Cybersecurity AI Scientist

cs.CR · 2026-06-29 · unverdicted · novelty 4.0

The paper proposes the Cybersecurity AI Scientist as a modular multi-agent architecture for automating cybersecurity research, distinguished by its focus on non-stationary threats and anchored in a four-zeros risk-trust-incident-energy frame.

citing papers explorer

Showing 4 of 4 citing papers after filters.

Closed-loop Auto Research for Molecular Property Prediction: Discovering and Certifying Generalizable Improvements cs.AI · 2026-06-22 · unverdicted · none · ref 19
Closed-loop LM-agent auto research finds some transferable gains on molecular property prediction benchmarks via external data but shows non-transfer for model and feature edits selected on validation.
Deterministic Integrity Gates for LLM-Assisted Clinical Manuscript Preparation: An Auditable Biomedical Informatics Architecture cs.AI · 2026-06-08 · unverdicted · none · ref 18
Presents MedSci Skills, an open-source toolkit with deterministic integrity gates for verifying LLM-assisted clinical manuscripts against reporting guidelines like STARD, PRISMA, and STROBE.
DN-Hypo-Pipeline: An AI-Driven Workflow for Generating Hypotheses using Large Language Models and Scientific Explanations cs.AI · 2026-06-07 · unverdicted · none · ref 7
DN-Hypo-Pipeline operationalizes three philosophy-of-science accounts to direct LLMs toward principle-based hypothesis generation, claims superior performance over direct prompting, and derives two new transformer algorithms from the resulting hypotheses.
Cross-domain benchmarks reveal when coordinated AI agents improve scientific inference from partial evidence cs.AI · 2026-05-21 · unverdicted · none · ref 3
Coordinated AI agents improve scientific inference from partial evidence in cross-domain tasks when single sources are incomplete, as demonstrated by AUROC gains in vector-borne disease and exoplanet benchmarks but tied performance in others.

Nature , year =

hub tools

fields

years

verdicts

representative citing papers

citing papers explorer