arXiv.org

Microsoft Research AI4Science, Microsoft Azure Quantum · 2023 · arXiv 2311.07361

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

representative citing papers

Efficient numeracy in language models through single-token number embeddings

cs.LG · 2025-10-08 · unverdicted · novelty 7.0

BitTokens represent numbers as single tokens via IEEE 754 binary format, allowing small language models to learn basic arithmetic algorithms nearly perfectly.

Active-GRPO: Adaptive Imitation and Self-Improving Reasoning for Molecular Optimization

cs.LG · 2026-07-01 · unverdicted · novelty 6.0

Active-GRPO reaches 0.1773 average SRxSim on TOMG-Bench MOLOPT by adaptively switching between imitation and self-reinforcement while upgrading references, outperforming GRPO and RePO.

LLM-ACES: Closed-Loop Discovery of Dynamical Systems with LLM-Guided Adaptive Search

cs.LG · 2026-06-23 · unverdicted · novelty 6.0

LLM-ACES is a closed-loop method that combines LLM-proposed operator priors with disagreement-driven adaptive data acquisition to discover governing ODEs, reporting lowest median NMSE and 46-52% symbolic accuracy on 122 systems.

LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

LLM-AutoSciLab proposes an LLM-driven closed-loop system for hypothesis generation and adaptive experiment selection that reports higher accuracy and 2-5x better sample efficiency than baselines on new chemistry and gene-network discovery benchmarks.

No Test Cases, No Problem: Distillation-Driven Code Generation for Scientific Workflows

cs.SE · 2026-04-25 · unverdicted · novelty 6.0

MOSAIC generates executable scientific code without I/O test cases by combining student-teacher distillation with a consolidated context window to reduce hallucinations across subproblems.

MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding

cs.CL · 2025-10-09 · unverdicted · novelty 6.0

MOSAIC is a training-free multi-agent LLM framework with rationale, coding, reflection, and debugging agents plus a consolidated context window that outperforms prior methods on scientific coding benchmarks.

Prioritizing High-Consequence Biological Capabilities in Evaluations of Artificial Intelligence Models

cs.CY · 2024-05-25 · unverdicted · novelty 4.0

AI model evaluations for biological capabilities should prioritize high-consequence risks like pandemics, informed by life sciences dual-use experience, and occur prior to deployment to enable biosafety measures.

An Engineering Journey Training Large Language Models at Scale on Alps: The Apertus Experience

cs.DC · 2026-04-14 · unverdicted · novelty 3.0

Apertus, a 70B open multilingual foundation model, was pre-trained on the Alps supercomputer, with details on adapting HPC infrastructure into a resilient ML platform.

citing papers explorer

Showing 8 of 8 citing papers after filters.

Efficient numeracy in language models through single-token number embeddings cs.LG · 2025-10-08 · unverdicted · none · ref 3
BitTokens represent numbers as single tokens via IEEE 754 binary format, allowing small language models to learn basic arithmetic algorithms nearly perfectly.
Active-GRPO: Adaptive Imitation and Self-Improving Reasoning for Molecular Optimization cs.LG · 2026-07-01 · unverdicted · none · ref 1
Active-GRPO reaches 0.1773 average SRxSim on TOMG-Bench MOLOPT by adaptively switching between imitation and self-reinforcement while upgrading references, outperforming GRPO and RePO.
LLM-ACES: Closed-Loop Discovery of Dynamical Systems with LLM-Guided Adaptive Search cs.LG · 2026-06-23 · unverdicted · none · ref 3
LLM-ACES is a closed-loop method that combines LLM-proposed operator priors with disagreement-driven adaptive data acquisition to discover governing ODEs, reporting lowest median NMSE and 46-52% symbolic accuracy on 122 systems.
LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs cs.LG · 2026-05-21 · unverdicted · none · ref 4
LLM-AutoSciLab proposes an LLM-driven closed-loop system for hypothesis generation and adaptive experiment selection that reports higher accuracy and 2-5x better sample efficiency than baselines on new chemistry and gene-network discovery benchmarks.
No Test Cases, No Problem: Distillation-Driven Code Generation for Scientific Workflows cs.SE · 2026-04-25 · unverdicted · none · ref 11
MOSAIC generates executable scientific code without I/O test cases by combining student-teacher distillation with a consolidated context window to reduce hallucinations across subproblems.
MOSAIC: Multi-agent Orchestration for Task-Intelligent Scientific Coding cs.CL · 2025-10-09 · unverdicted · none · ref 16
MOSAIC is a training-free multi-agent LLM framework with rationale, coding, reflection, and debugging agents plus a consolidated context window that outperforms prior methods on scientific coding benchmarks.
Prioritizing High-Consequence Biological Capabilities in Evaluations of Artificial Intelligence Models cs.CY · 2024-05-25 · unverdicted · none · ref 23
AI model evaluations for biological capabilities should prioritize high-consequence risks like pandemics, informed by life sciences dual-use experience, and occur prior to deployment to enable biosafety measures.
An Engineering Journey Training Large Language Models at Scale on Alps: The Apertus Experience cs.DC · 2026-04-14 · unverdicted · none · ref 4
Apertus, a 70B open multilingual foundation model, was pre-trained on the Alps supercomputer, with details on adapting HPC infrastructure into a resilient ML platform.

arXiv.org

fields

years

verdicts

representative citing papers

citing papers explorer