BitTokens represent numbers as single tokens via IEEE 754 binary format, allowing small language models to learn basic arithmetic algorithms nearly perfectly.
arXiv.org
6 Pith papers cite this work. Polarity classification is still indexing.
verdicts
UNVERDICTED 6representative citing papers
LLM-AutoSciLab proposes an LLM-driven closed-loop system for hypothesis generation and adaptive experiment selection that reports higher accuracy and 2-5x better sample efficiency than baselines on new chemistry and gene-network discovery benchmarks.
MOSAIC generates executable scientific code without I/O test cases by combining student-teacher distillation with a consolidated context window to reduce hallucinations across subproblems.
MOSAIC is a training-free multi-agent LLM framework with rationale, coding, reflection, and debugging agents plus a consolidated context window that outperforms prior methods on scientific coding benchmarks.
AI model evaluations for biological capabilities should prioritize high-consequence risks like pandemics, informed by life sciences dual-use experience, and occur prior to deployment to enable biosafety measures.
Apertus, a 70B open multilingual foundation model, was pre-trained on the Alps supercomputer, with details on adapting HPC infrastructure into a resilient ML platform.
citing papers explorer
-
LLM-AutoSciLab: Closed-Loop Scientific Discovery via Active Experimentation with LLMs
LLM-AutoSciLab proposes an LLM-driven closed-loop system for hypothesis generation and adaptive experiment selection that reports higher accuracy and 2-5x better sample efficiency than baselines on new chemistry and gene-network discovery benchmarks.
-
No Test Cases, No Problem: Distillation-Driven Code Generation for Scientific Workflows
MOSAIC generates executable scientific code without I/O test cases by combining student-teacher distillation with a consolidated context window to reduce hallucinations across subproblems.
-
An Engineering Journey Training Large Language Models at Scale on Alps: The Apertus Experience
Apertus, a 70B open multilingual foundation model, was pre-trained on the Alps supercomputer, with details on adapting HPC infrastructure into a resilient ML platform.