scBench-Long is a benchmark with 21 evaluations where the strongest AI model-harness pair succeeds on 25.4% of long-horizon single-cell biology tasks.
M., Mathur, M., Soderberg, C
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Modifies Gibbs sampler for GP state-space models, introduces CFA measurement structure, and validates software via simulation-based calibration to enable reliable learning of nonlinear latent dynamics.
citing papers explorer
-
scBench-Long: Verifiable Benchmarking of Long-Horizon Single-Cell Biology
scBench-Long is a benchmark with 21 evaluations where the strongest AI model-harness pair succeeds on 25.4% of long-horizon single-cell biology tasks.
-
Learning Nonlinear Dynamics: Improving the Estimation Efficiency and Reliability of Gaussian Process State-Space Models
Modifies Gibbs sampler for GP state-space models, introduces CFA measurement structure, and validates software via simulation-based calibration to enable reliable learning of nonlinear latent dynamics.