The Portable Regime Score PRS=(B/|A|)(1-rho) captures and predicts acquisition function performance reversals in transfer Bayesian optimization, enabling a RegimePlanner that adapts and beats fixed baselines.
Russo Daniel, Van Roy
3 Pith papers cite this work. Polarity classification is still indexing.
years
2026 3verdicts
UNVERDICTED 3representative citing papers
APEX maintains an explicit strategy space via a DAG with fork discovery and policy selection to sustain exploration in self-evolving LLM agents and reports outperformance on Jericho games and WebArena.
A software framework integrates heterogeneous causal inference, policy learning, mediation, forecasts, variance reduction, and anytime-valid inference into one AI-orchestratable interface for business experimentation.
citing papers explorer
-
Regime-Conditioned Evaluation in Multi-Context Bayesian Optimization
The Portable Regime Score PRS=(B/|A|)(1-rho) captures and predicts acquisition function performance reversals in transfer Bayesian optimization, enabling a RegimePlanner that adapts and beats fixed baselines.
-
APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents
APEX maintains an explicit strategy space via a DAG with fork discovery and policy selection to sustain exploration in self-evolving LLM agents and reports outperformance on Jericho games and WebArena.
-
Closing the Loop: A Software Framework for AI to Support Business Decision Making
A software framework integrates heterogeneous causal inference, policy learning, mediation, forecasts, variance reduction, and anytime-valid inference into one AI-orchestratable interface for business experimentation.