Russo Daniel, Van Roy

Daniel Russo, Benjamin Van Roy, Abbas Kazerouni, Ian Osband, Zheng Wen · 2018 · DOI 10.1561/2200000070

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Regime-Conditioned Evaluation in Multi-Context Bayesian Optimization

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

The Portable Regime Score PRS=(B/|A|)(1-rho) captures and predicts acquisition function performance reversals in transfer Bayesian optimization, enabling a RegimePlanner that adapts and beats fixed baselines.

APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents

cs.LG · 2026-05-20 · unverdicted · novelty 5.0

APEX maintains an explicit strategy space via a DAG with fork discovery and policy selection to sustain exploration in self-evolving LLM agents and reports outperformance on Jericho games and WebArena.

Closing the Loop: A Software Framework for AI to Support Business Decision Making

cs.SE · 2026-04-27 · unverdicted · novelty 3.0

A software framework integrates heterogeneous causal inference, policy learning, mediation, forecasts, variance reduction, and anytime-valid inference into one AI-orchestratable interface for business experimentation.

citing papers explorer

Showing 3 of 3 citing papers.

Regime-Conditioned Evaluation in Multi-Context Bayesian Optimization cs.LG · 2026-05-06 · unverdicted · none · ref 41
The Portable Regime Score PRS=(B/|A|)(1-rho) captures and predicts acquisition function performance reversals in transfer Bayesian optimization, enabling a RegimePlanner that adapts and beats fixed baselines.
APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents cs.LG · 2026-05-20 · unverdicted · none · ref 22
APEX maintains an explicit strategy space via a DAG with fork discovery and policy selection to sustain exploration in self-evolving LLM agents and reports outperformance on Jericho games and WebArena.
Closing the Loop: A Software Framework for AI to Support Business Decision Making cs.SE · 2026-04-27 · unverdicted · none · ref 8
A software framework integrates heterogeneous causal inference, policy learning, mediation, forecasts, variance reduction, and anytime-valid inference into one AI-orchestratable interface for business experimentation.

Russo Daniel, Van Roy

fields

years

verdicts

representative citing papers

citing papers explorer