When to Trust Your Model: Model-Based Policy Optimization , url =

Janner, Michael, Fu, Justin, Zhang, Marvin, Levine, Sergey , booktitle =

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Plan Before You Trade: Inference-Time Optimization for RL Trading Agents

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

FPILOT optimizes pre-trained RL trading policies at inference time using forecasted price trajectories to improve portfolio allocations and risk-adjusted returns on the DJ30 benchmark.

Is Conditional Generative Modeling all you need for Decision-Making?

cs.LG · 2022-11-28 · unverdicted · novelty 6.0

Return-conditional diffusion models for policies outperform offline RL on benchmarks by circumventing dynamic programming and enable constraint or skill composition.

citing papers explorer

Showing 2 of 2 citing papers.

Plan Before You Trade: Inference-Time Optimization for RL Trading Agents cs.LG · 2026-05-12 · unverdicted · none · ref 25
FPILOT optimizes pre-trained RL trading policies at inference time using forecasted price trajectories to improve portfolio allocations and risk-adjusted returns on the DJ30 benchmark.
Is Conditional Generative Modeling all you need for Decision-Making? cs.LG · 2022-11-28 · unverdicted · none · ref 223
Return-conditional diffusion models for policies outperform offline RL on benchmarks by circumventing dynamic programming and enable constraint or skill composition.

When to Trust Your Model: Model-Based Policy Optimization , url =

fields

years

verdicts

representative citing papers

citing papers explorer