← back to paper
arxiv: 2602.05993 · 2 revisions
Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps