← back to paper
arxiv: 2605.11347 · 2 revisions
Gradient-Free Noise Optimization for Reward Alignment in Generative Models