GUIDE integrates a Decision Transformer for joint modeling of bidding actions and states with Q-value regularization for exploration and an IDM for safe policy fallback, outperforming baselines in simulations and real Taobao deployment with gains in GMV, clicks, cost, and ROI.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
TRACE models post-click feedback as trajectories to dynamically refine conversion posteriors and uses a reliability-gated retrospective completer to guide early incomplete samples, outperforming prior delay modeling and reweighting approaches.
citing papers explorer
-
Generative Auto-Bidding with Unified Modeling and Exploration
GUIDE integrates a Decision Transformer for joint modeling of bidding actions and states with Q-value regularization for exploration and an IDM for safe policy fallback, outperforming baselines in simulations and real Taobao deployment with gains in GMV, clicks, cost, and ROI.
-
Follow the TRACE: Exploiting Post-Click Trajectories for Online Delayed Conversion Rate Prediction
TRACE models post-click feedback as trajectories to dynamically refine conversion posteriors and uses a reliability-gated retrospective completer to guide early incomplete samples, outperforming prior delay modeling and reweighting approaches.