Follow the TRACE: Exploiting Post-Click Trajectories for Online Delayed Conversion Rate Prediction
Pith reviewed 2026-05-08 08:34 UTC · model grok-4.3
The pith
Post-click trajectories allow refining conversion probabilities dynamically without awaiting delayed outcomes.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By formalizing the evolution of post-click behaviors as feedback trajectories, the method evaluates the alignment of accumulated feedback with conversion versus non-conversion, thereby dynamically refining posteriors. A reliability-gated retrospective completer leverages full-lifecycle data to supply adaptive guidance for unrevealed samples, counteracting early-stage sparsity.
What carries the argument
The feedback trajectory, which represents the accumulated post-click feedback status over time and serves to align partial observations with eventual conversion outcomes.
If this is right
- Better balance between data freshness and label accuracy in online conversion rate systems.
- Dynamic posterior refinement without waiting for final outcomes.
- The retrospective completer enhances existing prediction models in a model-agnostic way.
- Superior performance over state-of-the-art baselines in experiments.
Where Pith is reading between the lines
- Similar trajectory modeling could apply to other online learning tasks with delayed rewards, such as reinforcement learning in recommendation settings.
- Reducing reliance on long waiting periods might enable faster iteration in production recommendation engines.
- Investigating the impact on user privacy or data collection requirements could be a next step if trajectories replace some full observations.
Load-bearing premise
Partial post-click trajectories contain sufficient reliable signal to align with eventual conversion outcomes without introducing bias when using full-lifecycle data for guidance.
What would settle it
A test on a dataset where the alignment scores from trajectories do not correlate with actual conversion rates, or where adding the retrospective completer does not improve or worsens prediction metrics compared to baselines.
Figures
read the original abstract
Delayed feedback poses a core challenge for online CVR prediction, forcing a trade-off between label accuracy and data freshness. Existing methods address this through delay modeling or sample reweighting, yet neglect how post-click behaviors evolve over the observation period. To overcome this limitation, we formalize this evolution as feedback trajectory and propose TRACE. Instead of forcing hard labels on unrevealed samples, our method evaluates how well the accumulated feedback status aligns with conversion versus non-conversion, dynamically refining posteriors without waiting for final outcomes. To counteract early-stage trajectory sparsity, we further design a reliability-gated retrospective completer that leverages full-lifecycle data to provide adaptive posterior guidance for unrevealed samples. Extensive experiments validate TRACE's superiority over state-of-the-art baselines and confirm the retrospective completion module as a model-agnostic enhancer for existing systems. Our code is available at https://github.com/LunaZhangxy/TRACE.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes TRACE for online delayed conversion rate (CVR) prediction. It formalizes the evolution of post-click behaviors as feedback trajectories, evaluates the alignment of accumulated partial feedback status with conversion versus non-conversion to dynamically refine posteriors, and introduces a reliability-gated retrospective completer that leverages full-lifecycle data to supply adaptive guidance for unrevealed samples. The approach is positioned as model-agnostic and is claimed to outperform state-of-the-art baselines in extensive experiments.
Significance. If the claims hold without leakage or bias artifacts, TRACE could meaningfully advance delayed-feedback handling in online advertising and recommendation by moving beyond delay modeling or reweighting to exploit trajectory alignment signals. The model-agnostic enhancer property would be a practical strength, enabling plug-in improvements to existing CVR systems. The low soundness rating in the reader's assessment, however, indicates that experimental controls and ablation details are essential to establish whether gains reflect genuine trajectory information.
major comments (2)
- [Abstract] Abstract: The reliability-gated retrospective completer 'leverages full-lifecycle data to provide adaptive posterior guidance for unrevealed samples.' Because complete trajectories are unavailable at online decision time, this risks label leakage or distribution shift; any reported gains may not stem from partial-trajectory alignment but from implicit access to future outcomes. This directly threatens the central 'model-agnostic enhancer' claim and the assertion that partial trajectories reliably align with eventual outcomes.
- [Method / Experiments] Method / Experiments: The paper asserts that the completer supplies 'adaptive guidance' without waiting for final outcomes, yet the skeptic concern about systematic bias from full-lifecycle conditioning is not addressed by a concrete test (e.g., an ablation that trains the completer only on data observable at decision time). Without such a control, the superiority over baselines cannot be attributed to the proposed trajectory modeling.
minor comments (1)
- [Abstract] The abstract introduces 'feedback trajectory' and 'reliability-gated retrospective completer' without a concise formal definition or notation; adding a short mathematical sketch would improve immediate clarity.
Simulated Author's Rebuttal
We thank the referee for the thoughtful comments, which highlight important considerations for the soundness of our claims regarding the retrospective completer. We address each major point below with clarifications on the design and commitments to additional experiments.
read point-by-point responses
-
Referee: [Abstract] Abstract: The reliability-gated retrospective completer 'leverages full-lifecycle data to provide adaptive posterior guidance for unrevealed samples.' Because complete trajectories are unavailable at online decision time, this risks label leakage or distribution shift; any reported gains may not stem from partial-trajectory alignment but from implicit access to future outcomes. This directly threatens the central 'model-agnostic enhancer' claim and the assertion that partial trajectories reliably align with eventual outcomes.
Authors: We agree that the abstract phrasing risks misinterpretation. The retrospective completer is trained exclusively on historical full-lifecycle data to learn the relationship between partial trajectories and eventual conversion outcomes. At online inference time, it receives only the partial trajectory observed up to the decision point and outputs guidance derived from those learned patterns, with no access to future labels or outcomes. This is analogous to training delay models on past data while deploying them on current partial observations. We will revise the abstract to explicitly distinguish the offline training phase from the online inference phase, thereby reinforcing that no leakage occurs and that the model-agnostic enhancer property holds under standard online constraints. revision: yes
-
Referee: [Method / Experiments] Method / Experiments: The paper asserts that the completer supplies 'adaptive guidance' without waiting for final outcomes, yet the skeptic concern about systematic bias from full-lifecycle conditioning is not addressed by a concrete test (e.g., an ablation that trains the completer only on data observable at decision time). Without such a control, the superiority over baselines cannot be attributed to the proposed trajectory modeling.
Authors: We concur that an ablation isolating the completer's training data to only what is observable at decision time would provide stronger evidence. We will add this experiment to the revised manuscript: a variant of TRACE in which the completer is trained solely on partial trajectories available at the decision timestamp, compared against the full version. This control will allow us to quantify any contribution from full-lifecycle conditioning versus the trajectory alignment signals, directly addressing whether the reported gains can be attributed to the proposed method. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The paper introduces novel modeling constructs including the formalization of post-click behaviors as feedback trajectories and the reliability-gated retrospective completer that leverages full-lifecycle data for adaptive guidance. These elements are presented as original contributions that dynamically refine posteriors and enhance existing systems without reducing by construction to fitted parameters, self-defined quantities, or load-bearing self-citations. No equations or uniqueness theorems from prior author work are invoked in the abstract or description to force the central claims; the derivation remains self-contained with independent content. The skeptic concern regarding potential bias from full-lifecycle data pertains to empirical validity rather than circular reduction of the claimed derivation.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Partial post-click feedback trajectories contain alignment signals with eventual conversion outcomes that can be used to refine posteriors before final labels arrive
invented entities (2)
-
feedback trajectory
no independent evidence
-
reliability-gated retrospective completer
no independent evidence
Reference graph
Works this paper leans on
-
[1]
Ashish Agarwal, Kartik Hosanagar, and Michael D Smith. 2011. Location, location, location: An analysis of profitability of position in online advertising markets. Journal of marketing research48, 6 (2011), 1057–1073
work page 2011
-
[2]
Deepak Agarwal, Rahul Agrawal, Rajiv Khanna, and Nagaraj Kota. 2010. Esti- mating rates of rare events with multiple hierarchies through scalable log-linear models. InProceedings of the 16th ACM SIGKDD international conference on Knowl- edge discovery and data mining. 213–222
work page 2010
-
[3]
Abraham Bagherjeiran, Andrew O Hatch, and Adwait Ratnaparkhi. 2010. Ranking for the conversion funnel. InProceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval. 146–153
work page 2010
-
[4]
Olivier Chapelle. 2014. Modeling delayed feedback in display advertising. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. 1097–1105
work page 2014
-
[5]
Yu Chen, Jiaqi Jin, Hui Zhao, Pengjie Wang, Guojun Liu, Jian Xu, and Bo Zheng
-
[6]
InProceedings of the ACM Web Conference 2022
Asymptotically unbiased estimation for delayed feedback modeling via label correction. InProceedings of the ACM Web Conference 2022. 369–379
work page 2022
-
[7]
Siyu Gu, Xiang-Rong Sheng, Ying Fan, Guorui Zhou, and Xiaoqiang Zhu. 2021. Real negatives matter: Continuous training with real negatives for delayed feed- back modeling. InProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2890–2898
work page 2021
-
[8]
Chuan Guo, Geoff Pleiss, Yu Sun, and Kilian Q Weinberger. 2017. On calibration of modern neural networks. InInternational conference on machine learning. PMLR, 1321–1330
work page 2017
-
[9]
Yuyao Guo, Xiang Ao, Qiming Liu, and Qing He. 2023. Leveraging post-click user behaviors for calibrated conversion rate prediction under delayed feedback in online advertising. InProceedings of the 32nd ACM International Conference on Information and Knowledge Management. 3918–3922
work page 2023
-
[10]
Yuyao Guo, Haoming Li, Xiang Ao, Min Lu, Dapeng Liu, Lei Xiao, Jie Jiang, and Qing He. 2022. Calibrated Conversion Rate Prediction via Knowledge Distillation under Delayed Feedback in Online Advertising. InProceedings of the 31st ACM International Conference on Information & Knowledge Management. 3983–3987
work page 2022
-
[11]
Adam: A Method for Stochastic Optimization
Diederik P. Kingma and Jimmy Ba. 2014. Adam: A Method for Stochastic Opti- mization. arXiv:1412.6980 [cs.LG] https://arxiv.org/abs/1412.6980
work page internal anchor Pith review arXiv 2014
-
[12]
Sofia Ira Ktena, Alykhan Tejani, Lucas Theis, Pranay Kumar Myana, Deepak Dilipkumar, Ferenc Huszár, Steven Yoo, and Wenzhe Shi. 2019. Addressing delayed feedback for continuous training with neural networks in CTR prediction. InProceedings of the 13th ACM conference on recommender systems. 187–195
work page 2019
-
[13]
Kuang-chih Lee, Burkay Orten, Ali Dasdan, and Wentong Li. 2012. Estimating conversion rate in display advertising from past performance data. InProceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining. 768–776
work page 2012
-
[14]
Haoming Li, Feiyang Pan, Xiang Ao, Zhao Yang, Min Lu, Junwei Pan, Dapeng Liu, Lei Xiao, and Qing He. 2021. Follow the prophet: Accurate online conversion rate prediction in the face of delayed feedback. InProceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1915–1919
work page 2021
-
[15]
Bang Liu, Hanlin Zhang, Linglong Kong, and Di Niu. 2021. Factorizing historical user actions for next-day purchase prediction.ACM Transactions on the Web (TWEB)16, 1 (2021), 1–26
work page 2021
-
[16]
Qiming Liu, Xiang Ao, Yuyao Guo, and Qing He. 2024. Online conversion rate prediction via multi-interval screening and synthesizing under delayed feedback. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 38. 8796–8804
work page 2024
-
[17]
Qiming Liu, Haoming Li, Xiang Ao, Yuyao Guo, Zhihong Dong, Ruobing Zhang, Qiong Chen, Jianfeng Tong, and Qing He. 2023. Online Conversion Rate Predic- tion via Neural Satellite Networks in Delayed Feedback Advertising. InProceed- ings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1406–1415
work page 2023
-
[18]
Quan Lu, Shengjun Pan, Liang Wang, Junwei Pan, Fengdan Wan, and Hongxia Yang. 2017. A practical framework of conversion rate prediction for online display advertising. InProceedings of the ADKDD’17. 1–9
work page 2017
-
[19]
Vinod Nair and Geoffrey E Hinton. 2010. Rectified linear units improve re- stricted boltzmann machines. InProceedings of the 27th international conference on machine learning (ICML-10). 807–814
work page 2010
-
[20]
Feiyang Pan, Xiang Ao, Pingzhong Tang, Min Lu, Dapeng Liu, Lei Xiao, and Qing He. 2020. Field-aware calibration: a simple and empirically strong method for reliable probabilistic predictions. InProceedings of The Web Conference 2020. 729–739
work page 2020
-
[21]
Junwei Pan, Yizhi Mao, Alfonso Lobos Ruiz, Yu Sun, and Aaron Flores. 2019. Predicting different types of conversions with multi-task learning in online advertising. InProceedings of the 25th acm sigkdd international conference on knowledge discovery & data mining. 2689–2697
work page 2019
-
[22]
Hao Wang, Tai-Wei Chang, Tianqiao Liu, Jianmin Huang, Zhichao Chen, Chao Yu, Ruopeng Li, and Wei Chu. 2022. ESCM2: Entire space counterfactual multi- task model for post-click conversion rate estimation. InProceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 363–372
work page 2022
-
[23]
Yifan Wang, Peijie Sun, Min Zhang, Qinglin Jia, Jingjie Li, and Shaoping Ma
-
[24]
InProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
Unbiased delayed feedback label correction for conversion rate prediction. InProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2456–2466
- [25]
-
[26]
Hong Wen, Jing Zhang, Yuan Wang, Fuyu Lv, Wentian Bao, Quan Lin, and Keping Yang. 2020. Entire space multi-task modeling via post-click behavior decomposi- tion for conversion rate prediction. InProceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval. 2377–2386
work page 2020
-
[27]
Jiaqi Yang and De-Chuan Zhan. 2022. Generalized delayed feedback model with post-click information in recommender systems.Advances in Neural Information Processing Systems35 (2022), 26192–26203
work page 2022
-
[28]
Jia-Qi Yang, Xiang Li, Shuguang Han, Tao Zhuang, De-Chuan Zhan, Xiaoyi Zeng, and Bin Tong. 2021. Capturing delayed feedback in conversion rate prediction via elapsed-time sampling. InProceedings of the AAAI Conference on Artificial Intelligence, Vol. 35. 4582–4589
work page 2021
-
[29]
Shota Yasui, Gota Morishita, Fujita Komei, and Masashi Shibata. 2020. A feed- back shift correction in predicting conversion rates under delayed feedback. In Proceedings of The Web Conference 2020. 2740–2746
work page 2020
- [30]
-
[31]
Weinan Zhang, Shuai Yuan, and Jun Wang. 2014. Optimal real-time bidding for display advertising. InProceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining. 1077–1086
work page 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.