The Secretary Problem with a Stochastic Precursor
Pith reviewed 2026-05-22 04:18 UTC · model grok-4.3
The pith
A content-free signal timed to arrive before the best candidate raises secretary problem success to at least one half.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A stochastic precursor that arrives no later than the best item changes the structure of optimal stopping. In random order a uniform precursor already yields success probability at least 1/2 and later precursors drive the probability to 1; in adversarial order sufficiently concentrated precursors recover constant-factor guarantees. These results follow from characterizing the optimal policies that use the precursor's observed arrival as a reference point for deciding whether to stop on later candidates.
What carries the argument
The stochastic precursor: a content-free signal whose arrival time is random but guaranteed to precede or coincide with the best item, used as a timing reference that restructures the stopping threshold.
If this is right
- Random-order success exceeds the classic 1/e benchmark even with minimal timing information.
- Success probability increases monotonically toward 1 as the precursor distribution shifts later.
- Adversarial-order instances regain constant success ratios once the precursor is concentrated enough.
- Optimal policies can be computed by conditioning thresholds on whether the precursor has been observed.
Where Pith is reading between the lines
- Similar timing signals could be added to other online selection or allocation problems to obtain improved guarantees without value predictions.
- The separation between content and timing suggests that learning-augmented algorithms might benefit from studying arrival-time distributions separately from prediction accuracy.
- Empirical tests on real arrival-time data from hiring or bidding platforms would check whether the modeled distributions match observed precursor behavior.
Load-bearing premise
The precursor is guaranteed to arrive no later than the best item.
What would settle it
A sequence in which the precursor arrives after the best item, or a distribution that places positive probability after the best item, would fall outside the model and could reduce success below the stated bounds.
Figures
read the original abstract
In learning-augmented online algorithms, predictions are usually valued for what they say: a value estimate, a solution, or an algorithmic recommendation. This paper shows that predictions can also be valuable solely due to their arrival time. We study the fundamental secretary problem augmented with a stochastic precursor: a content-free signal that is guaranteed to arrive no later than the best item, but is otherwise stochastically timed. The signal does not carry any additional information; nevertheless, its timing alone changes the structure of optimal stopping. We characterize optimal policies in the random-order and adversarial-order models. In random order, a single uniformly timed precursor already gives success probability at least $\frac12$, improving on the classic $\frac1e$ benchmark. With increasingly late precursors, the success probability approaches $1$. In adversarial order, for which traditional models do not admit strong guarantees, sufficiently concentrated precursors recover constant success guarantees. Our results show that such novel forms of asynchronous temporal information are a distinct and powerful form of advice in online decision making and may also be effective for other problems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper augments the classic secretary problem with a stochastic precursor: a content-free signal guaranteed to arrive no later than the best item but otherwise stochastically timed. It characterizes optimal stopping policies for both the random-order and adversarial-order models. In random order, a single uniformly timed precursor yields success probability at least 1/2 (improving on the classic 1/e), with later precursors approaching probability 1; in adversarial order, sufficiently concentrated precursors recover constant success guarantees.
Significance. If the characterizations and probability bounds hold, the work establishes that arrival timing alone constitutes a distinct and powerful form of advice in online algorithms, independent of any value or rank information carried by the signal. This provides a clean separation between temporal and informational aspects of predictions and extends constant-factor guarantees to the adversarial secretary setting, where traditional models admit none. The results are parameter-free in their core claims and offer falsifiable predictions for other online stopping problems.
major comments (2)
- [§3.2, Theorem 2] §3.2, Theorem 2: The lower bound of 1/2 for the uniform precursor in random order is derived by comparing the precursor arrival to the classic 1/e threshold; however, the argument appears to condition on the precursor occurring before the optimal stopping time without explicitly handling the measure-zero cases where the precursor coincides with the last item, which could affect the exact constant.
- [§5.1, Eq. (8)] §5.1, Eq. (8): The adversarial-order analysis claims constant success probability for concentrated precursors via a reduction to a two-phase stopping rule, but the concentration parameter is left as a free variable; the paper should state the explicit dependence of the constant on the concentration width to make the guarantee fully quantitative.
minor comments (3)
- [§2] The notation for the precursor arrival distribution (e.g., uniform vs. beta) is introduced in §2 but reused without redefinition in later sections; a single consolidated definition table would improve readability.
- [Figure 2] Figure 2 (random-order success curves) lacks error bars or sample-size information for the Monte-Carlo validation points; adding these would strengthen the empirical support for the analytic curves.
- [§1.2] The related-work discussion cites the classic secretary literature but omits recent learning-augmented secretary results (e.g., on value predictions); a brief comparison paragraph would clarify the novelty of the timing-only model.
Simulated Author's Rebuttal
We thank the referee for the positive evaluation and constructive comments on our work. We address each major comment below.
read point-by-point responses
-
Referee: [§3.2, Theorem 2] §3.2, Theorem 2: The lower bound of 1/2 for the uniform precursor in random order is derived by comparing the precursor arrival to the classic 1/e threshold; however, the argument appears to condition on the precursor occurring before the optimal stopping time without explicitly handling the measure-zero cases where the precursor coincides with the last item, which could affect the exact constant.
Authors: The lower bound in Theorem 2 is obtained by comparing the uniform precursor arrival time against the classic 1/e threshold. Because arrival times are drawn from a continuous distribution, the probability that the precursor arrives at exactly the same instant as the final item is zero. This null event therefore contributes nothing to the success probability and leaves the constant 1/2 unchanged. The conditioning step is valid almost surely. We do not plan to alter the proof but can insert a one-sentence clarification on the measure-zero event if the referee considers it helpful. revision: no
-
Referee: [§5.1, Eq. (8)] §5.1, Eq. (8): The adversarial-order analysis claims constant success probability for concentrated precursors via a reduction to a two-phase stopping rule, but the concentration parameter is left as a free variable; the paper should state the explicit dependence of the constant on the concentration width to make the guarantee fully quantitative.
Authors: We agree that an explicit functional dependence would make the guarantee fully quantitative. In the revised manuscript we will replace the current statement of Equation (8) with an explicit expression that shows how the success probability depends on the concentration width parameter, and we will add a short paragraph deriving the dependence from the two-phase analysis. revision: yes
Circularity Check
No circularity: results follow from direct model analysis
full rationale
The paper defines a stochastic precursor model (guaranteed arrival no later than the best item, otherwise random timing, content-free) and derives optimal stopping policies plus success probabilities (≥1/2 in random order, approaching 1 for late precursors, constant in adversarial order for concentrated precursors) via explicit characterization of the random-order and adversarial-order settings. No equations reduce a claimed prediction to a fitted input by construction, no self-citations are invoked as load-bearing uniqueness theorems, and no ansatz or renaming is presented as a derivation. The central claims are obtained by analyzing the augmented stopping problem from first principles on the stated distributions.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The precursor arrives no later than the best item but is otherwise stochastically timed and carries no value information.
invented entities (1)
-
stochastic precursor
no independent evidence
Reference graph
Works this paper leans on
- [1]
-
[2]
J. Adams. Early citation counts correlate with accumulated impact. In:Scientometrics 63 (2005), pp. 567–581
work page 2005
-
[3]
A. Agarwal and E. Balkanski. Learning-Augmented Dynamic Submodular Maximization. In:NeurIPS. 2024
work page 2024
-
[4]
S. Angelopoulos, M. Bienkowski, C. Dürr, and B. Simon. Contract Scheduling with Distributional and Multiple Advice. In:IJCAI. ijcai.org, 2024, pp. 3652–3660
work page 2024
-
[5]
A. Antoniadis, C. Coester, M. Eliás, A. Polak, and B. Simon. Online Metric Algorithms with Untrusted Predictions. In:ACM Trans. Algorithms19.2 (2023), 19:1–19:34
work page 2023
-
[6]
A. Antoniadis, T. Gouleakis, P. Kleer, and P. Kolev. Secretary and online matching problems with machine learned advice. In:Discret. Optim.48.Part 2 (2023), p. 100778
work page 2023
-
[7]
P. D. Azar, R. Kleinberg, and S. M. Weinberg. Prior independent mechanisms via prophet inequalities with limited information. In:Games Econ. Behav.118 (2019), pp. 511–532
work page 2019
-
[8]
Y. Azar, S. Leonardi, and N. Touitou. Flow time scheduling with uncertain processing time. In:STOC. ACM, 2021, pp. 1070–1080
work page 2021
-
[9]
E. Balkanski, W. Ma, and A. Maggiori. Fair Secretaries with Unfair Predictions. In: NeurIPS. 2024
work page 2024
- [10]
-
[11]
Z. Benomar, R. Cosson, A. Lindermayr, and J. Schlöter. Non-Clairvoyant Scheduling with Progress Bars. In:NeurIPS. 2025
work page 2025
-
[12]
H. Beyhaghi and L. Cai. Recent Developments in Pandora’s Box Problem: Variants and Applications. In:SIGecom Exch.21.1 (2023), pp. 20–34
work page 2023
-
[13]
A. Braun and S. Sarkar. The Secretary Problem with Predicted Additive Gap. In:NeurIPS. 2024
work page 2024
-
[14]
D. Choo, T. Gouleakis, C. K. Ling, and A. Bhattacharyya. Online bipartite matching with imperfect advice. In:ICML. 2024
work page 2024
- [15]
- [16]
- [17]
- [18]
- [19]
-
[20]
I. Diakonikolas, V. Kontonis, C. Tzamos, A. Vakilian, and N. Zarifis. Learning Online Algorithms with Distributional Advice. In:ICML. Vol. 139. PMLR, 2021, pp. 2687–2696
work page 2021
- [21]
-
[22]
P. Dütting, S. Lattanzi, R. P. Leme, and S. Vassilvitskii. Secretaries with Advice. In: Math. Oper. Res.49.2 (2024), pp. 856–879
work page 2024
-
[23]
E. B. Dynkin. The optimum choice of the instant for stopping a Markov process. In: Soviet Mathematics4 (1963), pp. 627–629
work page 1963
-
[24]
M. Dziallas and K. Blind. Innovation indicators throughout the innovation process: An extensive literature analysis. In:Technovation80-81 (2019), pp. 3–29
work page 2019
- [25]
-
[26]
H. Esfandiari, M. Hajiaghayi, B. Lucier, and M. Mitzenmacher. Prophets, Secretaries, and Maximizing the Probability of Choosing the Best. In:AISTATS. PMLR, 2020, pp. 3717– 3727
work page 2020
-
[27]
T. S. Ferguson. Who Solved the Secretary Problem. In:Stat. Sci.4 (1989), pp. 282–289
work page 1989
-
[28]
K. Fujii and Y. Yoshida. The Secretary Problem with Predictions. In:Math. Oper. Res. 49.2 (2024), pp. 1241–1262
work page 2024
-
[29]
J. P. Gilbert and F. Mosteller. Recognizing the Maximum of a Sequence. In:J. Amer. Statistical Assoc.61 (1966), pp. 35–73
work page 1966
- [30]
-
[31]
T. P. Hill and R. P. Kertz. A Survey of Prophet Inequalities in Optimal Stopping Theory. In:Contemporary mathematics125 (1992), pp. 191–207
work page 1992
-
[32]
Y. Hu, C. Hu, S. Fu, M. Fang, and W. Xu. Predicting Key Events in the Popularity Evolution of Online Information. In:PLoS ONE12 (2017)
work page 2017
- [33]
-
[34]
P. Joulani, A. Gyorgy, and C. Szepesvari. Online Learning under Delayed Feedback. In: ICML. PMLR, 2013, pp. 1453–1461
work page 2013
- [35]
-
[36]
H. Karisani, M. Daneshvaramoli, H. Beyhaghi, M. Hajiesmaili, and C. Musco. The Secretary Problem with Predictions and a Chosen Order. In:ITCS. Schloss Dagstuhl– Leibniz-Zentrum für Informatik, 2026, pp. 86–1
work page 2026
- [37]
-
[38]
S. Lattanzi, T. Lavastida, B. Moseley, and S. Vassilvitskii. Online Scheduling via Learned Weights. In:SODA. SIAM, 2020, pp. 1859–1877
work page 2020
-
[39]
A. Lindermayr and N. Megow. Permutation Predictions for Non-Clairvoyant Scheduling. In:ACM Trans. Parallel Comput.12.2 (2025), 4:1–4:26
work page 2025
-
[40]
A. Lindermayr and N. Megow.Repository of papers on algorithms with predictions.http: //algorithms-with-predictions.github.io/. 2026. 12
work page 2026
-
[41]
D. V. Lindley. Dynamic Programming and Decision Theory. In:J. Roy. Stat. Soc. C-app. 10 (1961), pp. 39–51
work page 1961
-
[42]
B. Lucier. An economic view of prophet inequalities. In:SIGecom Exch.16.1 (2017), pp. 24–47
work page 2017
-
[43]
T. Lykouris and S. Vassilvitskii. Competitive Caching with Machine Learned Advice. In: J. ACM68.4 (2021), 24:1–24:25
work page 2021
-
[44]
M. Mitzenmacher and S. Vassilvitskii. Algorithms with predictions. In:Commun. ACM 65.7 (2022), pp. 33–35
work page 2022
-
[45]
B. Moseley, H. Newman, K. Pruhs, and R. Zhou. Robust Gittins for Stochastic Scheduling. In:SIGMETRICS (Abstracts). ACM, 2025, pp. 166–168
work page 2025
-
[46]
H. Nourmohammadi, Y. Cao, B. Sun, and X. Tan. Ordinal Secretaries with Advice. In: AAAI. Vol. 40. 43. 2026, pp. 37108–37116
work page 2026
-
[47]
P. Nuti. The Secretary Problem with Distributions. In:IPCO. Springer, 2022, pp. 429–439
work page 2022
-
[48]
Y. Qian, Z. Zhang, P. Zhao, and Z.-H. Zhou. Learning with Asynchronous Labels. In: ACM Trans. Knowl. Discovery Data18 (2024), pp. 1–27
work page 2024
-
[49]
V. Raman and A. Tewari. Online Classification with Predictions. In:NeurIPS. 2024
work page 2024
-
[50]
OptimalSingle-ChoiceProphetInequalities from Samples
A.Rubinstein,J.Z.Wang,andS.M.Weinberg. OptimalSingle-ChoiceProphetInequalities from Samples. In:ITCS. Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2020, 60:1– 60:10
work page 2020
-
[51]
F. Spaeh and A. Ene. Online Ad Allocation with Predictions. In:NeurIPS. 2023
work page 2023
-
[52]
G. Szabo and B. A. Huberman. Predicting the popularity of online content. In:Commun. ACM53.8 (2010), pp. 80–88
work page 2010
- [53]
-
[54]
A. Wei. Better and Simpler Learning-Augmented Online Caching. In:APPROX-RANDOM. Vol. 176. LIPIcs. 2020, 60:1–60:17
work page 2020
- [55]
-
[56]
J. Yang and D. Zhan. Generalized Delayed Feedback Model with Post-Click Information in Recommender Systems. In:NeurIPS. 2022. A Omitted proofs from Section 2 This section is dedicated to giving the formal proofs omitted from Section 2. We start by proving the Bellman recursion. Lemma 1(Bellman recursion).Fix α > 0. There is an optimal policy that never st...
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.