Risk-Sensitive Online Selection with Bounded Adaptivity
Pith reviewed 2026-05-17 03:17 UTC · model grok-4.3
The pith
A single random seed coordinates posted prices to improve lower-tail CVaR performance under bounded adaptivity.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We present a correlated posted-price mechanism that uses a single random seed to coordinate pricing decisions across time. This correlation induces a monotonic ordering of pricing profiles across sample paths, improving lower-tail performance while respecting the adaptivity constraint. The analysis develops a risk-sensitive randomized online primal-dual framework tailored to CVaR objectives and reveals a systematic trade-off between allowable adaptivity, risk sensitivity, and competitive performance.
What carries the argument
Correlated posted-price mechanism that shares one random seed across time steps to induce a monotonic ordering of pricing profiles across sample paths.
If this is right
- Competitive guarantees hold for both static and dynamic pricing across multiple regimes of the problem parameters.
- A clear trade-off emerges between the number of allowed policy updates, the degree of risk sensitivity, and the resulting competitive ratio.
- Empirical tests on airline pricing data show improved welfare concentration and reduced tail behavior compared with uncorrelated baselines.
Where Pith is reading between the lines
- The single-seed correlation idea could be tested in other online problems such as dynamic matching or inventory replenishment to limit downside outcomes.
- Varying the number of shared seeds might reveal a tunable knob between full adaptivity and strict tail-risk control.
- Real systems with update costs might adopt this structure to achieve reliable worst-case behavior without frequent re-optimization.
Load-bearing premise
Coordinating pricing decisions via a single random seed induces a monotonic ordering of pricing profiles that meaningfully improves lower-tail CVaR performance without violating the bounded-adaptivity constraint.
What would settle it
A concrete sequence of buyer valuations where the single-seed mechanism either produces a worse CVaR value than independent per-period randomization or requires more policy updates than the allowed bound.
Figures
read the original abstract
Designing randomized online algorithms that perform reliably not only in expectation but also under unfavorable realizations of randomness is a fundamental challenge in online decision-making. In this paper, we study this challenge in online adversarial selection, where a decision maker allocates $k$ units of a resource to sequentially arriving buyers through posted prices. We focus on two intertwined considerations that are often overlooked simultaneously: tail-risk sensitivity and bounded adaptivity, where tail risk is measured using conditional value-at-risk (CVaR) and bounded adaptivity limits the number of allowable policy updates over time. Our main contribution is a correlated posted-price mechanism that uses a single random seed to coordinate pricing decisions across time. This correlation induces a monotonic ordering of pricing profiles across sample paths, improving lower-tail performance while respecting the adaptivity constraint. More broadly, our results highlight correlation as a mechanism for controlling tail risk in randomized online algorithms. Using this framework, we derive competitive guarantees for several regimes of the problem under both static and dynamic pricing. Our analysis develops a risk-sensitive randomized online primal-dual framework tailored to CVaR objectives and reveals a systematic trade-off between allowable adaptivity, risk sensitivity, and competitive performance. Experiments on real airline pricing data further illustrate the empirical impact of correlated pricing on welfare concentration and tail behavior.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper studies risk-sensitive online adversarial selection with bounded adaptivity, where a decision maker allocates k units via posted prices to sequentially arriving buyers. It focuses on CVaR for tail risk and limits on policy updates. The main contribution is a correlated posted-price mechanism using a single random seed to coordinate decisions across time, inducing a monotonic ordering of pricing profiles across sample paths to improve lower-tail CVaR performance while respecting the adaptivity bound. The authors develop a risk-sensitive randomized online primal-dual framework for CVaR objectives, derive competitive guarantees for static and dynamic pricing regimes, and present experiments on airline pricing data showing impacts on welfare concentration and tail behavior.
Significance. If the central construction and analysis hold, the paper would offer a technically interesting approach to controlling tail risk in randomized online algorithms via correlation under adaptivity constraints. The risk-sensitive primal-dual framework and the identified trade-off between adaptivity, risk sensitivity, and competitive ratio could inform mechanism design in online resource allocation. The empirical results on real data provide some practical grounding, though their strength depends on the details of the data-handling and evaluation protocol.
major comments (2)
- [Abstract / main contribution] Abstract and main contribution paragraph: the claim that a single shared random seed produces a monotonic ordering of pricing profiles across sample paths, improving CVaR lower-tail performance while obeying the bounded-adaptivity constraint, is load-bearing for the central result. The skeptic's concern is valid on the given text: nothing shows that monotonicity is preserved when an adversarial arrival sequence forces an early policy update on one sample path but not another, both using the same seed. A concrete argument or proof that the limited number of updates suffices to maintain the ordering for every adversarial valuation sequence is required; without it the competitive guarantees for CVaR cannot be verified.
- [Framework description] Risk-sensitive randomized online primal-dual framework section: the abstract states that the framework is 'tailored to CVaR objectives' and yields competitive guarantees, yet provides no indication of how the CVaR objective is incorporated into the dual updates or how the single-seed correlation is analyzed within the primal-dual argument. If the performance bounds are derived only with respect to quantities internal to the same framework, the guarantees risk circularity and require an independent benchmark or external comparison to be convincing.
minor comments (2)
- [Abstract] The abstract would benefit from explicit statements of the competitive ratios obtained in the static and dynamic pricing regimes and the precise adaptivity bound (e.g., number of updates allowed).
- [Experiments] Experiments on airline data: the description mentions 'empirical impact' but does not specify the data-handling rules, number of runs, or error bars; adding these would improve reproducibility.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive review of our manuscript. We address the major comments point by point below and indicate the revisions we plan to make.
read point-by-point responses
-
Referee: [Abstract / main contribution] Abstract and main contribution paragraph: the claim that a single shared random seed produces a monotonic ordering of pricing profiles across sample paths, improving CVaR lower-tail performance while obeying the bounded-adaptivity constraint, is load-bearing for the central result. The skeptic's concern is valid on the given text: nothing shows that monotonicity is preserved when an adversarial arrival sequence forces an early policy update on one sample path but not another, both using the same seed. A concrete argument or proof that the limited number of updates suffices to maintain the ordering for every adversarial valuation sequence is required; without it the competitive guarantees for CVaR cannot be verified.
Authors: We agree that an explicit argument for monotonicity preservation under adversarial arrivals is required. In the revised manuscript we will add a dedicated lemma (with full proof) showing that the shared seed forces synchronous update triggers across paths up to the adaptivity bound; any divergence is limited by the update budget, and the pricing functions remain ordered by construction of the single-seed mechanism. This lemma will be referenced from the abstract and main contribution paragraph. revision: yes
-
Referee: [Framework description] Risk-sensitive randomized online primal-dual framework section: the abstract states that the framework is 'tailored to CVaR objectives' and yields competitive guarantees, yet provides no indication of how the CVaR objective is incorporated into the dual updates or how the single-seed correlation is analyzed within the primal-dual argument. If the performance bounds are derived only with respect to quantities internal to the same framework, the guarantees risk circularity and require an independent benchmark or external comparison to be convincing.
Authors: We will expand the framework section to make the CVaR incorporation explicit: the dual update is modified to use a CVaR-adjusted subgradient that weights tail realizations according to the risk level. The single-seed correlation is analyzed by coupling sample paths and bounding the joint distribution directly in the primal-dual potential. To remove any appearance of circularity we will add a direct comparison of the resulting ratios against the classical (non-CVaR) online primal-dual benchmark from the literature. revision: yes
Circularity Check
No circularity: claims rest on independent mechanism design and analysis
full rationale
The abstract and provided excerpts describe a new correlated posted-price mechanism using a single random seed to induce monotonic ordering, followed by derivation of competitive guarantees via a risk-sensitive randomized online primal-dual framework. No equations, fitted parameters renamed as predictions, or self-citations are quoted that reduce the central result to its own inputs by construction. The monotonicity and CVaR improvement are presented as consequences of the mechanism design rather than tautological redefinitions, and the analysis is described as developing a tailored framework with explicit trade-offs, making the derivation self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard assumptions of adversarial buyer arrivals and limited resource units hold for the online selection problem.
Reference graph
Works this paper leans on
-
[1]
Michael O. Ball and Maurice Queyranne. Toward robust revenue management: Competitive analysis of online booking.Operations Research, 57(4):950–963, 2009
work page 2009
-
[2]
Single-leg revenue management with advice
Santiago Balseiro, Christian Kroer, and Rachitesh Kumar. Single-leg revenue management with advice. InProceedings of the 24th ACM Conference on Economics and Computation, EC ’23, page 207, New York, NY, USA, 2023. Association for Computing Machinery
work page 2023
-
[3]
Optimal thompson sampling strategies for support-aware cvar bandits
Dorian Baudry, Romain Gautron, Emilie Kaufmann, and Odalric Maillard. Optimal thompson sampling strategies for support-aware cvar bandits. In Marina Meila and Tong Zhang, editors,Proceedings of the 38th International Conference on Machine Learning, volume 139 ofProceedings of Machine Learning Research, pages 716–726. PMLR, 18–24 Jul 2021
work page 2021
-
[4]
Boxiao Chen and Xiuli Chao. Parametric demand learning with limited price explorations in a backlog stochastic inventory system.IISE Transactions, 51(6):605–613, 2019
work page 2019
-
[5]
Boxiao Chen, Xiuli Chao, and Yining Wang. Technical note—data-based dynamic pricing and inventory control with censored demand and limited price changes.Operations Research, 68(5):1445–1456, 2020
work page 2020
-
[6]
Wang Chi Cheung, David Simchi-Levi, and He Wang. Technical note—dynamic pricing and demand learning with limited price experimentation.Operations Research, 65(6):1722–1731, 2017
work page 2017
-
[7]
Risk-sensitive and robust decision- making: a cvar optimization approach
Yinlam Chow, Aviv Tamar, Shie Mannor, and Marco Pavone. Risk-sensitive and robust decision- making: a cvar optimization approach. InProceedings of the 29th International Conference on Neural Information Processing Systems - Volume 1, NIPS’15, page 1522–1530, Cambridge, MA, USA, 2015. MIT Press
work page 2015
-
[8]
Risk-sensitive online algorithms (ex- tended abstract)
Nicolas Christianson, Bo Sun, Steven Low, and Adam Wierman. Risk-sensitive online algorithms (ex- tended abstract). In Shipra Agrawal and Aaron Roth, editors,Proceedings of Thirty Seventh Conference on Learning Theory, volume 247 ofProceedings of Machine Learning Research, pages 1140–1141. PMLR, 30 Jun–03 Jul 2024
work page 2024
-
[9]
Jos ´e Correa, Patricio Foncea, Ruben Hoeksma, Tim Oosterwijk, and Tjark Vredeveld. Posted price mechanisms and optimal threshold strategies for random arrivals.Mathematics of Operations Research, 46(4):1452–1478, 2021. 25
work page 2021
-
[10]
Devanur, Kamal Jain, and Robert D
Nikhil R. Devanur, Kamal Jain, and Robert D. Kleinberg. Randomized primal-dual analysis of ranking for online bipartite matching. SODA ’13, page 101–107, USA, 2013. Society for Industrial and Applied Mathematics
work page 2013
-
[11]
Cvar optimization for mdps: Existence and computation of optimal policies
Rui Ding and Eugene Feinberg. Cvar optimization for mdps: Existence and computation of optimal policies. 50(2):39–41, August 2022
work page 2022
-
[12]
Michael Dinitz, Sungjin Im, Thomas Lavastida, Benjamin Moseley, and Sergei Vassilvitskii.Control- ling Tail Risk in Online Ski-Rental, pages 4247–4263
-
[13]
An economics-based analysis of ranking for online bipartite matching
Alon Eden, Michal Feldman, Amos Fiat, and Kineret Segal. An economics-based analysis of ranking for online bipartite matching. InSymposium on Simplicity in Algorithms (SOSA), pages 107–110. SIAM, 2021
work page 2021
-
[14]
Jacob Feldman and Danny Segev. Dynamic pricing with menu costs: Approximation schemes and applications to grocery retail.Manufacturing & Service Operations Management, 27(4):1087–1106, 2025
work page 2025
-
[15]
Moran Feldman, Ola Svensson, and Rico Zenklusen. Online contention resolution schemes with applications to bayesian selection problems.SIAM Journal on Computing, 50(2):255–300, 2021
work page 2021
-
[16]
An Introduction in Discrete Time
Hans F ¨ollmer and Alexander Schied.Stochastic Finance. An Introduction in Discrete Time. 5th revised and extended edition.De Gruyter, 5th revised and extended edition edition, 2025
work page 2025
-
[17]
Improved online correlated selection.CoRR, abs/2106.04224, 2021
Ruiquan Gao, Zhongtian He, Zhiyi Huang, Zipei Nie, Bijun Yuan, and Yan Zhong. Improved online correlated selection.CoRR, abs/2106.04224, 2021
-
[18]
Real-time optimization of person- alized assortments.Management Science, 60(6):1532–1551, 2014
Negin Golrezaei, Hamid Nazerzadeh, and Paat Rusmevichientong. Real-time optimization of person- alized assortments.Management Science, 60(6):1532–1551, 2014
work page 2014
-
[19]
T. P. Hill and Robert P. Kertz. Comparisons of stop rule and supremum expectations of i.i.d. random variables.The Annals of Probability, 10(2):336–345, 1982
work page 1982
-
[20]
Posted price mechanisms for online allocation with diseconomies of scale
Hossein Nekouyan Jazi, Bo Sun, Raouf Boutaba, and Xiaoqi Tan. Posted price mechanisms for online allocation with diseconomies of scale. InProceedings of the ACM Web Conference 2025 (WWW ’25), New York, NY, USA, April 2025. ACM
work page 2025
-
[21]
Optimal algorithms for k-search with application in option pricing
Julian Lorenz, Konstantinos Panagiotou, and Angelika Steger. Optimal algorithms for k-search with application in option pricing. InProceedings of the 15th Annual European Conference on Algorithms, ESA’07, page 275–286, Berlin, Heidelberg, 2007. Springer-Verlag
work page 2007
-
[22]
Randomized rounding approaches to online allocation, sequencing, and matching, 2024
Will Ma. Randomized rounding approaches to online allocation, sequencing, and matching, 2024
work page 2024
-
[23]
Will Ma and David Simchi-Levi. Algorithms for online matching, assortment, and pricing with tight weight-dependent competitive ratios.Oper. Res., 68(6):1787–1803, November 2020
work page 2020
-
[24]
Will Ma, David Simchi-Levi, and Chung-Piaw Teo. On policies for single-leg revenue management with limited demand information.Operations Research, 69(1):207–226, 2021
work page 2021
-
[25]
Guanqun Ni. Competitive analysis of online revenue management with two hierarchical resources and multiple fare classes.PLOS ONE, 17(10):1–11, 10 2022
work page 2022
- [26]
-
[27]
Alexander Shapiro, Darinka Dentcheva, and A. Ruszczynski. Lectures on stochastic programming: Modeling and theory. 2009
work page 2009
-
[28]
Springer, New York, NY, 1 edition, 2011
Hal Smith.An Introduction to Delay Differential Equations with Applications to the Life Sciences, volume 57 ofTexts in Applied Mathematics. Springer, New York, NY, 1 edition, 2011
work page 2011
-
[29]
Online risk-averse submodular maximization
Tasuku Soma and Yuichi Yoshida. Online risk-averse submodular maximization. In Zhi-Hua Zhou, editor,Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, pages 2988–2994. International Joint Conferences on Artificial Intelligence Organization, 8 2021. Main Track
work page 2021
-
[30]
Static pricing for online selection problem and its variants
Bo Sun, Hossein Nekouyan Jazi, Xiaoqi Tan, and Raouf Boutaba. Static pricing for online selection problem and its variants. InThe 20th Conference on Web and Internet Economics, WINE 2024, Edinburgh, United Kingdom Proceedings ,December 2-5, 2024. Springer, 2024
work page 2024
-
[31]
Lui, Don Towsley, and Danny H.K
Bo Sun, Lin Yang, Mohammad Hajiesmaili, Adam Wierman, John C.S. Lui, Don Towsley, and Danny H.K. Tsang. The online knapsack problem with departures. InAbstract Proceedings of the 2023 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, SIGMETRICS ’23, page 59–60, New York, NY, USA, 2023. Association for Computing Machinery
work page 2023
-
[32]
Distributionally-aware explo- ration for CVaR bandits
Alex Tamkin, Ramtin Keramati, Christoph Dann, and Emma Brunskill. Distributionally-aware explo- ration for CVaR bandits. InNeurIPS 2019 Workshop on Safety and Robustness in Decision-making, Vancouver, Canada, Dec 2019. 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Workshop
work page 2019
-
[33]
Mechanism design for online resource allocation: A unified approach.Proc
Xiaoqi Tan, Bo Sun, Alberto Leon-Garcia, Yuan Wu, and D.H.K Tsang. Mechanism design for online resource allocation: A unified approach.Proc. ACM Meas. Anal. Comput. Syst. (SIGMETRICS ’20), 4(2), June 2020
work page 2020
-
[34]
Xiaoqi Tan, Siyuan Yu, Raouf Boutaba, and Alberto Leon-Garcia. Threshold policies with tight guarantees for online selection with convex costs.ACM Transactions on Economics and Computation, 13(2):1–49, 2025
work page 2025
-
[35]
Zbaracki, Mark Ritson, Daniel Levy, Shantanu Dutta, and Mark Bergen
Mark J. Zbaracki, Mark Ritson, Daniel Levy, Shantanu Dutta, and Mark Bergen. Managerial and customer costs of price adjustment: Direct evidence from industrial markets.The Review of Economics and Statistics, 86(2):514–533, 2004
work page 2004
-
[36]
Budget constrained bidding in keyword auctions and online knapsack problems
Yunhong Zhou, Deeparnab Chakrabarty, and Rajan Lukose. Budget constrained bidding in keyword auctions and online knapsack problems. InProceedings of the 17th International Conference on World Wide Web, WWW ’08, page 1243–1244, New York, NY, USA, 2008. Association for Computing Machinery. 27 A Proof of Lemma 1 Let𝐵 (𝑟2 ) 𝑡 be the set of buyers to whom a un...
work page 2008
-
[37]
Fix any𝑟∈ [0,1]and suppose𝑦 (𝑟) 𝑡 ′ < 𝑞 1 at the arrival of𝑡 ′. Since|𝐵 ∗ 2|=𝑞 2 ≥𝑞 1, and each buyer in𝐵 ∗ 2 has valuation at least𝜙 1(1)while the posted price for the first𝑞 1 units is𝜙 1(𝑅) ≤𝜙 1(1)(as𝜙 1 is increasing), every buyer in𝐵 ∗ 2 accepts the price for those first𝑞 1 units. Thus these𝑞 1 units are fully sold by the arrival of the last buyer in𝐵 ∗
-
[38]
Repeating the same argument inductively for all𝑖≤𝑖 ∗ yields, for any𝑟∈ [0,1],𝑦 (𝑟) ≥ Í𝑖∗ −1 𝑖=1 𝑞𝑖, as claimed. C Continuation of the Proof of Theorem 1 Let us now continue the proof of the feasibility of the dual constraints for each buyer𝑡by considering the remaining cases. Case II:𝑖=𝑖 ∗ and𝜙 −1 𝑖 (𝑣 𝑡 )> 𝑟 ∗. In this case, we consider the following two...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.