pith. sign in

arxiv: 2503.16737 · v7 · submitted 2025-03-20 · 📊 stat.ML · cs.LG· math.PR· math.ST· stat.TH

Revenue Maximization Under Sequential Price Competition Via The Estimation Of s-Concave Demand Functions

Pith reviewed 2026-05-22 22:45 UTC · model grok-4.3

classification 📊 stat.ML cs.LGmath.PRmath.STstat.TH
keywords dynamic pricingprice competitions-concavityregret boundsNash equilibriumsemi-parametric estimationshape-constrained estimation
0
0 comments X

The pith

A semi-parametric least-squares policy lets competing sellers converge to Nash prices at rate O(T^{-1/7}) while incurring O(T^{5/7}) regret each.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a dynamic pricing policy for multiple sellers who set prices simultaneously each period without knowing the nonlinear demand functions that depend on all prices. Sellers use semi-parametric least-squares estimation of their demands under the assumption that those functions are s-concave. When all sellers follow the policy, individual prices converge to the Nash equilibrium prices that would arise under full information at rate O(T^{-1/7}). Each seller's cumulative regret against a dynamic benchmark is bounded by O(T^{5/7}). The work also proves equilibrium existence under s-concavity and derives new concentration bounds for the shape-constrained estimator.

Core claim

Under s-concave demand functions, the proposed semi-parametric least-squares dynamic pricing policy drives prices to the full-information Nash equilibrium at rate O(T^{-1/7}) and bounds each seller's regret by O(T^{5/7}).

What carries the argument

s-concavity of demand functions, which guarantees equilibrium existence and supplies the shape constraint used both for estimation and for deriving the convergence and regret rates via semi-parametric least-squares.

If this is right

  • Sellers can approach competitive equilibrium prices without observing rivals' demands or knowing the demand mapping explicitly.
  • The same policy yields both asymptotic price convergence and sublinear regret against a dynamic benchmark.
  • New concentration inequalities hold for least-squares estimators under shape constraints.
  • Equilibrium existence follows directly from the s-concavity assumption on the demand system.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The approach may extend to other shape restrictions that also guarantee equilibrium existence, such as log-concavity.
  • The rates could be tested in laboratory experiments where human subjects play the pricing game with unknown demand.
  • Similar semi-parametric methods might apply to other repeated strategic interactions such as quantity competition or auction bidding.

Load-bearing premise

The demand function of each seller is s-concave.

What would settle it

A numerical simulation in which demand functions violate s-concavity while all other modeling assumptions hold and the observed convergence rate is slower than O(T^{-1/7}).

Figures

Figures reproduced from arXiv: 2503.16737 by Cong Shi, Daniele Bracale, Moulinath Banerjee, Yuekai Sun.

Figure 4
Figure 4. Figure 4: Illustration of our policy (Algorithm 1) with N = 4 sellers in sequential price competition under nonlinear demands. For each seller i ∈ {1, 2, 3, 4}, in their exploration phase (dotted line) of length τi , they offer randomized prices following their distribution Di . Within the exploration phase, each seller has a private phase for estimating θi (blue box with dotted border), with length τiκi and a priva… view at source ↗
Figure 5
Figure 5. Figure 5: Performance of Algorithm 1 in sequential price competition with N ∈ {2, 4, 6} sellers for different values of the contraction constant LΓ. 5 6 7 8 −3.0 −2.5 −2.0 −1.5 −1.0 N=2 log kp (T) − p ?k2 m=-0.23 m=-0.16 m=-0.44 5 6 7 8 2 3 4 5 log Total Expected Regret m=0.66 m=0.71 m=0.60 5 6 7 8 −1.5 −1.0 −0.5 0.0 N=4 m=-0.30 m=-0.32 m=-0.43 5 6 7 8 3 4 5 m=0.55 m=0.59 m=0.54 5 6 7 8 log T −1.0 −0.5 0.0 N=6 m=-0.… view at source ↗
Figure 6
Figure 6. Figure 6: Log-log performance of Algorithm 1 in sequential price competition with N ∈ {2, 4, 6} sellers for different values of the contraction constant LΓ. The slopes, indicated as m, of the convergence to NE and the regret are always smaller than the corresponding theoretical upper bounds (−1/7 and 5/7). 51 [PITH_FULL_IMAGE:figures/full_fig_p051_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Performance in the different exploration phases. [PITH_FULL_IMAGE:figures/full_fig_p052_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Log-log performance of Algorithm 1 in sequential price competition with N = 4 sellers for different values of misspecification of si = 0, specifically {−0.2, −0.1, 0, 0.1, 0.2}. The slopes, indicated as m, of the convergence to NE and the regret are close to each other. The small variations in the rates can be attributed to the finite sample experiment (T = 1600). 52 [PITH_FULL_IMAGE:figures/full_fig_p052… view at source ↗
Figure 9
Figure 9. Figure 9: Illustration of our policy (Algorithm 1) with N = 4 sellers in sequential price competition under nonlinear demands. For each seller i ∈ {1, 2, 3, 4}, in their exploration phase (dotted line) of length τ , they offer randomized prices following their distribution Di . Within the exploration phase, each seller has a private phase for estimating θi (blue box), with length τκi and a private phase for estimati… view at source ↗
Figure 10
Figure 10. Figure 10: (by Bagnoli & Bergstrom (2006)) Distributions with log-concave density functions (distribution functions marked ∗ lack a closed-form representation). 54 [PITH_FULL_IMAGE:figures/full_fig_p054_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: , by Bagnoli & Bergstrom (2006) summarizes, for each of these cases, whether the density and the CDF are log-concave or log-convex [PITH_FULL_IMAGE:figures/full_fig_p055_11.png] view at source ↗
read the original abstract

We consider price competition among multiple sellers over a selling horizon of $T$ periods. In each period, sellers simultaneously offer their prices (which are made public) and subsequently observe their respective demand (not made public). The demand function of each seller depends on all sellers' prices through a private, unknown, and nonlinear relationship. We propose a dynamic pricing policy that uses semi-parametric least-squares estimation and show that when the sellers employ our policy, their prices converge at a rate of $O(T^{-1/7})$ to the Nash equilibrium prices that sellers would reach if they were fully informed. Each seller incurs a regret of $O(T^{5/7})$ relative to a dynamic benchmark policy. A theoretical contribution of our work is proving the existence of equilibrium under shape-constrained demand functions via the concept of $s$-concavity and establishing regret bounds of our proposed policy. Technically, we also establish new concentration results for the least squares estimator under shape constraints. Our findings offer significant insights into dynamic competition-aware pricing and contribute to the broader study of non-parametric learning in strategic decision-making.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The manuscript considers multi-seller sequential price competition over a horizon of T periods, where each seller's demand is an unknown nonlinear function of all prices. It proposes a semi-parametric least-squares dynamic pricing policy and claims that, when all sellers adopt it, prices converge to the fully-informed Nash equilibrium at rate O(T^{-1/7}) while each seller incurs regret O(T^{5/7}) against a dynamic benchmark. The paper proves existence of equilibrium under the maintained s-concavity shape constraint on demands and derives new concentration inequalities for the shape-constrained least-squares estimator.

Significance. If the central claims hold, the work supplies the first explicit convergence and regret rates for competitive dynamic pricing under a shape constraint that simultaneously guarantees equilibrium existence and enables the statistical bounds. The new concentration results for shape-constrained estimation constitute a technical contribution with potential applicability to other non-parametric learning problems in games.

minor comments (2)
  1. [Abstract] Abstract: the statement of the convergence rate O(T^{-1/7}) and regret O(T^{5/7}) would benefit from a parenthetical note on the dependence (or independence) of the hidden constants on the s-concavity parameter and the dimension of the price vector.
  2. [Regret analysis] The manuscript would be strengthened by an explicit statement, in the section deriving the regret bound, of how the s-concavity parameter enters the final O(T^{5/7}) expression (or why it does not).

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive assessment of the manuscript, accurate summary of the contributions on s-concave demand estimation in sequential price competition, and recommendation for minor revision. The significance statement correctly highlights the novelty of the convergence rates, regret bounds, and new concentration inequalities for shape-constrained estimators.

Circularity Check

0 steps flagged

No significant circularity identified

full rationale

The paper maintains s-concavity as an explicit shape constraint on demand functions and derives equilibrium existence plus new concentration inequalities for the semi-parametric least-squares estimator directly from that assumption. The stated O(T^{-1/7}) convergence and O(T^{5/7}) regret bounds are obtained from these new technical results rather than by renaming fitted quantities or reducing to self-citations. No load-bearing step equates a claimed prediction to an input by construction; the derivation chain is self-contained against the maintained assumptions and external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claims rest on the domain assumption that demand functions are s-concave; no free parameters or invented entities are mentioned in the abstract.

axioms (1)
  • domain assumption Demand functions satisfy s-concavity
    Invoked to establish existence of Nash equilibrium and to obtain the stated convergence and regret rates (abstract, theoretical contribution paragraph).

pith-pipeline@v0.9.0 · 5739 in / 1186 out tokens · 38360 ms · 2026-05-22T22:45:46.661341+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Harnessing Unimodality in Semiparametric Contextual Pricing via Oracle Price Map Learning

    stat.ML 2026-05 unverdicted novelty 7.0

    ORBIT learns the (β-1)-smooth oracle price map via local polynomial approximation and bandit convex optimization in a semiparametric contextual pricing model, achieving regret Õ(T^{(2β-1)/(4β-3)} + √(dT)) with a match...

  2. Equilibrium and Pricing in Consumer Networks with Nonlinear Utilities: An Online Shape-Constrained Learning Approach

    math.ST 2026-05 unverdicted novelty 7.0

    The paper establishes equilibrium existence and uniqueness for nonlinear utility consumer networks under contraction conditions and proposes a shape-constrained isotonic regression approach with strict no-regret conve...

Reference graph

Works this paper leans on

23 extracted references · 23 canonical work pages · cited by 2 Pith papers

  1. [1]

    In particular, this assumption is invoked in Lemma E.2 to derive the normal equations (the same assumption can be found in Balabdaoui et al

    The requirement that exploration prices follow an elliptically symmetric distribution is used to guarantee the consistency of the estimator of θi in the single-index model. In particular, this assumption is invoked in Lemma E.2 to derive the normal equations (the same assumption can be found in Balabdaoui et al. (2019) and Brillinger (2012)). If the explo...

  2. [2]

    It ensures that ψi,θ(u) (defined in (18)) depends on u solely through the argument of ψi rather than that of g (see Equation (29))

    As we will discuss in Appendix D.2, the property that g(x + y) = g(x)g(y) (satisfied, for example, by Gaussian distributions) supports the estimation of ψi. It ensures that ψi,θ(u) (defined in (18)) depends on u solely through the argument of ψi rather than that of g (see Equation (29)). This decoupling guarantees that ψi,θ inherits key properties from ψi...

  3. [3]

    From the constraint in Equation (27): u − θ⊤m = v⊤ v ∥v∥2 y1 + α ⇒ u − θ⊤m = ∥v∥2y1 ⇒ y1 = u − θ⊤m ∥v∥2 . Since the original density in the y-space is fy(y) ∝ g(y2 1 + ∥α∥2 2), when conditioning on y1, the conditional density on the (N − 1)-dimensional space of α is: fα|v⊤y=u−θ⊤m(α) ∝ g u − θ⊤m ∥v∥2 2 + ∥α∥2 2 ! . Now, using the assumption that g(x + y) =...

  4. [4]

    (37) Lemma G.1. If Assumptions in the Theorem hold, for T sufficiently large we have E[supp∈P |ψi(θ⊤ i p) − bψi, eθi (eθ⊤ i p)|] ≲ Xi √ N( log τ τ )2/5, where Xi = max {Ci, Bi}, and Bi and Ci are defined in (17) and (21), respectively. Moreover, there exists a unique valueκ⋆ i ∈ (0, 1) that minimizes E[supp∈P |ψi(θ⊤ i p) − bψi, eθi (eθ⊤ i p)|]. This value...

  5. [5]

    Thus, LΓ ↑ 1 as β1 ↓ q N −1 N

    < β 2 1 ⇐ ⇒ β2 1 > N − 1 N . Thus, LΓ ↑ 1 as β1 ↓ q N −1 N . In particular, we have the following scheme: N = 2 : β1 ↓ q 1 2 ≈ 0.707 = ⇒ LΓ ↑ 1, N = 4 : β1 ↓ q 3 4 ≈ 0.866 = ⇒ LΓ ↑ 1, N = 6 : β1 ↓ q 5 6 ≈ 0.913 = ⇒ LΓ ↑ 1. A summary of the two extreme cases can be found in Table 3. Simulation design. Given the discussion above, and the table in Table 3, f...

  6. [6]

    The Cauchy PDF is (−1/2)-optimal-concave (independently of scaling and location param- eters)

  7. [7]

    The Pareto PDF is − 1 α+1 -optimal-concave, (independently of the location), where α > 0 is the scaling factor

  8. [8]

    L.3.2 s-CONCAVE CDF S AND SURVIVAL FUNCTIONS Let F be a CDF and ¯F = 1 − F its survival function, and let f = F ′

    The log-normal PDF with parameters (µ, σ2) is − σ2 4 -optimal-concave, (independently of µ). L.3.2 s-CONCAVE CDF S AND SURVIVAL FUNCTIONS Let F be a CDF and ¯F = 1 − F its survival function, and let f = F ′. When {u : f(u) > 0} = R, for similar reasoning as in Remark L.5, a necessary condition for having ϕ = ds ◦ F or ϕ = ds ◦ ¯F concave, is that s ≤ 0. I...

  9. [9]

    56 Published as a conference paper at ICLR 2026

    If f(a) = 0, f ′(a) = 0 and F is s-concave, then s ≤ 1/2. 56 Published as a conference paper at ICLR 2026

  10. [10]

    In the following proposition, we prove that, under certain conditions, when a density function is s-concave, then the CDF and the survival function is µ-concave for some µ = µ(s)

    If f(b) = 0, f ′(b) = 0 and ¯F is s-concave, then s ≤ 1/2. In the following proposition, we prove that, under certain conditions, when a density function is s-concave, then the CDF and the survival function is µ-concave for some µ = µ(s). The proof of Proposition L.8 is deferred to Appendix L.3.7. Proposition L.8. Fix a function f : (a, b) 7→ (0, ∞) conti...

  11. [11]

    If f is s-concave on (a, b) with f(a) ̸= 0 and s > −1, then F is µ-concave for all µ ≤ 1 − 1 s+1

  12. [12]

    If f is s-concave on (a, b) with f(a) = 0 and s ̸= −1, then F is µ-concave for all µ ≤ 1 − 1 s+1

  13. [13]

    The proof of Proposition L.9 is deferred to Appendix L.3.8

    If f is monotone decreasing, then F is s-concave for any s < 1. The proof of Proposition L.9 is deferred to Appendix L.3.8. Proposition L.9. Fix a function f : (a, b) 7→ [0, ∞) continuously differentiable, and let F (u) =R u a f(t)dt for all x ∈ (a, b) and define f(b) = limu→b f(u). Then:

  14. [14]

    If f is s-concave on (a, b) with f(b) ̸= 0 and s > −1, then ¯F is µ-concave for all µ ≤ 1 − 1 s+1

  15. [15]

    If f is s-concave on (a, b) with f(b) = 0 and s ̸= −1, then ¯F is µ-concave for all µ ≤ 1 − 1 s+1

  16. [16]

    The special case s = 0 was proved by Bagnoli & Bergstrom (2006)

    If f is monotone decreasing, then ¯F is s-concave for any s < 1. The special case s = 0 was proved by Bagnoli & Bergstrom (2006). L.3.3 s-CONCAVE CDF S THAT ARE NOT LOG -CONCAVE From Figure 11 we already know that the CDFs of thePareto, Lognormal. Student’s t, Cauchyare not log-concave. However, by our Proposition L.9 we immediately get that they are s∗-o...

  17. [17]

    The Cauchy CDF and survival function are µ-concave for any µ ≤ 1 (independently of scaling and location parameters)

  18. [18]

    The Pareto CDF and survival function areµ-concave for any µ ≤ − 1 α, (independently of the location), where α is the scaling factor

  19. [19]

    From Proposition L.8 (and Proposition L.9), if f is µ⋆-optimal-concave it does not necessarily means that F (and ¯F ) is s(µ∗) = 1 − 1 µ∗+1 -optimal concave

    The log-normal CDF and survival function are with parameters (µ, σ2) are µ-concave for any µ ≤ σ2 σ2−4, (independently of µ). From Proposition L.8 (and Proposition L.9), if f is µ⋆-optimal-concave it does not necessarily means that F (and ¯F ) is s(µ∗) = 1 − 1 µ∗+1 -optimal concave. Indeed, the optimal concave value can be larger than s(µ∗). However, for ...

  20. [20]

    Fix α > 1 and set ϑ(x) = xα

    Power function example. Fix α > 1 and set ϑ(x) = xα. Then ϑ′(x) = αxα−1, ϑ ′′(x) = α(α − 1)xα−2, and ϑ′′(x) + s(ϑ′(x))2 = αxα−2 h (α − 1) + sαxα i . Since xα ≤ 1 on (0, 1), a sufficient (and sharp) condition is s ≤ − α−1 α . Hence F (x) = exp(xα) is s∗-concave for s∗ = − α−1 α , but not log-concave because ϑ′′(x) > 0

  21. [21]

    4x2 ν2 − ν + 1 2 − ν + 1 2 − 1 1 + x2 ν − ν+1 2 −2 + 2 ν − ν + 1 2 1 + x2 ν − ν+1 2 −1# + (s − 1)

    Exponential example. Fix k > 0 and set ϑ(x) = ekx. Then ϑ′(x) = kekx, ϑ ′′(x) = k2ekx, and ϑ′′(x) + s(ϑ′(x))2 = k2ekx 1 + sekx . Since ekx ≤ ek on (0, 1), a sufficient and sharp condition is s ≤ −e−k. Thus F (x) = exp ekx is s∗-concave for s∗ = −e−k, but not log-concave since ϑ′′(x) > 0. L.3.5 P ROOF OF PROPOSITION L.6 Proof of a). We have that f(x) = Γ( ...

  22. [22]

    However, since this inequality has to hold for all x, we can assume x0 = 0. The characterization translates to 1 + x2 γ2 −1" (−1) 2 γ2 1 + x2 γ2 −2 + 8x2 γ4 1 + x2 γ2 −3# + (s − 1) (−1)2x γ2 1 + x2 γ2 −2!2 ≤ 0 iff (−1) 2 γ2 1 + x2 γ2 −2 + 8x2 γ4 1 + x2 γ2 −3 + (s − 1)4x2 γ4 1 + x2 γ2 −3 ≤ 0 iff (−1) 2 γ2 1 + x2 γ2 + 8x2 γ4 + (s − 1)4x2 γ4 ≤ 0 iff −2x2 γ4 ...

  23. [23]

    L.3.7 P ROOF OF PROPOSITION L.8 Proof

    Similar proof hold for ¯F . L.3.7 P ROOF OF PROPOSITION L.8 Proof. We need to prove thatF · f ′ + (µ − 1)f2 ≤ 0. Using that f s−1 · f ′ is non-increasing (because (ds ◦ f)′′ ≤ 0 i.e. (ds ◦ f)′ = f s−1 · f is non-increasing) we have f ′(u) f(u) F (u) = f s−1(u) f(u) f ′(u) f s−1(u) Z u a f(t)dt ≤ 1 f(u)f s−1(u) Z u a f s−1(t)f ′(t)f(t)dt = 1 f(u)f s−1(u) f...