A unified recursion framework for stochastic variance-reduced estimation yields high-probability bounds and the first Õ(ε^{-3}) oracle complexity for stochastic optimization with expectation constraints.
A stochastic approximation method. The Annals of Mathematical Statistics, pages 400–407
Mixed citation behavior. Most common role is method (60%).
representative citing papers
Develops quotient-categorical representations that render the average-reward distributional Bellman operator well-defined, non-expansive, and convergent under i.i.d. and Markovian sampling.
Finite-iteration guarantees are established for asynchronous scalar categorical TD in Cramér geometry and multivariate signed-categorical TD in MMD geometry under i.i.d., Markovian, and episodic sampling.
Large loss spikes in SGD are polynomially likely and serve as the dominant mechanism for escaping sharp minima toward flatter solutions in the NTK regime.
Classical momentum acceleration in mini-batch SGD for quadratics is proportional to batch size up to saturation, enabling perfect parallelization under minimal noise assumptions.
An alternative complementarity formulation for primal-dual interior-point methods keeps linear systems spectrally bounded near the solution, enabling stable single-precision solves and differentiation for bilevel and end-to-end learning.
R-SGD-Mini achieves O(1/T) convergence of expected squared gradient norm to a noise-dependent neighborhood in heavy-tailed settings by selecting the medoid gradient from M data chunks.
PSPO combines Bayesian posterior sampling of transition dynamics with constrained policy optimization to trade off generalization and robustness in offline RL.
A novel Bayesian copula-based model for joint multi-type spatio-temporal epidemic dynamics, with MCMC inference and validation on simulated data plus European meningococcal incidence records.
A market choice model with random-size sampling from past customers is represented as an elephant random walk variant, with proofs of almost sure convergence of S_n/n and regime-dependent distributional limits for scaled S_n.
A stochastic gradient algorithm learns log-optimal threshold-type strategies for online portfolio optimization across varied price dynamics.
citing papers explorer
- Learning Threshold-Type Investment Strategies with Stochastic Gradient Method