A stochastic approximation method. The Annals of Mathematical Statistics, pages 400–407

Herbert Robbins, Sutton Monro · 1951

Mixed citation behavior. The most common role is method (60% of classified citations).

11 Pith papers citing it

citation-role summary

method 3 · background 2

citation-polarity summary

use method 3 · background 2

representative citing papers

Unified High-Probability Analysis of Stochastic Variance-Reduced Estimation

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

A unified recursion framework for stochastic variance-reduced estimation yields high-probability bounds and the first Õ(ε^{-3}) oracle complexity for stochastic optimization with expectation constraints.

Quotient-Categorical Representations for Bellman-Compatible Average-Reward Distributional Reinforcement Learning

cs.LG · 2026-05-11 · unverdicted · novelty 7.0

Develops quotient-categorical representations that render the average-reward distributional Bellman operator well-defined, non-expansive, and convergent under i.i.d. and Markovian sampling.

A Finite-Iteration Theory for Asynchronous Categorical Distributional Temporal-Difference Learning

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

Finite-iteration guarantees are established for asynchronous scalar categorical TD in Cramér geometry and multivariate signed-categorical TD in MMD geometry under i.i.d., Markovian, and episodic sampling.

Large Spikes in Stochastic Gradient Descent: A Large-Deviations View

cs.LG · 2026-03-10 · unverdicted · novelty 7.0

Large loss spikes in SGD are polynomially likely and serve as the dominant mechanism for escaping sharp minima toward flatter solutions in the NTK regime.

Perfect Parallelization in Mini-Batch SGD with Classical Momentum Acceleration

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

Classical momentum acceleration in mini-batch SGD for quadratics is proportional to batch size up to saturation, enabling perfect parallelization under minimal noise assumptions.

A Differentiable Interior-Point Method in Single Precision

math.OC · 2026-05-18 · conditional · novelty 6.0

An alternative complementarity formulation for primal-dual interior-point methods keeps linear systems spectrally bounded near the solution, enabling stable single-precision solves and differentiation for bilevel and end-to-end learning.

Robust stochastic first order methods in heavy-tailed noise via medoid mini-batch gradient sampling

math.OC · 2026-05-08 · unverdicted · novelty 6.0

R-SGD-Mini achieves O(1/T) convergence of expected squared gradient norm to a noise-dependent neighborhood in heavy-tailed settings by selecting the medoid gradient from M data chunks.

Offline Policy Optimization with Posterior Sampling

cs.AI · 2026-05-08 · unverdicted · novelty 6.0

PSPO combines Bayesian posterior sampling of transition dynamics with constrained policy optimization to trade off generalization and robustness in offline RL.

Bayesian copula-based modelling for multi-type spatio-temporal epidemic data

stat.ME · 2026-05-05 · unverdicted · novelty 6.0

A novel Bayesian copula-based model for joint multi-type spatio-temporal epidemic dynamics, with MCMC inference and validation on simulated data plus European meningococcal incidence records.

Elephant random walk with attributed steps and extractions of random sizes

math.PR · 2026-04-19 · unverdicted · novelty 6.0

A market choice model with random-size sampling from past customers is represented as an elephant random walk variant, with proofs of almost sure convergence of S_n/n and regime-dependent distributional limits for scaled S_n.

Learning Threshold-Type Investment Strategies with Stochastic Gradient Method

q-fin.PM · 2019-07-04 · unverdicted · novelty 6.0

A stochastic gradient algorithm learns log-optimal threshold-type strategies for online portfolio optimization across varied price dynamics.
