A stochastic approximation method.The Annals of Mathematical Statistics, pages 400–407

Herbert Robbins, Sutton Monro · 1951

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies

cs.LG · 2025-10-17 · unverdicted · novelty 7.0

Establishes last-iterate convergence rates for on-policy Q-learning under minimal irreducibility assumptions, with sample complexity O(1/ξ²) matching off-policy up to exploration factors.

\mathsf{VISTA}: Decentralized Machine Learning in Adversary Dominated Environments

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

VISTA adaptively tunes consistency thresholds in decentralized SGD so that the system converges asymptotically like standard SGD even when adversaries dominate the worker pool.

ASPIRE: Make Spectral Graph Collaborative Filtering Great Again via Adaptive Filter Learning

cs.IR · 2026-04-24 · unverdicted · novelty 6.0

ASPIRE learns adaptive graph filters via bi-level optimization to overcome low-frequency explosion bias in spectral collaborative filtering, achieving strong performance and stability.

FedSLoP: Memory-Efficient Federated Learning with Low-Rank Gradient Projection

cs.LG · 2026-04-27 · unverdicted · novelty 5.0

FedSLoP reduces communication and memory costs in federated learning through stochastic low-rank gradient projections, with a nonconvex convergence rate of O(1/sqrt(NT)) and competitive accuracy on heterogeneous MNIST data.

citing papers explorer

Showing 4 of 4 citing papers.

A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies cs.LG · 2025-10-17 · unverdicted · none · ref 7
Establishes last-iterate convergence rates for on-policy Q-learning under minimal irreducibility assumptions, with sample complexity O(1/ξ²) matching off-policy up to exploration factors.
\mathsf{VISTA}: Decentralized Machine Learning in Adversary Dominated Environments cs.LG · 2026-05-08 · unverdicted · none · ref 43
VISTA adaptively tunes consistency thresholds in decentralized SGD so that the system converges asymptotically like standard SGD even when adversaries dominate the worker pool.
ASPIRE: Make Spectral Graph Collaborative Filtering Great Again via Adaptive Filter Learning cs.IR · 2026-04-24 · unverdicted · none · ref 41
ASPIRE learns adaptive graph filters via bi-level optimization to overcome low-frequency explosion bias in spectral collaborative filtering, achieving strong performance and stability.
FedSLoP: Memory-Efficient Federated Learning with Low-Rank Gradient Projection cs.LG · 2026-04-27 · unverdicted · none · ref 27
FedSLoP reduces communication and memory costs in federated learning through stochastic low-rank gradient projections, with a nonconvex convergence rate of O(1/sqrt(NT)) and competitive accuracy on heterogeneous MNIST data.

A stochastic approximation method.The Annals of Mathematical Statistics, pages 400–407

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer