A two-timescale framework for bilevel optimization: Complexity analysis and application to actor-critic

· 2020 · arXiv 2007.05170

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Optimal Sample Complexity for Single Time-Scale Actor-Critic with Momentum

cs.LG · 2026-02-02 · unverdicted · novelty 7.0

Single-timescale actor-critic with STORM momentum and a recent-sample buffer achieves optimal O(ε^{-2}) sample complexity for ε-optimal policies in finite discounted MDPs.

Single-loop approaches to nonsmooth bilevel optimisation

math.OC · 2026-06-17 · unverdicted · novelty 6.0

Develops optimistic and pessimistic calculus rules for set-valued bilevel constraints, derives nonsmooth adjoint inclusions, and proposes a convergent single-loop algorithm demonstrated on total variation inverse problems.

Continuous-Time Analysis for Minimax and Bilevel Problems

math.OC · 2026-05-20 · unverdicted · novelty 6.0

Introduces a modular unified Lyapunov template for continuous-time analysis of minimax, bilevel (via penalty), and min-min-max problems with explicit time-scale thresholds.

CHAL: Council of Hierarchical Agentic Language

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

CHAL is a multi-agent dialectic system that performs structured belief optimization over defeasible domains using Bayesian-inspired graph representations and configurable meta-cognitive value system hyperparameters.

citing papers explorer

Showing 4 of 4 citing papers.

Optimal Sample Complexity for Single Time-Scale Actor-Critic with Momentum cs.LG · 2026-02-02 · unverdicted · none · ref 26
Single-timescale actor-critic with STORM momentum and a recent-sample buffer achieves optimal O(ε^{-2}) sample complexity for ε-optimal policies in finite discounted MDPs.
Single-loop approaches to nonsmooth bilevel optimisation math.OC · 2026-06-17 · unverdicted · none · ref 28
Develops optimistic and pessimistic calculus rules for set-valued bilevel constraints, derives nonsmooth adjoint inclusions, and proposes a convergent single-loop algorithm demonstrated on total variation inverse problems.
Continuous-Time Analysis for Minimax and Bilevel Problems math.OC · 2026-05-20 · unverdicted · none · ref 12
Introduces a modular unified Lyapunov template for continuous-time analysis of minimax, bilevel (via penalty), and min-min-max problems with explicit time-scale thresholds.
CHAL: Council of Hierarchical Agentic Language cs.AI · 2026-05-12 · unverdicted · none · ref 71
CHAL is a multi-agent dialectic system that performs structured belief optimization over defeasible domains using Bayesian-inspired graph representations and configurable meta-cognitive value system hyperparameters.

A two-timescale framework for bilevel optimization: Complexity analysis and application to actor-critic

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer