An alternating optimization method for bilevel problems under the Polyak-Łojasiewicz condition. Advances in Neural Information Processing Systems, 36:63847–63873, 2023a

Xiao, Q · 2023 · arXiv 2306.02422

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Second-Order Bilevel Optimization with Accelerated Convergence Rates

math.OC · 2026-05-07 · unverdicted · novelty 7.0

Second-order bilevel methods achieve Õ(ε^{-1.5}) iteration complexity for finding second-order stationary points, faster than first-order approaches; a lazy variant improves computational efficiency by a factor of √d.

Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus

cs.LG · 2026-04-15 · conditional · novelty 6.0

CMAT uses a transformer decoder to produce a high-level consensus vector in latent space, enabling simultaneous order-independent actions by all agents and optimization via single-agent PPO, with superior results on StarCraft II, Multi-Agent MuJoCo, and Google Research Football.

Bilevel learning

math.OC · 2026-05-02 · unverdicted · novelty 2.0

Bilevel learning methods rely on implicit differentiation, but they assume a unique lower-level solution and struggle with constraints. Connections to the broader bilevel optimization literature may enable more scalable, general-purpose algorithms.

citing papers explorer

Second-Order Bilevel Optimization with Accelerated Convergence Rates · math.OC · 2026-05-07 · unverdicted · none · ref 15
Bridging MARL to SARL: An Order-Independent Multi-Agent Transformer via Latent Consensus · cs.LG · 2026-04-15 · conditional · none · ref 69
Bilevel learning · math.OC · 2026-05-02 · unverdicted · none · ref 31
