Title resolution pending

Sean P · 2024 · arXiv 2024.340964

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Target Updates May Stabilize Linear Q-Learning: Periodic and Soft Dynamics

stat.ML · 2026-05-31 · unverdicted · novelty 7.0

Periodic and soft target updates guarantee convergence in linear Q-learning to the exact projected Q-Bellman solution under spectral and step-size conditions via joint spectral radius analysis of switched linear systems.

A Switching System Theory of Q-Learning with Linear Function Approximation

cs.LG · 2026-05-10 · unverdicted · novelty 7.0 · 2 refs

Derives an exact linear switched model for the mean dynamics of Q-learning with linear function approximation and relates convergence to joint spectral radius stability of the switched system, extending the view to stochastic and regularized cases.

Geometrically Averaged Hard Target Updates for Linear Q-Learning

cs.LG · 2026-06-09 · unverdicted · novelty 6.0

Introduces and analyzes the λ-target update for linear Q-learning via geometric averaging of periodic target maps, studied with a switching-system model in the deterministic case.

A linear-quadratic partially observed Stackelberg stochastic differential game with multiple followers and its application to multi-agent formation control

math.OC · 2024-12-10 · unverdicted · novelty 5.0

Derives optimal strategies for a partially observed Stackelberg SDE game with asymmetric information and extends deterministic multi-agent formation control to the stochastic case.

citing papers explorer

Showing 4 of 4 citing papers.

Target Updates May Stabilize Linear Q-Learning: Periodic and Soft Dynamics stat.ML · 2026-05-31 · unverdicted · none · ref 22
Periodic and soft target updates guarantee convergence in linear Q-learning to the exact projected Q-Bellman solution under spectral and step-size conditions via joint spectral radius analysis of switched linear systems.
A Switching System Theory of Q-Learning with Linear Function Approximation cs.LG · 2026-05-10 · unverdicted · none · ref 26 · 2 links
Derives an exact linear switched model for the mean dynamics of Q-learning with linear function approximation and relates convergence to joint spectral radius stability of the switched system, extending the view to stochastic and regularized cases.
Geometrically Averaged Hard Target Updates for Linear Q-Learning cs.LG · 2026-06-09 · unverdicted · none · ref 22
Introduces and analyzes the λ-target update for linear Q-learning via geometric averaging of periodic target maps, studied with a switching-system model in the deterministic case.
A linear-quadratic partially observed Stackelberg stochastic differential game with multiple followers and its application to multi-agent formation control math.OC · 2024-12-10 · unverdicted · none · ref 24
Derives optimal strategies for a partially observed Stackelberg SDE game with asymmetric information and extends deterministic multi-agent formation control to the stochastic case.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer