Title resolution pending

· 1905 · arXiv 1905.08494

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Signature Approach for Contextual Bandits with Nonlinear and Path-dependent Rewards

cs.LG · 2026-05-11 · conditional · novelty 7.0

Signature transforms approximate path-dependent nonlinear rewards as linear functionals, enabling the DisSigUCB algorithm with a high-probability regret bound of order O(sqrt((d+m)KT)).

Anticipatory Reinforcement Learning: From Generative Path-Laws to Distributional Value Functions

cs.LG · 2026-04-06 · unverdicted · novelty 6.0

ARL lifts states into signature-augmented manifolds and employs self-consistent proxies of future path-laws to enable deterministic expected-return evaluation while preserving contraction mappings in jump-diffusion environments.

citing papers explorer

Showing 2 of 2 citing papers.

Signature Approach for Contextual Bandits with Nonlinear and Path-dependent Rewards cs.LG · 2026-05-11 · conditional · none · ref 5
Signature transforms approximate path-dependent nonlinear rewards as linear functionals, enabling the DisSigUCB algorithm with a high-probability regret bound of order O(sqrt((d+m)KT)).
Anticipatory Reinforcement Learning: From Generative Path-Laws to Distributional Value Functions cs.LG · 2026-04-06 · unverdicted · none · ref 3
ARL lifts states into signature-augmented manifolds and employs self-consistent proxies of future path-laws to enable deterministic expected-return evaluation while preserving contraction mappings in jump-diffusion environments.

Title resolution pending

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer