Accelerating multi-task temporal difference learning under low-rank representation

2025 · arXiv 2503.02030

1 Pith paper cites this work. Polarity classification is still being indexed.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

cs.LG · 2026-04-04 · unverdicted · novelty 7.0

A low-rank matrix estimation method in a reward-free RL framework learns shared representations across linear MDPs and yields near-optimal policies with characterized regret bounds under relaxed feature assumptions.

citing papers explorer

Showing 1 of 1 citing paper.

Provable Multi-Task Reinforcement Learning: A Representation Learning Framework with Low Rank Rewards

cs.LG · 2026-04-04 · unverdicted · none · ref 19
