Thus: π∗(a|gs) = 1 |A∗(gs)| = 1 |A∗(s)| =π ∗(g−1a|s).(46) Ifa /∈ A∗(gs), theng −1a /∈ A∗(s), both sides of the equation equal zero

Since gis a bijection, the cardinality is preserved:|A ∗(gs)|=|gA ∗(s)|=|A ∗(s)| · 2012

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

Reflex formalizes axial and bilateral reflection symmetries and adds symmetry regularization to PPO and SAC, reporting better performance and sample efficiency on Gym and DMC benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

Reflex: Reinforcement Learning with Reflection Symmetry Exploitation in State-Based Continuous Control cs.LG · 2026-05-22 · unverdicted · none · ref 14
Reflex formalizes axial and bilateral reflection symmetries and adds symmetry regularization to PPO and SAC, reporting better performance and sample efficiency on Gym and DMC benchmarks.

Thus: π∗(a|gs) = 1 |A∗(gs)| = 1 |A∗(s)| =π ∗(g−1a|s).(46) Ifa /∈ A∗(gs), theng −1a /∈ A∗(s), both sides of the equation equal zero

fields

years

verdicts

representative citing papers

citing papers explorer