pith. sign in

Mixed citations

Text2reward: Automated dense reward function generation for reinforcement learning

Mixed citation behavior. Most common role is background (67%).

8 Pith papers citing it
Background 67% of classified citations

citation-role summary

background 4 method 2

citation-polarity summary

representative citing papers

Automatic Generation of High-Performance RL Environments

cs.LG · 2026-03-12 · conditional · novelty 7.0

Closed-loop prompt-based translation with hierarchical verification and iterative repair produces equivalent high-performance RL environments across five cases including new TCGJax.

citing papers explorer

Showing 8 of 8 citing papers.