pith. sign in

hub

Rlinf: Flexible and efficient large-scale reinforcement learning via macro-to-micro flow transformation

16 Pith papers cite this work. Polarity classification is still indexing.

16 Pith papers citing it

hub tools

citation-role summary

background 2 other 1

citation-polarity summary

years

2026 15 2025 1

polarities

background 2 unclear 1

clear filters

representative citing papers

Reinforcing VLAs in Task-Agnostic World Models

cs.AI · 2026-05-12 · unverdicted · novelty 6.0 · 2 refs

RAW-Dream disentangles world-model learning from task data by using a pre-trained task-agnostic world model and VLM rewards, with dual-noise filtering, to enable zero-shot VLA adaptation in simulation and real settings.

TacCoRL: Integrating Tactile Feedback into VLA via Simulation

cs.RO · 2026-06-10 · unverdicted · novelty 5.0

TacCoRL integrates tactile feedback into VLA policies via real-aligned simulation co-training and RL, raising average success from 50% to 72.5% on four bimanual contact-rich tasks with direct real-robot transfer.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Reinforcing VLAs in Task-Agnostic World Models cs.AI · 2026-05-12 · unverdicted · none · ref 41 · 2 links

    RAW-Dream disentangles world-model learning from task data by using a pre-trained task-agnostic world model and VLM rewards, with dual-noise filtering, to enable zero-shot VLA adaptation in simulation and real settings.