In our experiments, we instantiate RL-VLM-F using the same OpenAI GPT-5-mini model as for GVL

Preference Prediction a) RL-VLM-F: RL-VLM-F [35] predicts trajectory preferences by prompting a closed-source vision–language model to compare the final frame of two trajectories co

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons

cs.RO · 2026-03-02 · unverdicted · novelty 6.0

Robometer combines intra-trajectory progress supervision with inter-trajectory preference supervision on a 1M-trajectory dataset to learn more generalizable robotic reward functions than prior methods.

citing papers explorer

Showing 1 of 1 citing paper.

Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons cs.RO · 2026-03-02 · unverdicted · none · ref 156

