Canonical reference

RoboMonkey: Scaling test-time sampling and verification for vision-language-action models

Canonical reference. 100% of classified citations from Pith papers cite this work as background.

9 Pith papers citing it
Background 100% of classified citations

citation summary

years: 2026 (8), 2024 (1)
verdicts: unverdicted (9)
roles: background (5)
polarities: background (5)

representative citing papers

When to Act, Ask, or Learn: Uncertainty-Aware Policy Steering

cs.RO · 2026-02-25 · unverdicted · novelty 7.0

The UPS framework uses conformal prediction to calibrate VLM verifiers, which then choose among three options: executing a high-confidence action, asking a natural-language question about the task, or requesting a policy intervention. Residual learning from those interventions continually improves the base policy with minimal feedback.

FASTER: Value-Guided Sampling for Fast RL

cs.LG · 2026-04-21 · unverdicted · novelty 6.0

FASTER models multi-candidate denoising as an MDP and trains a value function to filter actions early, delivering the performance of full sampling at lower cost in diffusion RL policies.

A Survey on Vision-Language-Action Models for Embodied AI

cs.RO · 2024-05-23 · unverdicted · novelty 6.0

The first survey on vision-language-action models for embodied AI. It provides a taxonomy organized along three lines of research, plus summaries of datasets, simulators, benchmarks, challenges, and future directions.
