pith. sign in

hub Canonical reference

Masquerade: Learning from in-the-wild human videos using data-editing

Canonical reference. 71% of citing Pith papers cite this work as background.

12 Pith papers citing it
Background 71% of classified citations

hub tools

citation-role summary

background 5 method 1 other 1

citation-polarity summary

fields

cs.RO 10 cs.CV 2

years

2026 11 2025 1

verdicts

UNVERDICTED 12

representative citing papers

Bridging the Embodiment Gap: Disentangled Cross-Embodiment Video Editing

cs.RO · 2026-05-05 · unverdicted · novelty 6.0

A dual-contrastive disentanglement method factorizes videos into independent task and embodiment latents, then uses a parameter-efficient adapter on a frozen video diffusion model to synthesize robot executions from single human demonstrations without paired data.

GazeVLA: Learning Human Intention for Robotic Manipulation

cs.RO · 2026-04-24 · unverdicted · novelty 6.0

GazeVLA pretrains on large human egocentric datasets to capture gaze-based intention, then finetunes on limited robot data with chain-of-thought reasoning to achieve better robotic manipulation performance than baselines.

citing papers explorer

Showing 12 of 12 citing papers.