Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
2 Pith papers cite this work. Polarity classification is still indexing.
Pith papers citing it: 2
Fields: cs.CV (2)
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
Representative citing papers:
Smartphone video pipeline fuses monocular depth estimation, instance segmentation, and SLAM to recover tree DBH and positions in circular plots with MAEs of 1.51 cm and 2.30 cm in managed and natural forests.
Citing papers:
- iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance
  iTryOn is a video diffusion Transformer that injects spatial 3D hand guidance and semantic action captions to enable interactive garment replacement in videos.
- Smartphone-based Circular Plot Sampling for Forest Inventory
  Smartphone video pipeline fuses monocular depth estimation, instance segmentation, and SLAM to recover tree DBH and positions in circular plots with MAEs of 1.51 cm and 2.30 cm in managed and natural forests.