A geometry-aware 4D video generation model trained with cross-view pointmap alignment to produce spatio-temporally consistent future videos from novel viewpoints for robot manipulation.
Learning transferable visual models from natural language supervision
2 Pith papers cite this work. Polarity classification is still indexing.
years
2025 2verdicts
UNVERDICTED 2representative citing papers
SMPL-GPTexture uses text-to-image generation to produce dual-view human images, aligns them to SMPL meshes via 2D-to-3D recovery, projects colors to UV space, and applies diffusion inpainting to create full high-resolution textures aligned to user prompts.
citing papers explorer
-
Geometry-aware 4D Video Generation for Robot Manipulation
A geometry-aware 4D video generation model trained with cross-view pointmap alignment to produce spatio-temporally consistent future videos from novel viewpoints for robot manipulation.
-
SMPL-GPTexture: Dual-View 3D Human Texture Estimation using Text-to-Image Generation Models
SMPL-GPTexture uses text-to-image generation to produce dual-view human images, aligns them to SMPL meshes via 2D-to-3D recovery, projects colors to UV space, and applies diffusion inpainting to create full high-resolution textures aligned to user prompts.