OneHOI unifies HOI generation and editing in one conditional diffusion transformer using role-aware tokens, structured attention, and joint training on mixed datasets to reach SOTA on both tasks.
High-resolution image syn- thesis with latent diffusion models, 2021
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
fields
cs.CV 3representative citing papers
PercHead achieves state-of-the-art single-image 3D head reconstruction and editing by replacing low-level losses with a perceptual loss from DINOv2 and SAM 2.1 inside a Vision Transformer architecture.
citing papers explorer
-
OneHOI: Unifying Human-Object Interaction Generation and Editing
OneHOI unifies HOI generation and editing in one conditional diffusion transformer using role-aware tokens, structured attention, and joint training on mixed datasets to reach SOTA on both tasks.
-
PercHead: Perceptual Head Model for Single-Image 3D Head Reconstruction & Editing
PercHead achieves state-of-the-art single-image 3D head reconstruction and editing by replacing low-level losses with a perceptual loss from DINOv2 and SAM 2.1 inside a Vision Transformer architecture.
- LIFT and PLACE: A Simple, Stable, and Effective Knowledge Distillation Framework for Lightweight Diffusion Models