A single motion module trained on videos adds temporally coherent animation to any personalized text-to-image model derived from the same base without additional tuning.
Taming encoder for zero fine-tuning image customization with text-to-image diffusion models
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CV 3verdicts
UNVERDICTED 3representative citing papers
HyperExpress extracts composable intrinsic concepts from single images via hyperbolic concept learning and concept-wise optimization in diffusion-based models.
OPAD enables reliable high-quality personalization of one-step diffusion models via multi-step teacher distillation combined with adversarial alignment losses.
citing papers explorer
-
AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning
A single motion module trained on videos adds temporally coherent animation to any personalized text-to-image model derived from the same base without additional tuning.
-
Intrinsic Concept Extraction Based on Compositional Interpretability
HyperExpress extracts composable intrinsic concepts from single images via hyperbolic concept learning and concept-wise optimization in diffusion-based models.
-
Adversarial Concept Distillation for One-Step Diffusion Personalization
OPAD enables reliable high-quality personalization of one-step diffusion models via multi-step teacher distillation combined with adversarial alignment losses.