PhysLayer is a framework that decomposes images into depth layers, simulates physics with depth awareness, and synthesizes videos guided by language for more plausible animations.
Learning transferable visual models from natural language supervision
3 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CV 3verdicts
UNVERDICTED 3representative citing papers
FreeGraftor performs subject-driven text-to-image generation without training by cross-image feature grafting via semantic matching, position-constrained attention fusion, and a noise initialization strategy that preserves reference geometry.
Proposes GLIA framework to adapt Vision Transformers for blind image quality assessment via dual-stream global-local interaction, claiming higher accuracy and robustness with reduced parameters.
citing papers explorer
-
PhysLayer: Language-Guided Layered Animation with Depth-Aware Physics
PhysLayer is a framework that decomposes images into depth layers, simulates physics with depth awareness, and synthesizes videos guided by language for more plausible animations.
-
FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation
FreeGraftor performs subject-driven text-to-image generation without training by cross-image feature grafting via semantic matching, position-constrained attention fusion, and a noise initialization strategy that preserves reference geometry.
-
Unleashing Vision Transformer Potential In Image Quality Assessment via Global-Local Adaptive Interaction
Proposes GLIA framework to adapt Vision Transformers for blind image quality assessment via dual-stream global-local interaction, claiming higher accuracy and robustness with reduced parameters.