pith. sign in

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

fields

cs.CV 7 cs.MM 1

years

2026 8

verdicts

UNVERDICTED 8

roles

background 2

polarities

background 2

clear filters

representative citing papers

Beyond Text Prompts: Visual-to-Visual Generation as A Unified Paradigm

cs.CV · 2026-05-12 · unverdicted · novelty 7.0 · 2 refs

Proposes V2V-Zero, a training-free framework replacing text conditioning with VLM final-layer hidden states from visual pages, achieving 0.85 on GenEval and 32.7/100 on new Simple-V2V Bench across models including video extension.

citing papers explorer

Showing 8 of 8 citing papers after filters.