Geneval: An object-focused framework for evaluating text- to-image alignment

Dhruba Ghosh, Hannaneh Hajishirzi, Ludwig Schmidt · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

AIA: Rethinking Architecture Decoupling Strategy In Unified Multimodal Model

cs.CV · 2025-11-27 · unverdicted · novelty 7.0

AIA loss teaches unified multimodal models task-specific cross-modal attention patterns to reduce conflicts between image understanding and generation without architecture decoupling.

citing papers explorer

Showing 1 of 1 citing paper.

AIA: Rethinking Architecture Decoupling Strategy In Unified Multimodal Model cs.CV · 2025-11-27 · unverdicted · none · ref 12
AIA loss teaches unified multimodal models task-specific cross-modal attention patterns to reduce conflicts between image understanding and generation without architecture decoupling.

Geneval: An object-focused framework for evaluating text- to-image alignment

fields

years

verdicts

representative citing papers

citing papers explorer