pith. sign in

Mixed citations

Improving image generation with better captions.Computer Science

Mixed citation behavior. Most common role is background (60%).

8 Pith papers citing it
Background 60% of classified citations

citation-role summary

background 3 baseline 2

citation-polarity summary

fields

cs.CV 8

years

2026 6 2025 2

verdicts

UNVERDICTED 8

representative citing papers

Lance: Unified Multimodal Modeling by Multi-Task Synergy

cs.CV · 2026-05-18 · unverdicted · novelty 6.0 · 2 refs

Lance presents a dual-stream mixture-of-experts model with modality-aware positional encoding and staged multi-task training that outperforms prior open-source unified models on image and video generation while keeping strong understanding performance.

Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation

cs.CV · 2025-05-08 · unverdicted · novelty 6.0

Mogao presents a causal unified model with deep fusion, dual encoders, and interleaved position embeddings that achieves strong performance on multi-modal understanding, text-to-image generation, and coherent interleaved outputs including zero-shot editing.

citing papers explorer

Showing 8 of 8 citing papers.