pith. sign in

hub Mixed citations

Gpt-image-edit-1.5 m: A million-scale, gpt-generated image dataset

Mixed citation behavior. Most common role is background (57%).

12 Pith papers citing it
Background 57% of classified citations

hub tools

citation-role summary

dataset 4 background 3

citation-polarity summary

fields

cs.CV 11 cs.GR 1

years

2026 9 2025 3

verdicts

UNVERDICTED 12

representative citing papers

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

cs.CV · 2026-04-06 · unverdicted · novelty 6.0

SpatialEdit provides a benchmark, large synthetic dataset, and baseline model for precise object and camera spatial manipulations in images, with the model beating priors on spatial editing.

Emu3.5: Native Multimodal Models are World Learners

cs.CV · 2025-10-30 · unverdicted · novelty 6.0

Emu3.5 is a native multimodal world model pre-trained on over 10 trillion vision-language tokens with next-token prediction, post-trained via reinforcement learning, and accelerated by Discrete Diffusion Adaptation for efficient interleaved generation and world exploration.

Bernini: Latent Semantic Planning for Video Diffusion

cs.CV · 2026-05-21 · unverdicted · novelty 5.0

Bernini is a framework that uses an MLLM planner to output semantic representations for a DiT renderer to generate or edit videos, reporting SOTA benchmark performance.

FineEdit: Fine-Grained Image Edit with Bounding Box Guidance

cs.CV · 2026-04-13 · unverdicted · novelty 5.0

FineEdit adds multi-level bounding box injection to diffusion image editing, releases a 1.2M-pair dataset with box annotations, and shows better instruction following and background consistency than prior open models on new and existing benchmarks.

citing papers explorer

Showing 12 of 12 citing papers.