pith. sign in

Nohumansrequired: Autonomous high-quality image editing triplet mining

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 1 baseline 1

citation-polarity summary

fields

cs.CV 5

years

2026 3 2025 2

verdicts

UNVERDICTED 5

representative citing papers

VideoCoF: Unified Video Editing with Temporal Reasoner

cs.CV · 2025-12-08 · unverdicted · novelty 7.0

VideoCoF adds an explicit reasoning step using edit-region latents in video diffusion models to enable precise mask-free editing and motion alignment with only 50k training pairs.

Bernini: Latent Semantic Planning for Video Diffusion

cs.CV · 2026-05-21 · unverdicted · novelty 5.0

Bernini is a framework that uses an MLLM planner to output semantic representations for a DiT renderer to generate or edit videos, reporting SOTA benchmark performance.

citing papers explorer

Showing 5 of 5 citing papers.