pith. sign in

Adding conditional control to text-to-image diffusion models

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.CV 3

years

2026 1 2025 2

verdicts

UNVERDICTED 3

roles

background 1

polarities

background 1

representative citing papers

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

cs.CV · 2025-09-16 · unverdicted · novelty 7.0

MapAnything is a unified feed-forward transformer that regresses metric 3D scene geometry and cameras from images using a factored representation of depth maps, ray maps, poses, and scale.

Learning Zero-Shot Subject-Driven Video Generation Using 1% Compute

cs.CV · 2025-04-23 · unverdicted · novelty 6.0

A zero-shot subject-driven video generation framework that decomposes the task into identity injection from 200K subject-image pairs and motion preservation from 4K arbitrary videos, trained in 288 A100 GPU hours on CogVideoX-5B to match prior performance at 1% compute.

citing papers explorer

Showing 3 of 3 citing papers.