pith. sign in

Instructvid2vid: Controllable video editing with natural language instructions

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.CV 2 cs.GR 1

years

2026 1 2024 2

verdicts

UNVERDICTED 3

roles

background 1

polarities

background 1

representative citing papers

Sound Sparks Motion: Audio and Text Tuning for Video Editing

cs.GR · 2026-05-14 · unverdicted · novelty 6.0

Sound Sparks Motion is a test-time tuning approach that adjusts audio and text conditioning signals in multimodal video models using VLM feedback to produce specific motion edits while preserving content.

Movie Gen: A Cast of Media Foundation Models

cs.CV · 2024-10-17 · unverdicted · novelty 5.0

A 30B-parameter transformer and related models generate high-quality videos and audio, claiming state-of-the-art results on text-to-video, video editing, personalization, and audio generation tasks.

citing papers explorer

Showing 3 of 3 citing papers.