Jafar: Jack up any feature at any resolution. arXiv preprint arXiv:2506.11136, 2025
2 Pith papers cite this work. Polarity classification is still indexing.
Fields: cs.CV (2)
Years: 2026 (2)
Verdicts: UNVERDICTED (2)
Representative citing papers:
MuRF fuses multi-resolution features from frozen vision foundation models at inference time to create stronger representations without any training.
Citing papers explorer
- CineMatte: Background Matting for Virtual Production and Beyond
  CineMatte uses a cross-attention design on a Siamese DINOv3 ViT plus a pretrained upsampler to produce robust mattes for virtual production, backed by a new non-synthetic 4K VP dataset that supports camera motion.
- MuRF: Unlocking the Multi-Scale Potential of Vision Foundation Models
  MuRF fuses multi-resolution features from frozen vision foundation models at inference time to create stronger representations without any training.