CineMatte uses a cross-attention design on a Siamese DINOv3 ViT plus a pretrained upsampler to produce robust mattes for virtual production, backed by a new non-synthetic 4K VP dataset that supports camera motion.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
representative citing papers
MP-ViT uses dual transformers and cross-attention on axial and sagittal MRI to classify hemorrhages, reporting 5.5% higher AUC than standard ViT and 1.8% higher than CNNs on a dataset of 12,869 subjects.
citing papers explorer
-
CineMatte: Background Matting for Virtual Production and Beyond
CineMatte uses a cross-attention design on a Siamese DINOv3 ViT plus a pretrained upsampler to produce robust mattes for virtual production, backed by a new non-synthetic 4K VP dataset that supports camera motion.
-
Multi-Plane Vision Transformer for Hemorrhage Classification Using Axial and Sagittal MRI Data
MP-ViT uses dual transformers and cross-attention on axial and sagittal MRI to classify hemorrhages, reporting 5.5% higher AUC than standard ViT and 1.8% higher than CNNs on a dataset of 12,869 subjects.