Robust asymmetric loss for multi-label long-tailed learning
1 Pith paper cites this work. Polarity classification is still indexing.
Pith papers citing it: 1
Fields: cs.CV (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
VISTA: Validation-Guided Integration of Spatial and Temporal Foundation Models with Anatomical Decoding for Rare-Pathology VCE Event Detection -- after competition results
VISTA combines EndoFM-LV and DINOv3 with a Diverse Head Ensemble, Validation-Guided Weighted Fusion, and Anatomy-Aware Temporal Event Decoding, reaching 0.3726 mAP@0.5 on the hidden test set for rare-pathology VCE event detection after post-competition threshold refinement.