SLIP-RS introduces a Structured-Attribute Decoupling Paradigm with contrastive learning and a conformal reliability engine to create a 15M-attribute dataset for remote sensing pre-training.
Toward open vocabulary aerial object detection with clip-activated student-teacher learning
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.CV 2years
2026 2representative citing papers
WOW-Seg proposes a word-free open-world segmentation model using Mask2Token and Cascade Attention Mask modules, reporting 89.7 semantic similarity and 82.4 semantic IoU on LVIS with one-eighth the parameters of prior SOTA plus a new 7,662-class benchmark.
citing papers explorer
-
SLIP-RS: Structured-Attribute Language-Image Pre-Training for Remote Sensing Object Detection
SLIP-RS introduces a Structured-Attribute Decoupling Paradigm with contrastive learning and a conformal reliability engine to create a 15M-attribute dataset for remote sensing pre-training.
-
WOW-Seg: A Word-free Open World Segmentation Model
WOW-Seg proposes a word-free open-world segmentation model using Mask2Token and Cascade Attention Mask modules, reporting 89.7 semantic similarity and 82.4 semantic IoU on LVIS with one-eighth the parameters of prior SOTA plus a new 7,662-class benchmark.